Speaker and Noise Independent Online Single Channel Speech Enhancement
Title | Speaker and Noise Independent Online Single Channel Speech Enhancement |
Publication Type | Conference Paper |
Year of Publication | 2015 |
Authors | Germain, F. G., and G. J. Mysore |
Conference Name | 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Date Published | 04/2015 |
Publisher | IEEE |
Conference Location | Brisbane, Australia |
ISBN Number | 978-1-4673-6997-8 |
Accession Number | 15361677 |
Abstract | Desirable properties of real-world speech enhancement methods include online operation, single-channel operation, operation in the presence of a variety of noise types including non-stationary noise, and no requirement for isolated training examples of the specific speaker and noise type at hand. Methods in the literature typically possess only a subset of these properties. Source separation methods particularly rarely simultaneously possess the first and last properties. We extend universal speech model-based speech enhancement to adaptively learn a noise model in an online fashion. We learn a model from a general corpus of speech in place of speaker-dependent training examples before deployment. This setup provides all of these desirable properties, making it easy to deploy in real-world systems without the need to provide additional training examples, while explicitly modeling speech. Our experimental results show that our method achieves the same performance as in the case in which speaker-dependent training data is available. |
URL | https://ieeexplore.ieee.org/document/7177934/ |
DOI | 10.1109/ICASSP.2015.7177934 |
Refereed Designation | Refereed |
Full Text | https://ccrma.stanford.edu/~gautham/Site/Publications_files/GermainMysor... |