CNN architectures for large-scale audio classification S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ... 2017 ieee international conference on acoustics, speech and signal …, 2017 | 3062 | 2017 |
Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation A Ephrat, I Mosseri, O Lang, T Dekel, K Wilson, A Hassidim, WT Freeman, ... arXiv preprint arXiv:1804.03619, 2018 | 899 | 2018 |
Learning the speech front-end with raw waveform CLDNNs. TN Sainath, RJ Weiss, AW Senior, KW Wilson, O Vinyals Interspeech, 1-5, 2015 | 621 | 2015 |
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ... arXiv preprint arXiv:1810.04826, 2018 | 428 | 2018 |
Speech denoising using nonnegative matrix factorization with priors KW Wilson, B Raj, P Smaragdis, A Divakaran 2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008 | 370 | 2008 |
Speech acoustic modeling from raw multichannel waveforms Y Hoshen, RJ Weiss, KW Wilson 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 283 | 2015 |
Multichannel signal processing with deep neural networks for automatic speech recognition TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017 | 274 | 2017 |
Processing multi-channel audio waveforms TN Sainath, RJ Weiss, KW Wilson, AW Senior, A Narayanan, Y Hoshen, ... US Patent 9,697,826, 2017 | 250 | 2017 |
Universal sound separation I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ... 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019 | 217 | 2019 |
Acoustic Modeling for Google Home. B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ... Interspeech, 399-403, 2017 | 205 | 2017 |
Unsupervised sound separation using mixture invariant training S Wisdom, E Tzinis, H Erdogan, R Weiss, K Wilson, J Hershey Advances in neural information processing systems 33, 3846-3857, 2020 | 201 | 2020 |
Neural network adaptive beamforming for robust multichannel speech recognition. B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani Interspeech, 1976-1980, 2016 | 150 | 2016 |
Regularized non-negative matrix factorization with temporal dependencies for speech denoising. KW Wilson, B Raj, P Smaragdis Interspeech, 411-414, 2008 | 130 | 2008 |
Differentiable consistency constraints for improved deep speech enhancement S Wisdom, JR Hershey, K Wilson, J Thorpe, M Chinen, B Patton, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 122 | 2019 |
Visual speech recognition with loosely synchronized feature streams K Saenko, K Livescu, M Siracusa, K Wilson, J Glass, T Darrell Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2 …, 2005 | 120 | 2005 |
Low latency video storyboard delivery with selectable resolution levels NO Krahnstoever, KW Wilson US Patent App. 13/785,913, 2014 | 110 | 2014 |
VoiceFilter-Lite: Streaming targeted voice separation for on-device speech recognition Q Wang, IL Moreno, M Saglam, K Wilson, A Chiao, R Liu, Y He, W Li, ... arXiv preprint arXiv:2009.04323, 2020 | 101 | 2020 |
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech B Patton, Y Agiomyrgiannakis, M Terry, K Wilson, RA Saurous, D Sculley arXiv preprint arXiv:1611.09207, 2016 | 99 | 2016 |
Multiple person and speaker activity tracking with a particle filter N Checka, KW Wilson, MR Siracusa, T Darrell 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 99 | 2004 |
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani, A Senior 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 94 | 2015 |