Kevin W. Wilson

According to our database1, Kevin W. Wilson authored at least 31 papers between 2002 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking.
CoRR, 2018

AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies.
CoRR, 2018

Exploring Tradeoffs in Models for Low-Latency Speech Enhancement.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies.
Proceedings of the Interspeech 2018, 2018

2017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017


CNN architectures for large-scale audio classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech.
CoRR, 2016

CNN Architectures for Large-Scale Audio Classification.
CoRR, 2016

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.
Proceedings of the Interspeech 2016, 2016

Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Factored spatial and spectral multichannel raw waveform CLDNNs.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Learning the speech front-end with raw waveform CLDNNs.
Proceedings of the INTERSPEECH 2015, 2015

Speech acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2010
Ungrounded independent non-negative factor analysis.
Proceedings of the INTERSPEECH 2010, 2010

Spectrogram dimensionality reductionwith independence constraints.
Proceedings of the IEEE International Conference on Acoustics, 2010

2008
Regularized non-negative matrix factorization with temporal dependencies for speech denoising.
Proceedings of the INTERSPEECH 2008, 2008

Speech denoising using nonnegative matrix factorization with priors.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
An SVM Framework for Genre-Independent Scene Change Detection.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Estimating uncertainty models for speech source localization in real-world environments.
PhD thesis, 2006

Learning a Precedence Effect-Like Weighting Function for the Generalized Cross-Correlation Framework.
IEEE Trans. Audio, Speech & Language Processing, 2006

2005
Visual Speech Recognition with Loosely Synchronized Feature Streams.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Improving audio source localization by learning the precedence effect.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Real-time audio-visual tracking for meeting analysis.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Multiple person and speaker activity tracking with a particle filter.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A multi-modal approach for determining speaker location and focus.
Proceedings of the 5th International Conference on Multimodal Interfaces, 2003

A Probabilistic Framework for Multi-modal Multi-Person Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2003

2002
Audiovisual Arrays for Untethered Spoken Interfaces.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Audio-video array source localization for intelligent environments.
Proceedings of the IEEE International Conference on Acoustics, 2002


  Loading...