Jonathan Le Roux

According to our database1, Jonathan Le Roux authored at least 90 papers between 2005 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Phasebook and Friends: Leveraging Discrete Representations for Source Separation.
J. Sel. Topics Signal Processing, 2019

Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity.
CoRR, 2019

WHAM!: Extending Speech Separation to Noisy Environments.
CoRR, 2019

Universal Sound Separation.
CoRR, 2019

Class-conditional Embeddings for Music Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2019

The Phasebook: Building Complex Masks via Discrete Representations for Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

SDR - Half-baked or Well Done?
Proceedings of the IEEE International Conference on Acoustics, 2019

Triggered Attention for End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Cycle-consistency Training for End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Teacher-student Deep Clustering for Low-delay Single Channel Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Class-conditional embeddings for music source separation.
CoRR, 2018

SDR - half-baked or well done?
CoRR, 2018

Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures.
CoRR, 2018

Cycle-consistency training for end-to-end speech recognition.
CoRR, 2018

Phasebook and Friends: Leveraging Discrete Representations for Source Separation.
CoRR, 2018

A Purely End-to-end System for Multi-speaker Speech Recognition.
CoRR, 2018

End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction.
CoRR, 2018

Phase Reconstruction with Learned Time-Frequency Representations for Single-Channel Speech Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction.
Proceedings of the Interspeech 2018, 2018

Alternative Objective Functions for Deep Clustering.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Channel Deep Clustering: Discriminative Spectral and Spatial Embeddings for Speaker-Independent Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

End-to-End Multi-Speaker Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Purely End-to-End System for Multi-speaker Speech Recognition.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Duration-Controlled LSTM for Polyphonic Sound Event Detection.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Prior-based Binary Masking and Discriminative Methods for Reverberant and Noisy Speech Recognition Using Distant Stereo Microphones.
JIP, 2017

Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend.
Computer Speech & Language, 2017

Consistent anisotropic Wiener filtering for audio source separation.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Coupled Initialization of Multi-Channel Non-Negative Matrix Factorization Based on Spatial and Spectral Information.
Proceedings of the Interspeech 2017, 2017

Student-teacher network learning with enhanced features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep clustering and conventional networks for music separation: Stronger together.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Novel Deep Architectures in Speech Processing.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Deep Recurrent Networks for Separation and Recognition of Single-Channel Speech in Nonstationary Background Audio.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Full-Capacity Unitary Recurrent Neural Networks.
CoRR, 2016

Deep Clustering and Conventional Networks for Music Separation: Stronger Together.
CoRR, 2016

Single-Channel Multi-Speaker Separation using Deep Clustering.
CoRR, 2016

Dialog state tracking with attention-based sequence-to-sequence learning.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Full-Capacity Unitary Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Single-Channel Multi-Speaker Separation Using Deep Clustering.
Proceedings of the Interspeech 2016, 2016

Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks.
Proceedings of the Interspeech 2016, 2016

Deep unfolding for multichannel source separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep clustering: Discriminative embeddings for segmentation and separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Phase Processing for Single-Channel Speech Enhancement: History and recent advances.
IEEE Signal Process. Mag., 2015

Deep clustering: Discriminative embeddings for segmentation and separation.
CoRR, 2015

Micbots: Collecting large realistic datasets for speech and audio research using mobile robots.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Deep NMF for speech separation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Deep Unfolding: Model-Based Inspiration of Novel Deep Architectures.
CoRR, 2014

Discriminative NMF and its application to single-channel source separation.
Proceedings of the INTERSPEECH 2014, 2014

Sequential maximum mutual information linear discriminant analysis for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Black box optimization for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Non-negative source-filter dynamical system for speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2014

Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Discriminatively trained recurrent neural networks for single-channel speech separation.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Sequence discriminative training for low-rank deep neural networks.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Consistent Wiener Filtering for Audio Source Separation.
IEEE Signal Process. Lett., 2013

Block Coordinate Descent for Sparse NMF
Proceedings of the 1st International Conference on Learning Representations, 2013

Hierarchical and coupled non-negative dynamical systems with application to audio modeling.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Ensemble learning for speech enhancement.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Statistical Dialogue Management using Intention Dependency Graph.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines.
Proceedings of the IEEE International Conference on Acoustics, 2013

Source localization in reverberant environments using sparse optimization.
Proceedings of the IEEE International Conference on Acoustics, 2013

Non-negative dynamical system with application to speech and audio.
Proceedings of the IEEE International Conference on Acoustics, 2013

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

A generalized discriminative training framework for system combination.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Indirect model-based speech enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Factorial Models for Noise Robust Speech Recognition.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
Computational auditory induction as a missing-data model-fitting problem with Bregman divergence.
Speech Communication, 2011

Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Infinite-state spectrum model for music signal analysis.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks.
Proceedings of the Advances in Music Information Retrieval, 2010

A statistical model of speech F0 contours.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2010

Consistent Wiener Filtering: Generalized Time-Frequency Masking Respecting Spectrogram Consistency.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Nonnegative Matrix Factorization with Markov-Chained Bases for Modeling Time-Varying Patterns in Music Spectrograms.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

Statistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

2008
Adaptive Template Matching with Shift-Invariant Semi-NMF.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Computational auditory induction by missing-data non-negative matrix factorization.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2008

Modulation analysis of speech through orthogonal FIR filterbank optimization.
Proceedings of the IEEE International Conference on Acoustics, 2008

Separation of a monaural audio signal into harmonic/percussive components by complementary diffusion on spectrogram.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments.
IEEE Trans. Audio, Speech & Language Processing, 2007

Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error.
IEEE Trans. Audio, Speech & Language Processing, 2007

Harmonic-Temporal Clustering of Speech for Single and Multiple F0 Contour Estimation in Noisy Environments.
Proceedings of the IEEE International Conference on Acoustics, 2007

MEG Signal Denoising Based on Time-Shift PCA.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Speech analyzer using a joint estimation model of spectral envelope and fine structure.
Proceedings of the INTERSPEECH 2006, 2006

2005
Optimization methods for discriminative training.
Proceedings of the INTERSPEECH 2005, 2005


  Loading...