Ron J. Weiss

According to our database, Ron J. Weiss authored at least 49 papers between 2006 and 2018.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2018
Hierarchical Generative Modeling for Controllable Speech Synthesis.
CoRR, 2018

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking.
CoRR, 2018

Synthesizing Diverse, High-Quality Audio Textures.
CoRR, 2018

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.
CoRR, 2018

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron.
CoRR, 2018

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron.
Proceedings of the 35th International Conference on Machine Learning, 2018

Multilingual Speech Recognition with a Single End-to-End Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

On Using Backpropagation for Speech Texture Generation and Voice Conversion.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

On Using Backpropagation for Speech Texture Generation and Voice Conversion.
CoRR, 2017

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
CoRR, 2017

State-of-the-art Speech Recognition With Sequence-to-Sequence Models.
CoRR, 2017

Multilingual Speech Recognition With A Single End-To-End Model.
CoRR, 2017

Sequence-to-Sequence Models Can Directly Transcribe Foreign Speech.
CoRR, 2017

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
CoRR, 2017

Online and Linear-Time Attention by Enforcing Monotonic Alignments.
CoRR, 2017

Sequence-to-Sequence Models Can Directly Translate Foreign Speech.
Proceedings of the Interspeech 2017, 2017

Online and Linear-Time Attention by Enforcing Monotonic Alignments.
Proceedings of the 34th International Conference on Machine Learning, 2017

CNN architectures for large-scale audio classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
CNN Architectures for Large-Scale Audio Classification.
CoRR, 2016

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.
Proceedings of the Interspeech 2016, 2016

Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Factored spatial and spectral multichannel raw waveform CLDNNs.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Learning the speech front-end with raw waveform CLDNNs.
Proceedings of the INTERSPEECH 2015, 2015

Speech acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Affinity Weighted Embedding.
Proceedings of the 31st International Conference on Machine Learning, 2014

2013
Affinity Weighted Embedding.
CoRR, 2013

Learning to rank recommendations with the k-order statistic loss.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

Nonlinear latent factorization by embedding multiple user interests.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

2012
Latent Collaborative Retrieval.
CoRR, 2012

Latent Collaborative Retrieval.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Combining localization cues and source model constraints for binaural source separation.
Speech Communication, 2011

Unsupervised Discovery of Temporal Structure in Music.
J. Sel. Topics Signal Processing, 2011

Evaluating music sequence models through missing data.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Model-Based Expectation-Maximization Source Separation and Localization.
IEEE Trans. Audio, Speech & Language Processing, 2010

Speech separation using speaker-adapted eigenvoice speech models.
Computer Speech & Language, 2010

Identifying Repeated Patterns in Music Using Sparse Convolutive Non-negative Matrix Factorization.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Clustering Beat-Chroma Patterns in a Large Music Database.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

2009
A variational EM algorithm for learning eigenvoice parameters in mixed signals.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Source separation based on binaural cues and source model constraints.
Proceedings of the INTERSPEECH 2008, 2008

DySANA: dynamic speech and noise adaptation for voice activity detection.
Proceedings of the INTERSPEECH 2008, 2008

2006
Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Model-Based Monaural Source Separation Using a Vector-Quantized Phase-Vocoder Representation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

