R. J. Skerry-Ryan

According to our database1, R. J. Skerry-Ryan authored at least 20 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
LMs with a Voice: Spoken Language Modeling beyond Speech Tokens.
CoRR, 2023

2022
Learning the joint distribution of two sequences using little or no paired data.
CoRR, 2022

Speaker Generation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.
CoRR, 2021

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Non-saturating GAN training as divergence minimization.
CoRR, 2020

Semi-Supervised Generative Modeling for Controllable Speech Synthesis.
Proceedings of the 8th International Conference on Learning Representations, 2020

Location-Relative Attention Mechanisms for Robust Long-Form Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis.
CoRR, 2019

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning.
Proceedings of the Interspeech 2019, 2019

Semi-supervised Training for Improving Data Efficiency in End-to-end Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Predicting Expressive Speaking Style from Text in End-To-End Speech Synthesis.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.
Proceedings of the 35th International Conference on Machine Learning, 2018

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron.
Proceedings of the 35th International Conference on Machine Learning, 2018

Complex Evolution Recurrent Neural Networks (ceRNNs).
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
CoRR, 2017

Uncovering Latent Style Factors for Expressive Speech Synthesis.
CoRR, 2017

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
CoRR, 2017



  Loading...