Thomas Merritt

According to our database, Thomas Merritt authored at least 30 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech.
CoRR, 2023

AE-Flow: Autoencoder Normalizing Flow.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Expressive, Variable, and Controllable Duration Modelling in TTS.
CoRR, 2022

Text-free non-parallel many-to-many voice conversion using normalising flows.
CoRR, 2022

Remap, Warp and Attend: Non-Parallel Many-to-Many Accent Conversion with Normalizing Flows.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion.
Proceedings of the Interspeech 2022, 2022

Creating New Voices using Normalizing Flows.
Proceedings of the Interspeech 2022, 2022

Expressive, Variable, and Controllable Duration Modelling in TTS.
Proceedings of the Interspeech 2022, 2022

Text-Free Non-Parallel Many-To-Many Voice Conversion Using Normalising Flow.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech.
CoRR, 2021

Low-Resource Expressive Text-To-Speech Using Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021

CAMP: A Two-Stage Approach to Modelling Prosody in Context.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Parallel WaveNet conditioned on VAE latent vectors.
CoRR, 2020

2019
In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Towards Achieving Robust Universal Neural Vocoding.
Proceedings of the Interspeech 2019, 2019

Effect of Data Reduction on Sequence-to-sequence Neural TTS.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Effect of data reduction on sequence-to-sequence neural TTS.
CoRR, 2018

Robust universal neural vocoding.
CoRR, 2018

Analysing Shortcomings of Statistical Parametric Speech Synthesis.
CoRR, 2018

Comprehensive Evaluation of Statistical Speech Waveform Synthesis.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

2017
Overcoming the limitations of statistical parametric speech synthesis.
PhD thesis, 2017

Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information.
Proceedings of the Interspeech 2017, 2017

2016
From HMMs to DNNs: Where do the improvements come from?
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep neural network-guided unit selection synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep neural network context embeddings for model selection in rich-context HMM synthesis.
Proceedings of the INTERSPEECH 2015, 2015

Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis.
Proceedings of the INTERSPEECH 2014, 2014

Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech.
Proceedings of the INTERSPEECH 2014, 2014

A flexible front-end for HTS.
Proceedings of the INTERSPEECH 2014, 2014

2013
Investigating the shortcomings of HMM synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

