Manuel Sam Ribeiro

According to our database, Manuel Sam Ribeiro authored at least 22 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Multilingual context-based pronunciation learning for Text-to-Speech.
CoRR, 2023

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech.
CoRR, 2023

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings.
CoRR, 2023

2022
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks.
Proceedings of the Interspeech 2022, 2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation.
Proceedings of the Interspeech 2022, 2022

Cross-Speaker Style Transfer for Text-to-Speech Using Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Voice Filter: Few-Shot Text-to-Speech Speaker Adaptation Using Voice Conversion as a Post-Processing Module.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors.
Speech Commun., 2021

Automatic audiovisual synchronisation for ultrasound tongue imaging.
Speech Commun., 2021

TaL: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Silent versus Modal Multi-Speaker Speech Recognition from Ultrasound and Video.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2019
Ultrasound Tongue Imaging for Diarization and Alignment of Child Speech Therapy Sessions.
Proceedings of the Interspeech 2019, 2019

Synchronising Audio and Ultrasound by Learning Cross-Modal Embeddings.
Proceedings of the Interspeech 2019, 2019

Speaker-independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions.
Proceedings of the Interspeech 2018, 2018

2017
Learning Word Vector Representations Based on Acoustic Counts.
Proceedings of the Interspeech 2017, 2017

2016
Parallel and cascaded deep neural networks for text-to-speech synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis.
Proceedings of the Interspeech 2016, 2016

The SIWIS Database: A Multilingual Speech Database with Acted Emphasis.
Proceedings of the Interspeech 2016, 2016

Wavelet-based decomposition of F0 as a secondary task for DNN-based speech synthesis with multi-task learning.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis.
Proceedings of the Interspeech 2015, 2015

A multi-level representation of f0 using the continuous wavelet transform and the Discrete Cosine Transform.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

