Vincent Colotte

According to our database, Vincent Colotte authored at least 34 papers between 1998 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS.
CoRR, 2023

2022
Can We Use Common Voice to Train a Multi-Speaker TTS System?
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Analysis of expressivity transfer in non-autoregressive end-to-end multispeaker TTS systems.
Proceedings of the Interspeech 2022, 2022

Multi-stage attention for fine-grained expressivity transfer in multispeaker text-to-speech system.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
Learning emotions latent representation with CVAE for text-driven expressive audiovisual speech synthesis.
Neural Networks, 2021

Duration modelling and evaluation for Arabic statistical parametric speech synthesis.
Multim. Tools Appl., 2021

Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform.
Lang. Resour. Evaluation, 2020

Étude comparative des paramètres d'entrée pour la synthèse expressive audiovisuelle de la parole par DNNs (Comparative study of input parameters for DNN-based expressive audiovisual speech synthesis).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Deep Variational Metric Learning for Transfer of Expressivity in Multispeaker Text to Speech.
Proceedings of the Statistical Language and Speech Processing, 2020

Transfer Learning of the Expressivity Using FLOW Metric Learning in Multispeaker Text-to-Speech Synthesis.
Proceedings of the Interspeech 2020, 2020

2019
Conditional Variational Auto-Encoder for Text-Driven Expressive AudioVisual Speech Synthesis.
Proceedings of the Interspeech 2019, 2019

\(F_{0}\) Modeling Using DNN for Arabic Parametric Speech Synthesis.
Proceedings of the Recent Advances in Big Data and Deep Learning, 2019

2018
Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic.
Int. J. Speech Technol., 2018

DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation.
Proceedings of the Statistical Language and Speech Processing, 2018

2017
On the quality of an expressive audiovisual corpus: a case study of acted speech.
Proceedings of the Auditory-Visual Speech Processing, 2017

2016
The IFCASL Corpus of French and German Non-native and Native Read Speech.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech.
Proceedings of the Interspeech 2016, 2016

2014
Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2013
Acoustic-visual synthesis technique using bimodal unit-selection.
EURASIP J. Audio Speech Music. Process., 2013

Automatic feature selection for acoustic-visual concatenative speech synthesis: towards a perceptual objective measure.
Proceedings of the Auditory-Visual Speech Processing, 2013

2012
ViSAC: acoustic-visual speech synthesis: the system and its evaluation.
Proceedings of the Facial Analysis and Animation 2012, 2012

2011
Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

Introducing visual target cost within an acoustic-visual unit-selection speech synthesizer.
Proceedings of the Auditory-Visual Speech Processing, 2011

2010
Setup for acoustic-visual speech synthesis by concatenating bimodal units.
Proceedings of the INTERSPEECH 2010, 2010

HMM-based automatic visual speech segmentation using facial data.
Proceedings of the INTERSPEECH 2010, 2010

Towards a true acoustic-visual speech synthesis.
Proceedings of the Auditory-Visual Speech Processing, 2010

2005
Linguistic features weighting for a text-to-speech system without prosody model.
Proceedings of the INTERSPEECH 2005, 2005

2002
Techniques d'analyse et de synthèse de la parole appliquées à l'apprentissage des langues.
PhD thesis, 2002

Higher precision pitch marking for TD-PSOLA.
Proceedings of the 11th European Signal Processing Conference, 2002

2001
Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Automatic enhancement of speech intelligibility.
Proceedings of the IEEE International Conference on Acoustics, 2000

Detecting relevant acoustic events for piloting improvement of intelligibility.
Proceedings of the 10th European Signal Processing Conference, 2000

1998
Automatic pitch marking for speech transformations via TD-PSOLA.
Proceedings of the 9th European Signal Processing Conference, 1998

