Gérard Bailly

Comput. Speech Lang., 2026

2025

Data-Driven Control of Eye and Head Movements for Triadic Human-Robot Interactions.

[BibT_eX]

[DOI]

Léa Haefflinger

Int. J. Soc. Robotics, June, 2025

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, February, 2025

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, February, 2025

THERADIA WoZ: An Ecological Corpus for Appraisal-Based Affect Research in Healthcare.

[BibT_eX]

[DOI]

Franck Tarpin-Bernard

IEEE Trans. Affect. Comput., 2025

Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2025

Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, December, 2024

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, November, 2024

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, November, 2024

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, November, 2024

Ressources for End-to-End French Text-to-Speech Blizzard challenge.

[BibT_eX]

[DOI]

Dataset, October, 2024

Ressources for End-to-End French Text-to-Speech Blizzard challenge.

[BibT_eX]

[DOI]

Dataset, October, 2024

Speech/text alignments for Italian end-to-end TTS.

[BibT_eX]

[DOI]

Dataset, October, 2024

Entraînement de la coordination respiration-parole en apprentissage de la lecture assistée par ordinateur.

[BibT_eX]

[DOI]

Proceedings of the Actes des 35èmes Journées d'Études sur la Parole, 2024

FastLips: an End-to-End Audiovisual Text-to-Speech System with Lip Features Prediction for Virtual Avatars.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Training speech-breathing coordination in computer-assisted reading.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

EVAC 2024 - Empathic Virtual Agent Challenge: Appraisal-based Recognition of Affective States.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Multimodal Interaction, 2024

Impact of verbal instructions and deictic gestures of a cobot on the performance of human coworkers.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE-RAS International Conference on Humanoid Robots, 2024

Probing the Inductive Biases of a Gaze Model for Multi-party Interaction.

[BibT_eX]

[DOI]

Proceedings of the Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024

RoboTrio2: Annotated Interactions of a Teleoperated Robot and Human Dyads for Data-Driven Behavioral Models.

[BibT_eX]

[DOI]

Léa Haefflinger

Proceedings of the Workshops at the Third International Conference on Hybrid Human-Artificial Intelligence co-located with (HHAI 2024), 2024

Emotags: Computer-Assisted Verbal Labelling of Expressive Audiovisual Utterances for Expressive Multimodal TTS.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Ressources for End-to-End French Text-to-Speech Blizzard challenge.

[BibT_eX]

[DOI]

Dataset, January, 2023

Local Style Tokens: Fine-Grained Prosodic Representations For TTS Expressive Control.

[BibT_eX]

[DOI]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Advocating for text input in multi-speaker text-to-speech systems.

[BibT_eX]

[DOI]

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Data-Driven Generation of Eyes and Head Movements of a Social Robot in Multiparty Conversation.

[BibT_eX]

[DOI]

Proceedings of the Social Robotics - 15th International Conference, 2023

On the Benefit of Independent Control of Head and Eye Movements of a Social Robot for Multiparty Human-Robot Interaction.

[BibT_eX]

[DOI]

Proceedings of the Human-Computer Interaction, 2023

The Blizzard Challenge 2023.

[BibT_eX]

[DOI]

Proceedings of the 18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023, 2023

The GIPSA-Lab Text-To-Speech System for the Blizzard Challenge 2023.

[BibT_eX]

[DOI]

Proceedings of the 18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023, 2023

2022

Ressources for End-to-End French Text-to-Speech Blizzard challenge.

[BibT_eX]

[DOI]

Dataset, December, 2022

Ressources for End-to-End French Text-to-Speech Blizzard challenge.

[BibT_eX]

[DOI]

Dataset, November, 2022

Automatic assessment of oral readings of young pupils.

[BibT_eX]

[DOI]

Erika Godde

Anne-Laure Piat-Marchand

Speech Commun., 2022

Comparing NLP Solutions for the Disambiguation of French Heterophonic Homographs for End-to-End TTS Systems.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Automatic Verbal Depiction of a Brick Assembly for a Robot Instructing Humans.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Speaking Rate Control of end-to-end TTS Models by Direct Manipulation of the Encoder's Output Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Ressources for End-to-End French Text-to-Speech Blizzard challenge.

[BibT_eX]

[DOI]

Dataset, March, 2021

Impact of Segmentation and Annotation in French end-to-end Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Impact of Social Presence of Humanoid Robots: Does Competence Matter?

[BibT_eX]

[DOI]

Proceedings of the Social Robotics - 13th International Conference, 2021

Evaluating the Extrapolation Capabilities of Neural Vocoders to Extreme Pitch Values.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Characterizing and Assessing the Oral Reading Fluency of Young Readers.

[BibT_eX]

Proceedings of the Fifth International Conference, 2021

THERADIA: Digital Therapies Augmented by Artificial Intelligence.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neuroergonomics and Cognitive Engineering, 2021

2020

Predicting Multidimensional Subjective Ratings of Children' Readings from the Speech Signals for the Automatic Assessment of Fluency.

[BibT_eX]

[DOI]

Erika Godde

Anne-Laure Piat-Marchand

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019

Reading Prosody Development: Automatic Assessment for a Longitudinal Study.

[BibT_eX]

[DOI]

Erika Godde

Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

Transfer and Extraction of the Style of Handwritten Letters using Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on Agents and Artificial Intelligence, 2019

2018

Introduction to the special issue on auditory-visual expressive speech and gesture in humans and machines.

[BibT_eX]

[DOI]

Jeesun Kim

Chris Davis

Speech Commun., 2018

Audio-visual synchronization in reading while listening to texts: Effects on visual behavior and verbal learning.

[BibT_eX]

[DOI]

Emilie Gerbier

Comput. Speech Lang., 2018

Style Transfer and Extraction for the Handwritten Letters Using Deep Learning.

[BibT_eX]

[DOI]

CoRR, 2018

A Variational Prosody Model for the decomposition and synthesis of speech prosody.

[BibT_eX]

[DOI]

CoRR, 2018

Handwriting Styles: Benchmarks and Evaluation Metrics.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Social Networks Analysis, 2018

A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours.

[BibT_eX]

[DOI]

Branislav Gerazov

Yi Xu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Comparing Cascaded LSTM Architectures for Generating Head Motion from Speech in Task-Oriented Dialogs.

[BibT_eX]

[DOI]

Duc Canh Nguyen

Proceedings of the Human-Computer Interaction. Interaction Technologies, 2018

2017

Which prosodic features contribute to the recognition of dramatic attitudes?

[BibT_eX]

[DOI]

Adela Barbulescu

Rémi Ronfard

Speech Commun., 2017

Learning off-line vs. on-line models of interactive multimodal behaviors with recurrent neural networks.

[BibT_eX]

[DOI]

Duc Canh Nguyen

Pattern Recognit. Lett., 2017

Critical review of the book "Gaze in Human-Robot Communication".

[BibT_eX]

[DOI]

J. Multimodal User Interfaces, 2017

A Generative Audio-Visual Prosodic Model for Virtual Actors.

[BibT_eX]

[DOI]

Adela Barbulescu

Rémi Ronfard

IEEE Computer Graphics and Applications, 2017

Evaluation of reading performance of primary school children: Objective measurements vs. subjective ratings.

[BibT_eX]

[DOI]

Estelle Gillet-Perret

Proceedings of the 6th International Workshop on Child Computer Interaction, 2017

Improving fluency of young readers: introducing a Karaoke to learn how to breathe during a Reading-while-Listening task.

[BibT_eX]

[DOI]

Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017

Acquiring Human-Robot Interaction skills with Transfer Learning Techniques.

[BibT_eX]

[DOI]

Proceedings of the Companion of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, 2017

2016

Graphical models for social behavior modeling in face-to face interaction.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2016

Statistical conversion of silent articulation into audible speech using full-covariance HMM.

[BibT_eX]

[DOI]

Thomas Hueber

Comput. Speech Lang., 2016

Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Introduction to Poster Presentation of Part II.

[BibT_eX]

[DOI]

Jeesun Kim

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Characterization of Audiovisual Dramatic Attitudes.

[BibT_eX]

[DOI]

Adela Barbulescu

Rémi Ronfard

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Quantitative Analysis of Backchannels Uttered by an Interviewer During Neuropsychological Tests.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Conducting neuropsychological tests with a humanoid robot: Design and evaluation.

[BibT_eX]

[DOI]

Duc Canh Nguyen

Proceedings of the 7th IEEE International Conference on Cognitive Infocommunications, 2016

2015

Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression.

[BibT_eX]

[DOI]

Thomas Hueber

Laurent Girin

Xavier Alameda-Pineda

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Learning multimodal behavioral models for face-to-face social interaction.

[BibT_eX]

[DOI]

J. Multimodal User Interfaces, 2015

Design and Validation of a Talking Face for the iCub.

[BibT_eX]

[DOI]

Int. J. Humanoid Robotics, 2015

Using Karaoke to enhance reading while listening: impact on word memorization and eye movements.

[BibT_eX]

[DOI]

Emilie Gerbier

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

HMM training strategy for incremental speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Impact of iris size and eyelids coupling on the estimation of the gaze direction of a robotic talking head by human viewers.

[BibT_eX]

[DOI]

Francois R. Foerster

Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Beaming the Gaze of a Humanoid Robot.

[BibT_eX]

[DOI]

Miquel Sauze

Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, 2015

Audiovisual generation of social attitudes from neutral stimuli.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2015

2014

Beyond basic emotions: expressive virtual actors with social attitudes.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Conference on Motion in Games, Playa Vista, CA, USA, November 06, 2014

Assessing objective characterizations of phonetic convergence.

[BibT_eX]

[DOI]

Amélie Martin

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

An articulated talking face for the iCub.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Modeling perception-action loops: comparing sequential models with frame-based classifiers.

[BibT_eX]

[DOI]

Alaeddine Mihoub

Christian Wolf

Proceedings of the second international conference on Human-agent interaction, 2014

2013

Vizart3d - real-time system of visual articulatory feedback.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Speaker adaptation of an acoustic-articulatory inversion model using cascaded Gaussian mixture regressions.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Adaptation of respiratory patterns in collaborative reading.

[BibT_eX]

[DOI]

Amélie Rochet-Capellan

Coriandre Vilain

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Social Behavior Modeling Based on Incremental Discrete Hidden Markov Models.

[BibT_eX]

[DOI]

Alaeddine Mihoub

Christian Wolf

Proceedings of the Human Behavior Understanding - 4th International Workshop, 2013

Audio-visual speaker conversion using prosody features.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2013

2012

I Reach Faster When I See You Look: Gaze Effects in Human-Human and Human-Robot Face-to-Face Cooperation.

[BibT_eX]

[DOI]

Jocelyne Ventre-Dominey

Frontiers Neurorobotics, 2012

Vizart3D : Retour Articulatoire Visuel pour l'Aide à la Prononciation (Vizart3D: Visual Articulatory Feedack for Computer-Assisted Pronunciation Training) [in French].

[BibT_eX]

[DOI]

Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Cross-speaker Acoustic-to-Articulatory Inversion using Phone-based Trajectory HMM for Pronunciation Training.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface.

[BibT_eX]

[DOI]

Thomas Hueber

Bruce Denby

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Pauses and respiratory markers of the structure of book reading.

[BibT_eX]

[DOI]

Cécilia Gouvernayre

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

A pilot study on augmented speech communication based on Electro-Magnetic Articulography.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2011

Toward a Multi-Speaker Visual Articulatory Feedback System.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Synchronous Reading: Learning French Orthography by Audiovisual Training.

[BibT_eX]

[DOI]

Will Barbour

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Improvement to a NAM-captured whisper-to-speech system.

[BibT_eX]

[DOI]

Speech Commun., 2010

Speech and face-to-face communication - An introduction.

[BibT_eX]

[DOI]

Marion Dohen

Jean-Luc Schwartz

Speech Commun., 2010

Gaze, conversational agents and face-to-face communication.

[BibT_eX]

[DOI]

Speech Commun., 2010

Can you 'read' tongue movements? Evaluation of the contribution of tongue display to speech understanding.

[BibT_eX]

[DOI]

Speech Commun., 2010

On the importance of eye gaze in a face-to-face collaborative task.

[BibT_eX]

[DOI]

Proceedings of the 3rd international workshop on Affective interaction in natural environments, 2010

Facilitative effects of communicative gaze and speech in human-robot cooperation.

[BibT_eX]

[DOI]

Jean-David Boucher

Jocelyne Ventre-Dominey

Peter Ford Dominey

Proceedings of the 3rd international workshop on Affective interaction in natural environments, 2010

Can tongue be recovered from face? the answer of data-driven statistical models.

[BibT_eX]

[DOI]

Atef Ben Youssef

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speech dominoes and phonetic convergence.

[BibT_eX]

[DOI]

Amélie Lelong

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Exploiting multimodal data fusion in robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Speech, Gaze and Head Motion in a Face-to-Face Collaborative Task.

[BibT_eX]

Proceedings of the Electronic Speech Signal Processing, 2010

Study of the Phenomenon of Phonetic Convergence Thanks to Speech Dominoes.

[BibT_eX]

[DOI]

Amélie Lelong

Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

Acoustic-to-articulatory inversion in speech based on statistical models.

[BibT_eX]

[DOI]

Atef Ben Youssef

Proceedings of the Auditory-Visual Speech Processing, 2010

2009

Exploiting visual information for NAM recognition.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2009

Animating Virtual Speakers or Singers from Audio: Lip-Synching Facial Animation.

[BibT_eX]

[DOI]

Barry-John Theobald

EURASIP J. Audio Speech Music. Process., 2009

Lip-Synching Using Speaker-Specific Articulation, Shape and Appearance Models.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2009

Acoustic-to-articulatory inversion using speech recognition and trajectory formation based on phoneme hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Multimodal HMM-based NAM-to-speech conversion.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Improvement to a NAM captured whisper-to-speech system.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

LIPS2008: visual speech synthesis challenge.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

From 3-d speaker cloning to text-to-audiovisual-speech.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A trainable trajectory formation model TD-HMM parameterized for the LIPS 2008 challenge.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Can you "read tongue movements"?

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

German text-to-audiovisual-speech by 3-d speaker cloning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

Retargeting cued speech hand gestures for different talking heads and speakers.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

Speaking with smile or disgust: data and models.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

An Audiovisual Talking Head for Augmented Speech Generation: Models and Animations Based on a Real Speaker's Articulatory Data.

[BibT_eX]

[DOI]

Proceedings of the Articulated Motion and Deformable Objects, 5th International Conference, 2008

2007

Image and Video for Hearing Impaired People.

[BibT_eX]

[DOI]

Alice Caplier

Sébastien Stillittano

EURASIP J. Image Video Process., 2007

Learning optimal audiovisual phasing for an HMM-based control model for facial animation.

[BibT_eX]

[DOI]

Oxana Govokhina

Gaspard Breton

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Analyzing Gaze During Face-to-Face Interaction.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Virtual Agents, 7th International Conference, 2007

Scrutinizing Natural Scenes: Controlling the Gaze of an Embodied Conversational Agent.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Virtual Agents, 7th International Conference, 2007

Gaze Patterns during Face-to-Face Interaction.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/WIC/ACM International Conference on Web Intelligence and International Conference on Intelligent Agent Technology, 2007

Analyzing and modeling gaze during face-to-face interaction.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing 2007, 2007

Intelligibility of natural and 3d-cloned German speech.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing 2007, 2007

Towards eye gaze aware analysis and synthesis of audiovisual speech.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing 2007, 2007

2006

3D Semi-Landmarks Based Statistical Face Reconstruction.

[BibT_eX]

[DOI]

J. Comput. Inf. Technol., 2006

Rackham: An Interactive Robot-Guide.

[BibT_eX]

[DOI]

Proceedings of the 15th IEEE International Symposium on Robot and Human Interactive Communication, 2006

Embodied Conversational Agents: Computing and Rendering Realistic Gaze Patterns.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2006

Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User's Attention to Points of Interest?

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

A joint prosody evaluation of French text-to-speech synthesis systems.

[BibT_eX]

[DOI]

Marie-Neige Garcia

Michel Morel

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

TDA: a new trainable trajectory formation system for facial animation.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Evaluating a virtual speech cuer.

[BibT_eX]

[DOI]

Guillaume Gibert

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Generating German intonation with a trainable prosodic model.

[BibT_eX]

[DOI]

Jan Gorisch

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A new trainable trajectory formation system for facial animation.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2006

Evaluation of a virtual speech cuer.

[BibT_eX]

[DOI]

Guillaume Gibert

Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2006

Audiovisual speech enhancement experiments for mouth segmentation evaluation.

[BibT_eX]

[DOI]

Pierre Gacon

Pierre-Yves Coulon

Proceedings of the 14th European Signal Processing Conference, 2006

Statistical 3D Cranio-Facial Models.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Computer and Information Technology (CIT 2006), 2006

2005

SFC: A trainable prosodic model.

[BibT_eX]

[DOI]

Bleicke Holm

Speech Commun., 2005

Evaluating the pronunciation of proper names by four French grapheme-to-phoneme converters.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Statistical active model for mouth components segmentation.

[BibT_eX]

[DOI]

Pierre Gacon

Pierre-Yves Coulon

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Missing Data Estimation Using Polynomial Kernels.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Data Mining, 2005

Non-linear active model for mouth inner and outer contours detection.

[BibT_eX]

[DOI]

Pierre Gacon

Pierre-Yves Coulon

Proceedings of the 13th European Signal Processing Conference, 2005

Basic components of a face-to-face interaction with a conversational agent: mutual attention and deixis.

[BibT_eX]

[DOI]

Proceedings of the 2005 joint conference on Smart objects and ambient intelligence, 2005

Capturing data and realistic 3d models for cued speech analysis and audiovisual synthesis.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing 2005, 2005

2004

Tracking talking faces with shape and appearance models.

[BibT_eX]

[DOI]

Matthias Odisio

Speech Commun., 2004

Audiovisual text-to-cued speech synthesis.

[BibT_eX]

[DOI]

Guillaume Gibert

Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Evaluation of a Speech Cuer: From Motion Capture to a Concatenative Text-to-cued Speech System.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

A superposed prosodic model for Chinese text-to-speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Audiovisual perceptual evaluation of resynthesised speech movements.

[BibT_eX]

[DOI]

Matthias Odisio

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A trainable prosodic model: learning the contours implementing communicative functions within a superpositional model of intonation.

[BibT_eX]

[DOI]

Bleicke Holm

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

3D Meshes Registration: Application to Statistical Skull Model.

[BibT_eX]

[DOI]

Proceedings of the Image Analysis and Recognition: International Conference, 2004

Audiovisual text-to-cued speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003

Audiovisual Speech Synthesis.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2003

Close Shadowing Natural Versus Synthetic Speech.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2003

ISCA special session: hot topics in speech synthesis.

[BibT_eX]

[DOI]

Nick Campbell

Bernd Möbius

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Shape and appearance models of talking faces for model-based tracking.

[BibT_eX]

[DOI]

Matthias Odisio

Proceedings of the AVSP 2003, 2003

2002

Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images.

[BibT_eX]

[DOI]

J. Phonetics, 2002

Seeing tongue movements from outside.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Audiovisual speech synthesis. from ground truth to models.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Generating prosodic attitudes in French: Data, model and evaluation.

[BibT_eX]

[DOI]

Speech Commun., 2001

Close shadowing natural vs. synthetic speech.

[BibT_eX]

[DOI]

Proceedings of the 4th ITRW on Speech Synthesis, 2001

Visual synthesis.

[BibT_eX]

[DOI]

Proceedings of the 4th ITRW on Speech Synthesis, 2001

Creating and controlling video-realistic talking heads.

[BibT_eX]

[DOI]

Proceedings of the Auditory-Visual Speech Processing, 2001

2000

The Cost258 Signal Generation Test Array.

[BibT_eX]

[DOI]

Eduardo Rodríguez Banga

Alex I. C. Monaghan

Erhard Rank

Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

MOTHER: a new generation of talking heads providing a flexible articulatory control for video-realistic speech animation.

[BibT_eX]

[DOI]

Lionel Revéret

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Generating prosody by superposing multi-parametric overlapping contours.

[BibT_eX]

[DOI]

Bleicke Holm

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

Training an application-dependent prosodic model corpus, model and evaluation.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Accurate estimation of sinusoidal parameters in an harmonic+noise model for speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

Objective evaluation of grapheme to phoneme conversion for text-to-speech synthesis in French.

[BibT_eX]

[DOI]

François Yvon

Douglas D. O'Shaughnessy

Comput. Speech Lang., 1998

Evaluating the adeqnacy of synthetic prosody in signaling syntactic boundaries: methodology and first results.

[BibT_eX]

Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Evaluation of grapheme-to phoneme conversion for text-to-speech synthesis in French.

[BibT_eX]

François Yvon

Jean-Philippe Goldman

Eric Keller

Douglas D. O'Shaughnessy

Steve Pagel

F. Sannier

Jean Véronis

Brigitte Zellner Keller

Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Cooperation and competition of burst and formant transitions for the perception and identification of French stops.

[BibT_eX]

[DOI]

Adrian Neagu

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Synergy between jaw and lips/tongue movements : consequences in articulatory modelling.

[BibT_eX]

[DOI]

Anne Vilain

A three-dimensional linear articulatory model based on MRI data.

[BibT_eX]

[DOI]

1997

Learning to speak. Sensori-motor control of speech movements.

[BibT_eX]

[DOI]

Speech Commun., 1997

Relative contributions of noise burst and vocalic transitions to the perceptual identification of stop consonants.

[BibT_eX]

[DOI]

Adrian Neagu

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Synthesising attitudes with global rhythmic and intonation contours.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Synthesis of fricative consonants by audiovisual-to-articulatory inversion.

[BibT_eX]

[DOI]

Khaled Mawass

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Introduction to Part III.

[BibT_eX]

[DOI]

Proceedings of the Computing Prosody, 1997

1996

Generating intonation by superposing gestures.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Building sensori-motor prototypes from audiovisual exemplars.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995

Synthesis and evaluation of intonation with a superposition model.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Articulatori-acoustic vowel prototypes for speech production.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Generation of intonation: a global approach.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994

Characterisation of rhythmic patterns for text-to-speech synthesis.

[BibT_eX]

[DOI]

Plínio Barbosa

Speech Communication, 1994

Generation of pauses within the z-score model.

[BibT_eX]

[DOI]

Plínio Barbosa

Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Building prototypes for articulatory speech synthesis.

[BibT_eX]

[DOI]

Eric Castelli

Bernard Gabioud

Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

1993

Resonances as possible representation of speech in the auditory-to-articulatory transform.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

COMPOST: a client-server model for applications using text-to-speech systems.

[BibT_eX]

[DOI]

Mamoun Alissali

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1991

Synthesis-by-rule using compost: modelling resonance trajectories.

[BibT_eX]

[DOI]

M. Guerti

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990

Automatic labeling of large prosodic databases : tools, methodology and links with a text-to-speech system.

[BibT_eX]

[DOI]

Thierry Barbe

Hai-Dong Wang

Proceedings of the ESCA Workshop on Speech Synthesis, 1990

Generation of articulatory trajectories using sequential networks.

[BibT_eX]

[DOI]

Proceedings of the ESCA Workshop on Speech Synthesis, 1990

Automatic segmentation and alignment of continuous speech based on temporal decomposition model.

[BibT_eX]

[DOI]

Hai-Dong Wang

Denis Tuffelli

Proceedings of the First International Conference on Spoken Language Processing, 1990

1989

Integration of rhythmic and syntactic constraints in a model of generation of French prosody.

[BibT_eX]

[DOI]

Speech Commun., 1989

Compost: a rule-compiler for speech synthesis.

[BibT_eX]

[DOI]

A. Tran

Proceedings of the First European Conference on Speech Communication and Technology, 1989

A new algorithm for temporal decomposition of speech-application to a numerical model of coarticulation.

[BibT_eX]

[DOI]

Pierre-François Marteau

Christian Abry

Proceedings of the IEEE International Conference on Acoustics, 1989

1988

Stochastic model of diphone-like segments based on trajectory concepts.

[BibT_eX]

[DOI]

Pierre-François Marteau

M. T. Janot-Giorgetti

Proceedings of the IEEE International Conference on Acoustics, 1988

1986

Multiparametric generation of French prosody from unrestricted text.

[BibT_eX]

[DOI]