Vincent Wan

According to our database, Vincent Wan authored at least 44 papers between 1992 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks.
Proceedings of the Interspeech 2022, 2022

2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network.
Proceedings of the 36th International Conference on Machine Learning, 2019

2017
Google's Next-Generation Real-Time Unit-Selection Synthesizer Using Sequence-to-Sequence LSTM-Based Autoencoders.
Proceedings of the Interspeech 2017, 2017

2016
Expressive visual text-to-speech as an assistive technology for individuals with autism spectrum conditions.
Comput. Vis. Image Underst., 2016

2014
Building HMM-TTS Voices on Diverse Data.
IEEE J. Sel. Top. Signal Process., 2014

Speech intonation for TTS: study on evaluation methodology.
Proceedings of the INTERSPEECH 2014, 2014

Voice expression conversion with factorised HMM-TTS models.
Proceedings of the INTERSPEECH 2014, 2014

Generating multiple-accent pronunciations for TTS using joint sequence model interpolation.
Proceedings of the INTERSPEECH 2014, 2014

An initial investigation of long-term adaptation for meeting transcription.
Proceedings of the INTERSPEECH 2014, 2014

Cluster adaptive training of average voice models.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Noise robustness in HMM-TTS speaker adaptation.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

An expressive text-driven 3D talking head.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2013

Expressive Visual Text-to-Speech Using Active Appearance Models.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Transcribing Meetings With the AMIDA Systems.
IEEE Trans. Speech Audio Process., 2012

Combining multiple high quality corpora for improving HMM-TTS.
Proceedings of the INTERSPEECH 2012, 2012

Speech factorization for HMM-TTS based on cluster adaptive training.
Proceedings of the INTERSPEECH 2012, 2012

Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training.
Proceedings of the INTERSPEECH 2012, 2012

Unsupervised clustering of emotion and voice styles for expressive TTS.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Extending Audio Notetaker to Browse WebASR Transcriptions.
Proceedings of the INTERSPEECH 2011, 2011

2010
The AMIDA 2009 meeting transcription system.
Proceedings of the INTERSPEECH 2010, 2010

2009
Real-time ASR from meetings.
Proceedings of the INTERSPEECH 2009, 2009

2008
Bob: A lexicon and pronunciation dictionary generator.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Automatic speech recognition for scientific purposes - webASR.
Proceedings of the INTERSPEECH 2008, 2008

Combining neural network and rule-based systems for dysarthria diagnosis.
Proceedings of the INTERSPEECH 2008, 2008

2007
Towards capturing fine phonetic variation in speech using articulatory features.
Speech Commun., 2007

Can unquantised articulatory feature continuums be modelled?
Proceedings of the INTERSPEECH 2007, 2007

Segmentation of speech: child's play?
Proceedings of the INTERSPEECH 2007, 2007

Finding Maximum Margin Segments in Speech.
Proceedings of the IEEE International Conference on Acoustics, 2007

The AMI System for the Transcription of Speech in Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

The 2007 AMI(DA) System for Meeting Transcription.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
The AMI Meeting Transcription System: Progress and Performance.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Strategies for Language Model Web-Data Collection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Speech and crosstalk detection in multichannel audio.
IEEE Trans. Speech Audio Process., 2005

Speaker verification using sequence discriminant support vector machines.
IEEE Trans. Speech Audio Process., 2005

The Development of the AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

The 2005 AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

Polynomial dynamic time warping kernel support vector machines for dysarthric speech recognition with sparse training data.
Proceedings of the INTERSPEECH 2005, 2005

Transcription of conference room meetings: an investigation.
Proceedings of the INTERSPEECH 2005, 2005

2003
Speaker verification using support vector machines.
PhD thesis, 2003

Feature selection for the classification of crosstalk in multi-channel audio.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

SVMSVM: support vector machine speaker verification methodology.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Evaluation of kernel methods for speaker verification and identification.
Proceedings of the IEEE International Conference on Acoustics, 2002

1992
Book reviews.
Minds Mach., 1992

