Raul Fernandez

Marcelo Carpinette Grave

Julio Nogima

Ron Hoory

Proceedings of the 29th International Conference on Intelligent User Interfaces, 2024

Exploring the Benefits of Tokenization of Discrete Acoustic Units.

Avihu Dekel

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Speak While You Think: Streaming Speech Synthesis During Text Generation.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

A Neural TTS System with Parallel Prosody Transfer from Unseen Speakers.

Slava Shechtman

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Supervised and unsupervised approaches for controlling narrow lexical focus in sequence-to-sequence speech synthesis.

Slava Shechtman

David Haws

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Synthesis of Expressive Speaking Styles with Limited Training Data in a Multi-Speaker, Prosody-Controllable Sequence-to-Sequence Architecture.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Stable Checkpoint Selection and Evaluation in Sequence to Sequence Speech Synthesis.

Slava Shechtman

David Haws

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2019

Deep Mixture-of-Experts Models for Synthetic Prosodic-Contour Generation.

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

2018

Comparing Prosodic Frameworks: Investigating the Acoustic-Symbolic Relationship in ToBI and RaP.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Data Augmentation Improves Recognition of Foreign Accented Speech.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Measuring the Effect of Linguistic Resources on Prosody Modeling for Speech Synthesis.

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017

Weakly-Supervised Phrase Assignment from Text in a Speech-Synthesis System Using Noisy Labels.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Voice-transformation-based data augmentation for prosodic classification.

Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016

Using continuous lexical embeddings to improve symbolic-prosody prediction in a text-to-speech front-end.

Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015

Modeling phrasing and prominence using deep recurrent learning.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Development of a smart insole tracking system for physical therapy and athletics.

Proceedings of the 7th International Conference on PErvasive Technologies Related to Assistive Environments, 2014

Prosody contour prediction with long short-term memory, bi-directional, deep recurrent neural networks.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Exploiting vocal-source features to improve ASR accuracy for low-resource languages.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

F0 contour prediction with a deep belief network-Gaussian process hybrid model.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

2012

Phrase Boundary Assignment from Text in Multiple Domains.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Prediction of F0 contours from symbolic and numerical variables using continuous conditional random fields.

Steve Minnis

Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

2011

Recognizing affect from speech prosody using hierarchical graphical models.

Speech Commun., 2011

"What is... Dengue Fever?" - Modeling and Predicting Pronunciation Errors in a Text-to-Speech System.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Exploiting active-learning strategies for annotating prosodic events with limited labeled data.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

2010

Efficient peerGroup management in JXTA-Overlay P2P system for developing groupware tools.

J. Supercomput., 2010

fMRI Data Visualization with BrainBlend and Blender.

Neuroinformatics, 2010

Supporting Scenario-Based Online Learning with P2P Group-Based Systems.

Proceedings of the 13th International Conference on Network-Based Information Systems, 2010

Discriminative training and unsupervised adaptation for labeling prosodic events with limited training data.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech.

Srikanth Vishnubhotla

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

2009

Jxta-Overlay: An interface for efficient peer selection in P2P JXTA-based systems.

Comput. Stand. Interfaces, 2009

2008

Extension and evaluation of JXTA protocols for supporting reliable P2P distributed computing.

Int. J. Web Inf. Syst., 2008

Practical approach to experimentation in a simulation study.

Benny Tjahjono

Proceedings of the 2008 Winter Simulation Conference, Global Gateway to Discovery, 2008

Extending JXTA Protocols for P2P File Sharing Systems.

Proceedings of the Second International Conference on Complex, Intelligent and Software Intensive Systems, 2008

The IBM Submission to the 2008 Text-to-Speech Blizzard Challenge.

Proceedings of the Blizzard Challenge 2008, 2008

Efficient Peer Selection in P2P JXTA-Based Platforms.

Proceedings of the 22nd International Conference on Advanced Information Networking and Applications, 2008

2007

Enabling Efficient Real Time User Modeling in On-Line Campus.

Proceedings of the User Modeling 2007, 11th International Conference, 2007

Automatic exploration of corpus-specific properties for expressive text-to-speech: a case study in emphasis.

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Improvement of JXTA Protocols for Supporting Reliable Distributed Applications in P2P Systems.

Proceedings of the Network-Based Information Systems, First International Conference, 2007

An Experimental Study on Peer Selection in a P2P Network over PlanetLab.

Proceedings of the 2007 International Conference on Parallel Processing Workshops (ICPP Workshops 2007), 2007

Database Mining for Flexible Concatenative Text-to-Speech.

Ellen Eide

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007

2006

The IBM expressive text-to-speech synthesis system for American English.

IEEE Trans. Speech Audio Process., 2006

The IBM Submission to the 2006 Blizzard Text-to-Speech Challenge.

Proceedings of the Blizzard Challenge 2006, Pittsburgh, PA, USA, September 16, 2006

2005

Toward multiple-language TTS: experiments in English and Mandarin.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Classical and novel discriminant features for affect recognition from speech.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Upending the Uncanny Valley.

Proceedings of the Proceedings, 2005

2003

Modeling drivers' speech under stress.

Speech Commun., 2003

2001

Frustrating the user on purpose: a step toward building an affective computer.

Interact. Comput., 2001

1999

Expression glasses: a wearable device for facial expression recognition.

Jocelyn Scheirer

Proceedings of the CHI '99 Extended Abstracts on Human Factors in Computing Systems, 1999

1998

Signal processing for recognition of human frustration.
