Raul Fernandez

Orcid: 0009-0009-7650-193X

According to our database1, Raul Fernandez authored at least 47 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations.
Proceedings of the 29th International Conference on Intelligent User Interfaces, 2024

2023
Speak While You Think: Streaming Speech Synthesis During Text Generation.
CoRR, 2023

2022
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis.
Proceedings of the Interspeech 2022, 2022

2021
Supervised and unsupervised approaches for controlling narrow lexical focus in sequence-to-sequence speech synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Synthesis of Expressive Speaking Styles with Limited Training Data in a Multi-Speaker, Prosody-Controllable Sequence-to-Sequence Architecture.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Stable Checkpoint Selection and Evaluation in Sequence to Sequence Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

2018
Comparing Prosodic Frameworks: Investigating the Acoustic-Symbolic Relationship in ToBI and RaP.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Data Augmentation Improves Recognition of Foreign Accented Speech.
Proceedings of the Interspeech 2018, 2018

Measuring the Effect of Linguistic Resources on Prosody Modeling for Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Weakly-Supervised Phrase Assignment from Text in a Speech-Synthesis System Using Noisy Labels.
Proceedings of the Interspeech 2017, 2017

Voice-transformation-based data augmentation for prosodic classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Using continuous lexical embeddings to improve symbolic-prosody prediction in a text-to-speech front-end.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Modeling phrasing and prominence using deep recurrent learning.
Proceedings of the INTERSPEECH 2015, 2015

Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system.
Proceedings of the INTERSPEECH 2015, 2015

2014
Development of a smart insole tracking system for physical therapy and athletics.
Proceedings of the 7th International Conference on PErvasive Technologies Related to Assistive Environments, 2014

Prosody contour prediction with long short-term memory, bi-directional, deep recurrent neural networks.
Proceedings of the INTERSPEECH 2014, 2014

Exploiting vocal-source features to improve ASR accuracy for low-resource languages.
Proceedings of the INTERSPEECH 2014, 2014

2013
F0 contour prediction with a deep belief network-Gaussian process hybrid model.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Phrase Boundary Assignment from Text in Multiple Domains.
Proceedings of the INTERSPEECH 2012, 2012

Prediction of F0 contours from symbolic and numerical variables using continuous conditional random fields.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Recognizing affect from speech prosody using hierarchical graphical models.
Speech Commun., 2011

"What is... Dengue Fever?" - Modeling and Predicting Pronunciation Errors in a Text-to-Speech System.
Proceedings of the INTERSPEECH 2011, 2011

Exploiting active-learning strategies for annotating prosodic events with limited labeled data.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Efficient peerGroup management in JXTA-Overlay P2P system for developing groupware tools.
J. Supercomput., 2010

fMRI Data Visualization with BrainBlend and Blender.
Neuroinformatics, 2010

Supporting Scenario-Based Online Learning with P2P Group-Based Systems.
Proceedings of the 13th International Conference on Network-Based Information Systems, 2010

Discriminative training and unsupervised adaptation for labeling prosodic events with limited training data.
Proceedings of the INTERSPEECH 2010, 2010

An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Jxta-Overlay: An interface for efficient peer selection in P2P JXTA-based systems.
Comput. Stand. Interfaces, 2009

2008
Extension and evaluation of JXTA protocols for supporting reliable P2P distributed computing.
Int. J. Web Inf. Syst., 2008

Practical approach to experimentation in a simulation study.
Proceedings of the 2008 Winter Simulation Conference, Global Gateway to Discovery, 2008

Extending JXTA Protocols for P2P File Sharing Systems.
Proceedings of the Second International Conference on Complex, 2008

Efficient Peer Selection in P2P JXTA-Based Platforms.
Proceedings of the 22nd International Conference on Advanced Information Networking and Applications, 2008

2007
Enabling Efficient Real Time User Modeling in On-Line Campus.
Proceedings of the User Modeling 2007, 11th International Conference, 2007

Automatic exploration of corpus-specific properties for expressive text-to-speech: a case study in emphasis.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Improvement of JXTA Protocols for Supporting Reliable Distributed Applications in P2P Systems.
Proceedings of the Network-Based Information Systems, First International Conference, 2007

An Experimental Study on Peer Selection in a P2P Network over PlanetLab.
Proceedings of the 2007 International Conference on Parallel Processing Workshops (ICPP Workshops 2007), 2007

Database Mining for Flexible Concatenative Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
The IBM expressive text-to-speech synthesis system for American English.
IEEE Trans. Speech Audio Process., 2006

2005
Toward multiple-language TTS: experiments in English and Mandarin.
Proceedings of the INTERSPEECH 2005, 2005

Classical and novel discriminant features for affect recognition from speech.
Proceedings of the INTERSPEECH 2005, 2005

Upending the Uncanny Valley.
Proceedings of the Proceedings, 2005

2003
Modeling drivers' speech under stress.
Speech Commun., 2003

2001
Frustrating the user on purpose: a step toward building an affective computer.
Interact. Comput., 2001

1999
Expression glasses: a wearable device for facial expression recognition.
Proceedings of the CHI '99 Extended Abstracts on Human Factors in Computing Systems, 1999

1998
Signal processing for recognition of human frustration.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Frustrating the user on purpose: using biosignals in a pilot study to detect the user's emotional state.
Proceedings of the CHI 98 Conference Summary on Human Factors in Computing Systems, 1998


  Loading...