Alan W. Black

Orcid: 0000-0001-8820-8831

Affiliations:
  • Carnegie Mellon University, Pittsburgh, USA


According to our database, Alan W. Black authored at least 337 papers between 1986 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT.
CoRR, 2023

Self-Supervised Models of Speech Infer Universal Articulatory Kinematics.
CoRR, 2023

Speaker-Independent Acoustic-to-Articulatory Speech Inversion.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution.
Proceedings of the IEEE International Conference on Acoustics, 2023

Articulatory Representation Learning via Joint Factor Analysis and Neural Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2023

CTC Alignments Improve Autoregressive Translation.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Phone Inventories and Recognition for Every Language.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Intent classification using pre-trained language agnostic embeddings for low resource languages.
Proceedings of the Interspeech 2022, 2022

Deep Speech Synthesis from Articulatory Representations.
Proceedings of the Interspeech 2022, 2022

Building African Voices.
Proceedings of the Interspeech 2022, 2022

Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition.
Proceedings of the Interspeech 2022, 2022

ASR2K: Speech Recognition for Around 2000 Languages without Audio.
Proceedings of the Interspeech 2022, 2022

Two-Pass Low Latency End-to-End Spoken Language Understanding.
Proceedings of the Interspeech 2022, 2022

End-to-End Speech Summarization Using Restricted Self-Attention.
Proceedings of the IEEE International Conference on Acoustics, 2022

ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet.
Proceedings of the IEEE International Conference on Acoustics, 2022

On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units.
CoRR, 2021

Intent Classification Using Pre-Trained Embeddings For Low Resource Languages.
CoRR, 2021

Speech Summarization using Restricted Self-Attention.
CoRR, 2021

Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems.
CoRR, 2021

Task-Specific Pre-Training and Cross Lingual Transfer for Code-Switched Data.
CoRR, 2021

Towards Automatic Route Description Unification in Spoken Dialog Systems.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Focused Attention Improves Document-Grounded Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Case Study: Deontological Ethics in NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multimodal Speech Summarization Through Semantic Concept Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Hierarchical Phone Recognition with Compositional Phonetics.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues.
Proceedings of the 9th International Conference on Learning Representations, 2021

Multilingual Phonetic Dataset for Low Resource Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Phone Distribution Estimation for Low Resource Languages.
Proceedings of the IEEE International Conference on Acoustics, 2021

Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages.
Proceedings of the IEEE International Conference on Acoustics, 2021

Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

NoiseQA: Challenge Set Evaluation for User-Centric Question Answering.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Cross-Lingual Transfer for Speech Processing Using Acoustic Language Similarity.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Towards Using Heterogeneous Relation Graphs for End-to-End TTS.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Intent Recognition and Unsupervised Slot Identification for Low-Resourced Spoken Dialog Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Grounding 'Grounding' in NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Mixing.
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data.
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

2020
Speech Technology for Unwritten Languages.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios.
CoRR, 2020

Mere account mein kitna balance hai? - On building voice enabled Banking Services for Multilingual Communities.
CoRR, 2020

Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages.
CoRR, 2020

Dissecting the components and factors of Neural Text Generation.
CoRR, 2020

Towards Minimal Supervision BERT-based Grammar Error Correction.
CoRR, 2020

LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

AlloVera: A Multilingual Allophone Database.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Resource for Computational Experiments on Mapudungun.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Nonlinear ISA with Auxiliary Variables for Learning Speech Representations.
Proceedings of the Interspeech 2020, 2020

Style Variation as a Vantage Point for Code-Switching.
Proceedings of the Interspeech 2020, 2020

Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History.
Proceedings of the 8th International Conference on Learning Representations, 2020

Universal Phone Recognition with a Multilingual Allophone System.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Reading Between the Lines: Exploring Infilling in Visual Narratives.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Understanding Linguistic Accommodation in Code-Switched Human-Machine Dialogues.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Exploring Controllable Text Generation Techniques.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Should You Fine-Tune BERT for Automated Essay Scoring?
Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2020

Detecting Entailment in Code-Mixed Hindi-English Conversations.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

A Corpus for Large-Scale Phonetic Typology.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Phone Features Improve Speech Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Topological Sort for Sentence Ordering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Politeness Transfer: A Tag and Generate Approach.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ClarQ: A large-scale and diverse dataset for Clarification Question Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Zero-Shot Learning for Automatic Phonemic Transcription.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Towards Minimal Supervision BERT-Based Grammar Error Correction (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Analyzing Wikipedia Deletion Debates with a Group Decision-Making Forecast Model.
Proc. ACM Hum. Comput. Interact., 2019

Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data.
CoRR, 2019

Induction and Reference of Entities in a Visual Story.
CoRR, 2019

CMU GetGoing: An Understandable and Memorable Dialog System for Seniors.
CoRR, 2019

Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs.
CoRR, 2019

WriterForcing: Generating more interesting story endings.
CoRR, 2019

Measuring Bias in Contextualized Word Representations.
CoRR, 2019

"My Way of Telling a Story": Persona based Grounded Story Generation.
CoRR, 2019

A Survey of Code-switched Speech and Language Processing.
CoRR, 2019

The ARIEL-CMU Systems for LoReHLT18.
CoRR, 2019

The Second Conversational Intelligence Challenge (ConvAI2).
CoRR, 2019

A Dynamic Strategy Coach for Effective Negotiation.
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019

Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Top-Down Structurally-Constrained Neural Response Generation with Lexicalized Probabilistic Context-Free Grammar.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Ordinal Triplet Loss: Investigating Sleepiness Detection from Speech.
Proceedings of the Interspeech 2019, 2019

Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora.
Proceedings of the Interspeech 2019, 2019

SANTLR: Speech Annotation Toolkit for Low Resource Languages.
Proceedings of the Interspeech 2019, 2019

Multilingual Speech Recognition with Corpus Relatedness Sampling.
Proceedings of the Interspeech 2019, 2019

Unsupervised Phonetic and Word Level Discovery for Speech to Speech Translation for Unwritten Languages.
Proceedings of the Interspeech 2019, 2019

Bag-of-Acoustic-Words for Mental Health Assessment: A Deep Autoencoding Approach.
Proceedings of the Interspeech 2019, 2019

Storyboarding of Recipes: Grounded Contextual Generation.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Learning Disentangled Representation in Latent Stochastic Models: A Case Study with Image Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2019

Phoneme Level Language Models for Sequence Based Low Resource ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019

CMU Wilderness Multilingual Speech Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2019

Question Answering for Privacy Policies: Combining Computational and Legal Perspectives.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Learning to Order Graph Elements with Application to Multilingual Surface Realization.
Proceedings of the 2nd Workshop on Multilingual Surface Realisation, 2019

Equity Beyond Bias in Language Technologies for Education.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

What A Sunny Day ☔: Toward Emoji-Sensitive Irony Detection.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Formality Style Transfer for Noisy, User-generated Conversations: Extracting Labeled, Parallel Data from Unlabeled Corpora.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Boosting Dialog Response Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Principled Frameworks for Evaluating Ethics in NLP Systems.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

Multimodal, Multilingual Grapheme-to-Phoneme Conversion for Low-Resource Languages.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

2018
The ARIEL-CMU situation frame detection pipeline for LoReHLT16: a model translation approach.
Mach. Transl., 2018

Multimodal Polynomial Fusion for Detecting Driver Distraction.
CoRR, 2018

Style Transfer Through Multilingual and Feedback-Based Back-Translation.
CoRR, 2018

Data Augmentation for Neural Online Chat Response Selection.
CoRR, 2018

Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs.
CoRR, 2018

Towards Improving Intelligibility of Black-Box Speech Synthesizers in Noise.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Domain Robust Feature Extraction for Rapid Low Resource ASR Development.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

An Empirical Study of Self-Disclosure in Spoken Dialogue Systems.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

DialCrowd: A toolkit for easy dialog system assessment.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

Investigating Utterance Level Representations for Detecting Intent from Acoustics.
Proceedings of the Interspeech 2018, 2018

Multimodal Polynomial Fusion for Detecting Driver Distraction.
Proceedings of the Interspeech 2018, 2018

An Investigation of Convolution Attention Based Models for Multilingual Speech Synthesis of Indian Languages.
Proceedings of the Interspeech 2018, 2018

Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sequence-Based Multi-Lingual Low Resource Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Dataset for Document Grounded Conversations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Data Augmentation for Neural Online Chats Response Selection.
Proceedings of the 2nd International Workshop on Search-Oriented Conversational AI, 2018

Style Transfer Through Back-Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Automatic Detection of Code-switching Style from Acoustics.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Tackling Code-Switched NER: Participation of CMU.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Language Informed Modeling of Code-Switched Text.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
Segment Level Voice Conversion with Recurrent Neural Networks.
Proceedings of the Interspeech 2017, 2017

On Building Mixed Lingual Speech Synthesis Systems.
Proceedings of the Interspeech 2017, 2017

Speech Synthesis for Mixed-Language Navigation Instructions.
Proceedings of the Interspeech 2017, 2017

Learning Conversational Systems that Interleave Task and Non-Task Content.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

WebShodh: A Code Mixed Factoid Question Answering System for Web.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2017

The blizzard machine learning challenge 2017.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

The CMU entry to blizzard machine learning challenge.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Linguistic Markers of Influence in Informal Interactions.
Proceedings of the Second Workshop on NLP and Computational Social Science, 2017

Integrating Verbal and Nonvebval Input into a Dynamic Response Spoken Dialogue System.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis.
CoRR, 2016

Mining Parallel Corpora from Sina Weibo and Twitter.
Comput. Linguistics, 2016

This Table is Different: A WordNet-Based Approach to Identifying References to Document Entities.
Proceedings of the 8th Global WordNet Conference, 2016

Open-Source Consumer-Grade Indic Text To Speech.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Utterance Selection Techniques for TTS Systems Using Found Speech.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Automatic Recognition of Conversational Strategies in the Service of a Socially-Aware Dialog System.
Proceedings of the SIGDIAL 2016 Conference, 2016

Strategy and Policy Learning for Non-Task-Oriented Conversational Systems.
Proceedings of the SIGDIAL 2016 Conference, 2016

A Wizard-of-Oz Study on A Non-Task-Oriented Dialog Systems That Reacts to User Engagement.
Proceedings of the SIGDIAL 2016 Conference, 2016

Initiations and Interruptions in a Spoken Dialog System.
Proceedings of the SIGDIAL 2016 Conference, 2016

Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning.
Proceedings of the NAACL HLT 2016, 2016

Speech Synthesis of Code-Mixed Text.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Multimodal HALEF: An Open-Source Modular Web-Based Multimodal Dialog Framework.
Proceedings of the Dialogues with Social Robots, 2016

Socially-Aware Virtual Agents: Automatically Assessing Dyadic Rapport from Temporal Patterns of Behavior.
Proceedings of the Intelligent Virtual Agents - 16th International Conference, 2016

User Engagement Study with Virtual Agents Under Different Cultural Contexts.
Proceedings of the Intelligent Virtual Agents - 16th International Conference, 2016

Deriving Phonetic Transcriptions and Discovering Word Segmentations for Speech-to-Speech Translation in Low-Resource Settings.
Proceedings of the Interspeech 2016, 2016

Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Feedback Utterances.
Proceedings of the Interspeech 2016, 2016

Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

On data driven parametric backchannel synthesis for expressing attentiveness in conversational agents.
Proceedings of the Workshop on Multimodal Analyses enabling Artificial Agents in Human-Machine Interaction, 2016

2015
Character-based Neural Machine Translation.
CoRR, 2015

Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation.
CoRR, 2015

An Incremental Turn-Taking Model with Active System Barge-in for Spoken Dialog Systems.
Proceedings of the SIGDIAL 2015 Conference, 2015

The Real Challenge 2014: Progress and Prospects.
Proceedings of the SIGDIAL 2015 Conference, 2015

Two/Too Simple Adaptations of Word2Vec for Syntax Problems.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2015, 2015

Universal grapheme-based speech synthesis.
Proceedings of the INTERSPEECH 2015, 2015

Using acoustics to improve pronunciation for synthesis of low resource languages.
Proceedings of the INTERSPEECH 2015, 2015

Distributed representation-based spoken word sense induction.
Proceedings of the INTERSPEECH 2015, 2015

Random forests for statistical speech synthesis.
Proceedings of the INTERSPEECH 2015, 2015

Using articulatory features and inferred phonological segments in zero resource speech processing.
Proceedings of the INTERSPEECH 2015, 2015

Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Not All Contexts Are Created Equal: Better Word Representations with Variable Attention.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Utterance classification in speech-to-speech translation for zero-resource languages in the hospital administration domain.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Automatic Keyword Extraction on Twitter.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Introduction to the Issue on Statistical Parametric Speech Synthesis.
IEEE J. Sel. Top. Signal Process., 2014

A Deep Learning Approach to Data-driven Parameterizations for Statistical Parametric Speech Synthesis.
CoRR, 2014

The Dialog State Tracking Challenge Series.
AI Mag., 2014

Crowdsourcing High-Quality Parallel Data Extraction from Twitter.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Automatic discovery of a phonetic inventory for unwritten languages for statistical speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Modified post-filter to recover modulation spectrum for HMM-based speech synthesis.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Modulation spectrum-based post-filter for GMM-based Voice Conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Text to speech in new languages without a standardized orthography.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Minimum error rate training for phrasing in speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Automatic Prediction of Friendship via Multi-model Dyadic Features.
Proceedings of the SIGDIAL 2013 Conference, 2013

The Dialog State Tracking Challenge.
Proceedings of the SIGDIAL 2013 Conference, 2013

Optimizations and fitting procedures for the liljencrants-fant model for statistical parametric speech synthesis.
Proceedings of the INTERSPEECH 2013, 2013

Analysis and modeling of "focus" in context.
Proceedings of the INTERSPEECH 2013, 2013

Bootstrapping Text-to-Speech for speech processing in languages without an orthography.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improving ASR by integrating lecture audio and slides.
Proceedings of the IEEE International Conference on Acoustics, 2013

A style capturing approach to F0 transformation in voice conversion.
Proceedings of the IEEE International Conference on Acoustics, 2013

Accent Group modeling for improved prosody in statistical parameteric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013

Paraphrasing 4 Microblog Normalization.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Improved punctuation recovery through combination of multiple speech streams.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Microblogs as Parallel Corpora.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Recovery of acronyms, out-of-lattice words and pronunciations from parallel multilingual speech.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Intent transfer in speech-to-speech machine translation.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

"Love ya, jerkface": Using Sparse Log-Linear Models to Build Positive and Impolite Relationships with Teens.
Proceedings of the SIGDIAL 2012 Conference, 2012

Future Directions in Spoken Dialog Systems: A Community of Possibilities.
Proceedings of the Workshop on Future directions and needs in the Spoken Dialog Community: Tools and Data, 2012

Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Real Users and Real Dialog Systems: The Hard Challenge for SDS.
Proceedings of the Natural Interaction with Robots, 2012

The IIIT-H Indic Speech Databases.
Proceedings of the INTERSPEECH 2012, 2012

Modeling Pause-Duration for Style-Specific Speech Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Parallel combination of multilingual speech streams for improved ASR.
Proceedings of the INTERSPEECH 2012, 2012

Modelling a Noisy-channel for Voice Conversion Using Articulatory Features.
Proceedings of the INTERSPEECH 2012, 2012

Text-dependent pathological voice detection.
Proceedings of the INTERSPEECH 2012, 2012

Data-driven phrasing for speech synthesis in low-resource languages.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Articulatory features for expressive speech synthesis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Entropy-based Pruning for Phrase-based Machine Translation.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Text-To-Speech for Languages without an Orthography.
Proceedings of the COLING 2012, 2012

Improving Relative-Entropy Pruning using Statistical Significance.
Proceedings of the COLING 2012, 2012

2011
Segmentation of Monologues in Audio Books for Building Synthetic Voices.
IEEE Trans. Speech Audio Process., 2011

Spoken Dialog Challenge 2010: Comparison of Live and Control Test Results.
Proceedings of the SIGDIAL 2011 Conference, 2011

Named entity translation using anchor texts.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

A Grammar Based Approach to Style Specific Phrase Prediction.
Proceedings of the INTERSPEECH 2011, 2011

Using Speaker ID to Discover Repeat Callers of a Spoken Dialog System.
Proceedings of the INTERSPEECH 2011, 2011

A Statistical Phrase/Accent Model for Intonation Modeling.
Proceedings of the INTERSPEECH 2011, 2011

Discriminative Phrase-based Lexicalized Reordering Models using Weighted Reordering Graphs.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

A Review of Personality in Voice-Based Man Machine Interaction.
Proceedings of the Human-Computer Interaction. Interaction Techniques and Environments, 2011

2010
Spectral Mapping Using Artificial Neural Networks for Voice Conversion.
IEEE Trans. Speech Audio Process., 2010

Learning speaker-specific phrase breaks for text-to-speech systems.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Handling large audio files in audio books for building synthetic voices.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Improving speech synthesis for noisy environments.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

KLATTSTAT: knowledge-based parametric speech synthesis.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Adaptation techniques for speech synthesis in under-resourced languages.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Spoken Dialog Challenge 2010.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Towards Improving the Naturalness of Social Conversations with Dialogue Systems.
Proceedings of the SIGDIAL 2010 Conference, 2010

Improving speech synthesis of machine translation output.
Proceedings of the INTERSPEECH 2010, 2010

Evaluating a dialog language generation system: comparing the mountain system to other NLG approaches.
Proceedings of the INTERSPEECH 2010, 2010

2009
Statistical parametric speech synthesis.
Speech Commun., 2009

The Spoken Dialogue Challenge.
Proceedings of the SIGDIAL 2009 Conference, 2009

Incremental Adaptation of Speech-to-Speech Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Voice convergin: Speaker de-identification by voice transformation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Voice conversion using Artificial Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2009

Optimizing segment label boundaries for statistical speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2009

Speaker de-identification via voice transformation.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Pronunciation modeling for dialectal Arabic speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model.
Speech Commun., 2008

Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Global syllable set for building speech synthesis in Indian languages.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Incorporating durational modification in voice transformation.
Proceedings of the INTERSPEECH 2008, 2008

Building sleek synthesizers for multi-lingual screen reader.
Proceedings of the INTERSPEECH 2008, 2008

Improving speech systems built from very little data.
Proceedings of the INTERSPEECH 2008, 2008

Let's go lab: a platform for evaluation of spoken dialog systems with real world users.
Proceedings of the INTERSPEECH 2008, 2008

Is voice transformation a threat to speaker identification?
Proceedings of the IEEE International Conference on Acoustics, 2008

Significance of early tagged contextual graphemes in grapheme based speech synthesis and recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2008

Speech Translation for Triage of Emergency Phonecalls in Minority Languages.
Proceedings of the workshop on Speech Processing for Safety Critical Translation and Pervasive Applications@COLING 2008, 2008

Building Practical Spoken Dialog Systems.
Proceedings of the ACL 2008, 2008

2007
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory.
IEEE Trans. Speech Audio Process., 2007

The HMM-based speech synthesis system (HTS) version 2.0.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Using articulatory position data in voice transformation.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Text processing for text-to-speech systems in Indian languages.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Understandable production of massive synthesis.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Building a better Indian English voice using "more data".
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Voice building from insufficient data - classroom experiences with web-based language development tools.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

The Blizzard Challenge: evaluating corpus-based speech synthesis techniques.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Speech synthesis for educational technology.
Proceedings of the Workshop on Speech and Language Technology in Education, 2007

SPICE: web-based tools for rapid language adaptation in speech processing systems.
Proceedings of the INTERSPEECH 2007, 2007

Automatic building of synthetic voices from large multi-paragraph speech databases.
Proceedings of the INTERSPEECH 2007, 2007

ugloss: a framework for improving spoken language generation understandability.
Proceedings of the INTERSPEECH 2007, 2007

2006
Flexible speech translation systems.
IEEE Trans. Speech Audio Process., 2006

Online Supervised Learning of Non-Understanding Recovery Policies.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Intelligibility of machine translation output in speech synthesis.
Proceedings of the INTERSPEECH 2006, 2006

Doing research on a deployed spoken dialogue system: one year of let's go! experience.
Proceedings of the INTERSPEECH 2006, 2006

Generating time-constrained audio presentations of structured information.
Proceedings of the INTERSPEECH 2006, 2006

Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system.
Proceedings of the INTERSPEECH 2006, 2006

CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling.
Proceedings of the INTERSPEECH 2006, 2006

Visual Evaluation of Voice Transformation Based on Knowledge of Speaker.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Text-Independent Voice Conversion Based on Unit Selection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Sub-Phonetic Modeling For Capturing Pronunciation Variations For Conversational Speech Synthesis.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Cross-speaker articulatory position data for phonetic feature prediction.
Proceedings of the INTERSPEECH 2005, 2005

Foreign accents in synthetic speech: development and evaluation.
Proceedings of the INTERSPEECH 2005, 2005

Let's go public! taking a spoken dialog system to the real world.
Proceedings of the INTERSPEECH 2005, 2005

Measuring unsupervised acoustic clustering through phoneme pair merge-and-split tests.
Proceedings of the INTERSPEECH 2005, 2005

The blizzard challenge - 2005: evaluating corpus-based speech synthesis on common datasets.
Proceedings of the INTERSPEECH 2005, 2005

Spectral Conversion Based on Maximum Likelihood Estimation Considering Global Variance of Converted Parameter.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Thai Automatic Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Improving the Understandability of Speech Synthesis by Modeling Speech in Noise.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Prediction of Pronunciation Variations for Speech Synthesis: A Data-Driven Approach.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Prominence prediction for supersentential prosodic modeling based on a new database.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Unit selection voice for Amharic using Festvox.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Creating a database of speech in noise for unit selection synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

The CMU Arctic speech databases.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Impact of durational outlier removal from unit selection catalogs.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

A Thai Speech Translation System for Medical Dialogs.
Proceedings of the Demonstration Papers at HLT-NAACL 2004, 2004

Acoustic-to-articulatory inversion mapping with Gaussian mixture model.
Proceedings of the INTERSPEECH 2004, 2004

Bootstrapping phonetic lexicons for new languages.
Proceedings of the INTERSPEECH 2004, 2004

A family-of-models approach to HMM-based segmentation for unit selection speech synthesis.
Proceedings of the INTERSPEECH 2004, 2004

Multilingual text-to-speech synthesis.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Optimal Utterance Selection for Unit Selection Speech Synthesis Databases.
Int. J. Speech Technol., 2003

Speechalator: Two-Way Speech-to-Speech Translation in Your Hand.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Identifying speakers in children's stories for speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speechalator: two-way speech-to-speech translation on a consumer PDA.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Arabic in my hand: small-footprint synthesis of Egyptian Arabic.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

LET's GO: improving spoken dialog systems for the elderly and non-natives.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluating and correcting phoneme segmentation for unit selection synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unit size in unit selection speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Unit selection and emotional speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Using acoustic models to choose pronunciation variations for synthetic voices.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Evaluation and collection of proper name pronunciations online.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Field Testing the Tongues Speech-to-Speech Machine Translation System.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Rapid development of speech-to-speech translation systems.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Building voiceXML-based applications.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Heterogeneous relation graphs as a formalism for representing linguistic information.
Speech Commun., 2001

Normalization of non-standard words.
Comput. Speech Lang., 2001

Flite: a small fast run-time synthesis engine.
Proceedings of the 4th ITRW on Speech Synthesis, Perthshire, Scotland, UK, August 29, 2001

Optimal data selection for unit selection synthesis.
Proceedings of the 4th ITRW on Speech Synthesis, Perthshire, Scotland, UK, August 29, 2001

Knowledge of language origin improves pronunciation accuracy of proper names.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A study on speech over the telephone and aging.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Audio signals in speech interfaces.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Task and domain specific modelling in the Carnegie Mellon communicator system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Towards a universal speech interface.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Non-standard word and homograph resolution for Asian language text analysis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Diphone collection and synthesis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Statistically trained orthographic to sound models for Thai.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Limited domain synthesis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Speech synthesis by phonological structure matching.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Using decision trees within the tilt intonation model to predict F0 contours.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Assigning phrase breaks from part-of-speech sequences.
Comput. Speech Lang., 1998

The architecture of the Festival speech synthesis system.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Three methods of intonation modeling.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Issues in building general letter to sound rules.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

SABLE: a standard for TTS markup.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Letter to sound rules for accented lexicon compression.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On the use of automatically generated discourse-level information in a concept-to-speech synthesis system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Automatically clustering similar units for unit selection in speech synthesis.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Predicting the Intonation of Discourse Segments from Examples in Dialogue Speech.
Proceedings of the Computing Prosody, 1997

1996
Generating F0 contours from ToBI labels using linear regression.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Unit selection in a concatenative speech synthesis system using a large speech database.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Optimising selection of units from speech databases for concatenative synthesis.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Synthesizing conversational intonation from a linguistically rich input.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Assigning intonation elements and prosodic phrasing for English speech synthesis from high level linguistic input.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

CHATR: a generic speech synthesis system.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

1992
Embedding DRT in a Situation Theoretic Framework.
Proceedings of the 14th International Conference on Computational Linguistics, 1992

1991
Analysis of Unknown Words through Morphological Decomposition.
Proceedings of the EACL 1991, 1991

1989
Finite State Machines from Feature Grammars.
Proceedings of the First International Workshop on Parsing Technologies, 1989

1987
A Computational Framework for Lexical Description.
Comput. Linguistics, 1987

Formalisms For Morphographemic Description.
Proceedings of the EACL 1987, 1987

1986
A Dictionary and Morphological Analyser for English.
Proceedings of the 11th International Conference on Computational Linguistics, 1986

