James R. Glass

CoRR, 2020

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.

[BibT_eX]

[DOI]

CoRR, 2020

CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning.

[BibT_eX]

[DOI]

Sameer Khurana

Antoine Laurent

CoRR, 2020

On the Linguistic Representational Power of Neural Machine Translation Models.

[BibT_eX]

[DOI]

Comput. Linguistics, 2020

Multimodal Association for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption.

[BibT_eX]

[DOI]

Shang-Wen Li

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Methods for Evaluating Speech Representations.

[BibT_eX]

[DOI]

Michael Gump

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Vector-Quantized Autoregressive Predictive Coding.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information?

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation.

[BibT_eX]

[DOI]

Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

ADI17: A Fine-Grained Arabic Dialect Identification Dataset.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning a Subword Inventory Jointly with End-to-End Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generative Pre-Training for Speech with Autoregressive Predictive Coding.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

We Can Detect Your Bias: Predicting the Political Ideology of News Articles.

[BibT_eX]

[DOI]

Ramy Baly

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Similarity Analysis of Contextual Word Representation Models.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Negative Training for Neural Dialogue Response Generation.

[BibT_eX]

[DOI]

Tianxing He

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Improved Speech Representations with Multi-Target Autoregressive Predictive Coding.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks.

[BibT_eX]

[DOI]

Shang-Wen Li

Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020

2019

Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Achintya Kumar Sarkar

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Deep Learning for Database Mapping and Asking Clarification Questions in Dialogue Systems.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Analysis Methods in Neural Language Processing: A Survey.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2019

Automatic Fact-Checking Using Context and Discourse Information.

[BibT_eX]

[DOI]

Pepa Atanasova

Lluís Màrquez

ACM J. Data Inf. Qual., 2019

Language processing and learning models for community question answering in Arabic.

[BibT_eX]

[DOI]

Salvatore Romeo

Inf. Process. Manag., 2019

Mix-review: Alleviate Forgetting in the Pretrain-Finetune Framework for Neural Language Generation Models.

[BibT_eX]

[DOI]

CoRR, 2019

DARTS: Dialectal Arabic Transcription System.

[BibT_eX]

[DOI]

Sameer Khurana

CoRR, 2019

Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models.

[BibT_eX]

[DOI]

Wei Fang

CoRR, 2019

Quantifying Exposure Bias for Neural Language Generation.

[BibT_eX]

[DOI]

CoRR, 2019

Adversarial Domain Adaptation for Stance Detection.

[BibT_eX]

[DOI]

Brian Xu

CoRR, 2019

Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection.

[BibT_eX]

[DOI]

Abdelrhman Saleh

Ramy Baly

Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

FAKTA: An Automatic End-to-End Fact Checking System.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Fast and Robust 3-D Sound Source Localization with DSVD-PHAT.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

VoiceID Loss: Speech Enhancement for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Integrating Video Retrieval and Moment Detection in a Unified Corpus for Video Question Answering.

[BibT_eX]

[DOI]

Karthik Krishnamurthy

Brigitte Richardson

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Comparison of Deep Learning Methods for Language Understanding.

[BibT_eX]

[DOI]

Zoe Liu

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Transfer Learning from Audio-Visual Grounding to Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multiple Sound Source Localization with SVD-PHAT.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Deep Residual Network for Large-Scale Acoustic Scene Analysis.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Unsupervised Autoregressive Model for Speech Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio.

[BibT_eX]

[DOI]

Emmanuel Azuh

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Detecting Egregious Responses in Neural Sequence-to-sequence Models.

[BibT_eX]

[DOI]

Tianxing He

Proceedings of the 7th International Conference on Learning Representations, 2019

Identifying and Controlling Important Neurons in Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion.

[BibT_eX]

[DOI]

Tae-Hyun Oh

Proceedings of the IEEE International Conference on Acoustics, 2019

Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Dialogue State Tracking with Convolutional Semantic Taggers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

A Factorial Deep Markov Model for Unsupervised Disentangled Representation Learning from Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Towards Visually Grounded Sub-word Speech Unit Discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

SVD-PHAT: A Fast Sound Source Localization Method.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Subword Regularization and Beam Search Decoding for End-to-end Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Towards Unsupervised Speech-to-text Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Tanbih: Get To Know What You Are Reading.

[BibT_eX]

[DOI]

Yifan Zhang

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Contrastive Language Adaptation for Cross-Lingual Stance Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Sound Event Localization and Detection Using CRNN on Pairs of Microphones.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Learning Words by Drawing Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Grounding Spoken Words in Unlabeled Video.

[BibT_eX]

[DOI]

Rogério Schmidt Feris

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Explicit Alignment of Text and Speech Encodings for Attention-Based End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Improving Neural Language Models by Segmenting, Attending, and Predicting the Future.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

A Low-Power Speech Recognizer and Voice Activity Detector Using Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2018

A Study of the Complexity and Accuracy of Direction of Arrival Estimation Methods Based on GCC-PHAT for a Pair of Close Microphones.

[BibT_eX]

[DOI]

CoRR, 2018

On The Inductive Bias of Words in Acoustics-to-Word Models.

[BibT_eX]

[DOI]

CoRR, 2018

MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System.

[BibT_eX]

[DOI]

CoRR, 2018

Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data.

[BibT_eX]

[DOI]

CoRR, 2018

Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition.

[BibT_eX]

[DOI]

CoRR, 2018

Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

On Training Recurrent Networks with Truncated Backpropagation Through time in Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Unsupervised Representation Learning of Speech for Dialect Identification.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Convolutional Neural Networks for Dialogue State Tracking without Pre-Trained Word Vectors or Semantic Dictionaries.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Combining End-to-End and Adversarial Training for Low-Resource Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Automatic Stance Detection Using End-to-End Memory Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Role-specific Language Models for Processing Recorded Neuropsychological Exams.

[BibT_eX]

[DOI]

Rhoda Au

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Supervised and Unsupervised Transfer Learning for Question Answering.

[BibT_eX]

[DOI]

Hung-yi Lee

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Integrating Stance Detection and Fact Checking in a Unified Corpus.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Scalable Factorized Hierarchical Variational Autoencoder Training.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Detecting Depression with Audio/Text Sequence Modeling of Interviews.

[BibT_eX]

[DOI]

Mohammad M. Ghassemi

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Noise-Robust Self-Adaptive Multitarget Speaker Detection System.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Convolutional Neural Networks and Multitask Strategies for Semantic Mapping of Natural Language Input to a Structured Database.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Energy-Efficient Speaker Identification with Low-Precision Networks.

[BibT_eX]

[DOI]

Skanda Koppula

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech.

[BibT_eX]

[DOI]

Galen Chuang

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Learning Word Representations with Cross-Sentence Dependencyfor End-to-End Co-reference Resolution.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Predicting Factuality of Reporting and Bias of News Media Sources.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Fact Checking in Community Forums.

[BibT_eX]

[DOI]

Tsvetomila Mihaylova

Lluís Màrquez

Georgi Karadzhov

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Spoken Language Understanding for a Nutrition Dialogue System.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Learning Word Embeddings from Speech.

[BibT_eX]

[DOI]

CoRR, 2017

Bidirectional Backpropagation: Towards Biologically Plausible Error Signal Transmission in Neural Networks.

[BibT_eX]

[DOI]

Jie Fu

CoRR, 2017

Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

14.4 A scalable speech recognizer with deep-neural-network acoustic models and voice-activated power gating.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Solid-State Circuits Conference, 2017

Character-Based Embedding Models and Reranking Strategies for Understanding Natural Language Meal Descriptions.

[BibT_eX]

[DOI]

Zachary Collins

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

QMDIS: QCRI-MIT Advanced Dialect Identification System.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Learning Latent Representations for Speech Generation and Transformation.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An Environmental Feature Representation for Robust Speech Recognition and for Environment Identification.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Semantic mapping of natural language input to database entries via convolutional neural networks.

[BibT_eX]

[DOI]

Zachary Collins

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

MIT-QCRI Arabic dialect identification system for the 2017 multi-genre broadcast challenge.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Automatic speech recognition of Arabic multi-genre broadcast media.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Learning modality-invariant representations for speech and images.

[BibT_eX]

[DOI]

Kenneth Leidal

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Spoken language biomarkers for detecting cognitive impairment.

[BibT_eX]

[DOI]

Tuka Alhanai

Rhoda Au

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Learning Word-Like Units from Joint Audio-Visual Analysis.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

What do Neural Machine Translation Models Learn about Morphology?

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

On the Use of Acoustic Unit Discovery for Language Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Recurrent Neural Network Encoder with Attention for Community Question Answering.

[BibT_eX]

[DOI]

CoRR, 2016

Large-Scale Machine Translation between Arabic and Hebrew: Available Corpora and Initial Results.

[BibT_eX]

[DOI]

CoRR, 2016

A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects.

[BibT_eX]

[DOI]

Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Look, listen, and decode: Multimodal speech recognition with images.

[BibT_eX]

[DOI]

Felix Sun

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

A prioritized grid long short-term memory RNN for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Development of the MIT ASR system for the 2016 Arabic Multi-genre Broadcast Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The MGB-2 challenge: Arabic multi-dialect broadcast media recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

SemEval-2016 Task 3: Community Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Learning Semantic Relatedness in Community Question Answering Using Neural Models.

[BibT_eX]

[DOI]

Henry Nassif

Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

Unsupervised Learning of Spoken Language with Visual Context.

[BibT_eX]

[DOI]

Antonio Torralba

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Dialect Detection in Arabic Broadcast Speech.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Highway long short-term memory RNNS for distant speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Prediction-adaptation-correction recurrent neural networks for low-resource language speech recognition.

[BibT_eX]

[DOI]

Dong Yu

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.

[BibT_eX]

[DOI]

Nancy F. Chen

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Distributional semantics for understanding spoken meal descriptions.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual data selection for training stacked bottleneck features.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Neural Attention for Learning to Rank Questions in Community Question Answering.

[BibT_eX]

[DOI]

Salvatore Romeo

Proceedings of the COLING 2016, 2016

2015

Spoken Content Retrieval - Beyond Cascading Speech Recognition with Text Retrieval.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Unsupervised Lexicon Discovery from Acoustic Input.

[BibT_eX]

[DOI]

Timothy J. O'Donnell

Trans. Assoc. Comput. Linguistics, 2015

A 6 mW, 5, 000-Word Real-Time Speech Recognizer Using WFST Models.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2015

A Situationally Aware Voice-commandable Robotic Forklift Working Alongside People in Unstructured Outdoor Environments.

[BibT_eX]

[DOI]

Matthew R. Walter

Matthew E. Antone

J. Field Robotics, 2015

SemEval-2015 Task 3: Answer Selection in Community Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

VectorSLU: A Continuous Word Vector Approach to Answer Selection in Community Question Answering Systems.

[BibT_eX]

[DOI]

Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

A Vector Space Approach for Aspect Based Sentiment Analysis.

[BibT_eX]

[DOI]

Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, 2015

Mispronunciation detection without nonnative training data.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speaker adaptation using the i-vector technique for bottleneck features.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

On using heterogeneous data for vehicle-based speech recognition: A DNN-based approach.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Arabic Diacritization with Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Wait-Learning: Leveraging Wait Time for Second Language Education.

[BibT_eX]

[DOI]

Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

Deep multimodal semantic embeddings for speech and images.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Data collection and language understanding of food descriptions.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

A complete KALDI recipe for building Arabic speech recognition systems.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

27.2 A 6mW 5K-Word real-time speech recognizer using WFST models.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Solid-State Circuits Conference, 2014

Limited labels for unlimited data: active learning for speaker recognition.

[BibT_eX]

[DOI]

Stephen H. Shum

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages.

[BibT_eX]

[DOI]

Hung-yi Lee

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Context-dependent pronunciation error pattern discovery with limited annotations.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speech recognition without a lexicon - bridging the gap between graphemic and phonetic systems.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Lexical modeling for Arabic ASR: a systematic approach.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Language ID-based training of multilingual stacked bottleneck features.

[BibT_eX]

[DOI]

Anne Cutler

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Extracting deep neural network bottleneck features using low-rank matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition.

[BibT_eX]

[DOI]

Xue Feng

Proceedings of the IEEE International Conference on Acoustics, 2014

A Study of using Syntactic and Semantic Structures for Concept Segmentation and Labeling.

[BibT_eX]

[DOI]

Lluís Màrquez i Villodre

Alessandro Moschitti

Proceedings of the COLING 2014, 2014

One-shot learning of generative speech concepts.

[BibT_eX]

[DOI]

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Wait-learning: leveraging conversational dead time for second language education.

[BibT_eX]

[DOI]

Proceedings of the CHI Conference on Human Factors in Computing Systems, 2014

2013

Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Learning Lexicons From Speech Using a Pronunciation Mixture Model.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Pronunciation assessment via a comparison-based system.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Probabilistic Dialogue Modeling for Speech-Enabled Assistive Technology.

[BibT_eX]

[DOI]

Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Bayesian distance metric learning on i-vector for speaker verification.

[BibT_eX]

[DOI]

Xiao Fang

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Asgard: A portable architecture for multilingual dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Zero resource spoken audio corpus analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Joint Learning of Phonetic Units and Word Pronunciations for ASR.

[BibT_eX]

[DOI]

Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Query understanding enhanced by hierarchical parsing structures.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

A comparison-based approach to mispronunciation detection.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Towards unsupervised speech processing.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on Information Science, 2012

On the Use of Spectral and Iterative Methods for Speaker Diarization.

[BibT_eX]

[DOI]

Stephen Shum

Jim Glass

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Automating Crowd-supervised Learning for Spoken Language Systems.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Conversational Movie Search System Based on Conditional Random Fields.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Sentence Detection Using Multiple Annotations.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Resource configurable spoken query detection using Deep Boltzmann Machines.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fast spoken query detection using lower-bound Dynamic Time Warping on Graphical Processing Units.

[BibT_eX]

[DOI]

Kiarash Adl

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Handling uncertain observations in unsupervised topic-mixture language model adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Evaluation of multi-level context-dependent acoustic model for large vocabulary speaker adaptation tasks.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A Nonparametric Bayesian Approach to Acoustic Model Discovery.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011

A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-Based Dynamic Time Warping.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Exploiting Intra-Conversation Variability for Speaker Diarization.

[BibT_eX]

[DOI]

Stephen Shum

Douglas A. Reynolds

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Growing a Spoken Language Interface on Amazon Mechanical Turk.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Efferent-Inspired Auditory Model Front-End for Speech Recognition.

[BibT_eX]

[DOI]

Oded Ghitza

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Transcription Task for Crowdsourcing with Automatic Quality Control.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Robust Voice Activity Detector for Real World Applications Using Harmonicity and Modulation Frequency.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Pronunciation Learning from Continuous Speech.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An inner-product lower-bound estimate for dynamic time warping.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

A channel-blind system for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Multi-level context-dependent acoustic modeling for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2010

Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation.

[BibT_eX]

[DOI]

Ji Ming

Comput. Speech Lang., 2010

A collective data generation method for speech language models.

[BibT_eX]

[DOI]

Sean Liu

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Spoken command of large mobile robots in outdoor environments.

[BibT_eX]

[DOI]

D. Scott Cyphers

Seth J. Teller

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Cosine Similarity Scoring without Score Normalization Techniques.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Collecting Voices from the Cloud.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Learning new word pronunciations from spoken examples.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A voice-commandable robotic forklift working alongside humans in minimally-prepared outdoor environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Towards multi-speaker unsupervised speech pattern discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Multimodal interaction with an autonomous forklift.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM/IEEE International Conference on Human Robot Interaction, 2010

2009

Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education].

[BibT_eX]

[DOI]

Douglas D. O'Shaughnessy

IEEE Signal Process. Mag., 2009

Developments and directions in speech recognition and understanding, Part 1 [DSP Education].

[BibT_eX]

[DOI]

Douglas D. O'Shaughnessy

IEEE Signal Process. Mag., 2009

Multistream Articulatory Feature-Based Models for Visual Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2009

A back-off discriminative acoustic model for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speech rhythm guided syllable nuclei detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

On the phonetic information in ultrasonic microphone signals.

[BibT_eX]

[DOI]

Bo Zhu

Proceedings of the IEEE International Conference on Acoustics, 2009

Language model parameter estimation using user transcriptions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Discriminative training of hierarchical acoustic models for large vocabulary continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Syntactic Phrase Reordering for English-to-Arabic Statistical Machine Translation.

[BibT_eX]

[DOI]

Rabih Zbib

Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

City browser: developing a conversational automotive HMI.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Human Factors in Computing Systems, 2009

Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Unsupervised Pattern Discovery in Speech.

[BibT_eX]

[DOI]

A. S. Park

IEEE Trans. Speech Audio Process., 2008

Iterative language model estimation: efficient data structure & algorithms.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A turbo-style algorithm for lexical baseforms estimation.

[BibT_eX]

[DOI]

Mesrob I. Ohannessian

Proceedings of the IEEE International Conference on Acoustics, 2008

N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Segmentation for English-to-Arabic Statistical Machine Translation.

[BibT_eX]

[DOI]

Rabih Zbib

Proceedings of the ACL 2008, 2008

2007

Robust Speaker Recognition in Noisy Conditions.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

An Implementation of Rational Wavelets and Filter Design for Phonetic Classification.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Multimodal speech recognition with ultrasonic sensors.

[BibT_eX]

[DOI]

Bo Zhu

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Recent progress in the MIT spoken lecture processing project.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

New word acquisition using subword modeling.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Noise Robust Phonetic Classificationwith Linear Regularized Least Squares and Second-Order Features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Open-Vocabulary Spoken Utterance Retrieval using Confusion Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Speech recognition with localized time-frequency pattern detectors.

[BibT_eX]

[DOI]

Ken Schutte

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Automatic lexical pronunciations generation and update.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Hierarchical large-margin Gaussian mixture models for phonetic classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Making Sense of Sound: Unsupervised Topic Segmentation over Acoustic Input.

[BibT_eX]

[DOI]

Proceedings of the ACL 2007, 2007

2006

A Novel DTW-Based Distance Measure for speaker Segmentation.

[BibT_eX]

[DOI]

Alex Park

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

A Comparative Study of Methods for Handheld Speaker Verification in Realistic Noisy Conditions.

[BibT_eX]

[DOI]

Ji Ming

Proceedings of the Odyssey 2006, 2006

Spoken Correction for Chinese Text Entry.

[BibT_eX]

[DOI]

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Unsupervised Word Acquisition from Speech using Pattern Discovery.

[BibT_eX]

[DOI]

Alex Park

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Speaker Verification Over Handheld Devices with Realistic Noisy Speech Data.

[BibT_eX]

[DOI]

Ji Ming

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Flexible Multi-Stream Framework for Speech Recognition using Multi-Tape Finite-State Transducers.

[BibT_eX]

[DOI]

I. Lee Hetherington

Han Shu

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Style & Topic Language Model Adaptation Using HMM-LDA.

[BibT_eX]

[DOI]

Proceedings of the EMNLP 2006, 2006

2005

The MIT Spoken Lecture Processing Project.

[BibT_eX]

[DOI]

Proceedings of the HLT/EMNLP 2005, 2005

Robust detection of sonorant landmarks.

[BibT_eX]

[DOI]

Ken Schutte

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Morphing spectral envelopes using audio flow.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Visual Speech Recognition with Loosely Synchronized Feature Streams.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Production domain modeling of pronunciation for visual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling.

[BibT_eX]

[DOI]

Alex Park

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Wavelet and Filter Bank Framework For Phonetic Classification.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Feature-based Pronunciation Modeling for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Feature-based pronunciation modeling with trainable asynchrony probabilities.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Articulatory features for robust visual speech recognition.

[BibT_eX]

[DOI]

Kate Saenko

Trevor Darrell

Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

A segment-based audio-visual speech recognizer: data collection, development, and initial experiments.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

A Framework for Developing Conversational User Interfaces.

[BibT_eX]

Proceedings of the Computer-Aided Design of User Interfaces IV, 2004

2003

A probabilistic framework for segment-based speech recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2003

Hidden feature models for speech recognition using dynamic Bayesian networks.

[BibT_eX]

[DOI]

Jeff A. Bilmes

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Information-theoretic criteria for unit selection synthesis.

[BibT_eX]

[DOI]

Jon R. W. Yi

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A multi-class approach for modelling out-of-vocabulary words.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Mokusei: a telephone-based Japanese conversational system in the weather domain.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Segment-based recognition on the phonebook task: initial results and observations on duration modeling.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speechbuilder: facilitating spoken dialogue system development.

[BibT_eX]

[DOI]

Eugene Weinstein

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Learning units for domain-independent out-of- vocabulary word modelling.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

JUPlTER: a telephone-based conversational interface for weather information.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2000

Guest editorial introduction to the special issue on language modeling and dialogue systems.

[BibT_eX]

[DOI]

Ronald Rosenfeld

IEEE Trans. Speech Audio Process., 2000

Conversational interfaces: advances and challenges.

[BibT_eX]

[DOI]

Proc. IEEE, 2000

A flexible, scalable finite-state transducer architecture for corpus-based concatenative speech synthesis.

[BibT_eX]

[DOI]

Jon R. W. Yi

I. Lee Hetherington

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Data collection and performance evaluation of spoken dialogue systems: the MIT experience.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Modeling out-of-vocabulary words for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Lexical modeling of non-native speech for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

Heterogeneous lexical units for automatic speech recognition: preliminary investigations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Real-time telephone-based speech recognition in the Jupiter domain.

[BibT_eX]

[DOI]

I. Lee Hetherington

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Evaluation methodology for a telephone-based conversational system.

[BibT_eX]

Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Natural-sounding speech synthesis using variable-length units.

[BibT_eX]

[DOI]

Jon R. W. Yi

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Confidence scoring for speech understanding systems.

[BibT_eX]

[DOI]

Christine Pao

Philipp Schmid

Real-time probabilistic segmentation for segment-based speech recognition.

[BibT_eX]

[DOI]

Steven C. Lee

Heterogeneous measurements and multiple classifiers for speech recognition.

[BibT_eX]

[DOI]

Andrew K. Halberstadt

Telephone-based conversational speech recognition in the JUPITER domain.

[BibT_eX]

[DOI]

1997

From interface to content: translingual access and delivery of on-line information.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

YINHE: a Mandarin Chinese version of the GALAXY system.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

MUSE: a scripting language for the development of interactive speech analysis and recognition tools.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A comparison of novel techniques for instantaneous speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Heterogeneous acoustic measurements for phonetic classification 1.

[BibT_eX]

[DOI]

Andrew K. Halberstadt

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Segmentation and modeling in segment-based recognition.

[BibT_eX]

[DOI]

Jane W. Chang

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996

Multilingual human-computer interactions: from information access to language learning.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

WHEELS: a conversational system in the automobile classifieds domain.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Telephone data collection using the world wide web.

[BibT_eX]

[DOI]

Edward Hurley

Joseph Polifroni

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A probabilistic framework for feature-based speech recognition.

[BibT_eX]

[DOI]

Jane W. Chang

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995

Multilingual spoken-language understanding in the MIT Voyager system.

[BibT_eX]

[DOI]

Speech Commun., 1995

1994

PEGASUS: A spoken dialogue interface for on-line air travel planning.

[BibT_eX]

[DOI]

Speech Communication, 1994

PEGASUS: A Spoken Language Interface for On-Line Air Travel Planning I.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology, 1994

Empirical acquisition of language models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Statistical trajectory models for phonetic recognition.

[BibT_eX]

[DOI]

William Goldenthal

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

GALAXY: a human-language interface to on-line travel information.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Multilingual language generation across multiple domains.

[BibT_eX]

[DOI]

Joseph Polifroni

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Porting the bilingual voyager system to Italian.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993

Empirical acquisition of word and phrase classes in the atis domain.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A* word network search for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Modelling spectral dynamics for vowel classification.

[BibT_eX]

[DOI]

William Goldenthal

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A bilingual Voyager system.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A comparative study of signal representations and classification techniques for speech recognition.

[BibT_eX]

[DOI]

Hong C. Leung

Benjamin Chigier

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

T]he MIT ATIS System: February 1992 Progress Report.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Collection and Analyses of WSJ-CSR Data at MIT.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Collection and analyses of WSJ-CSR corpus at MIT.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Vowel classification based on analysis-by-synthesis.

[BibT_eX]

[DOI]

Rolf Carlson

Proceedings of the Second International Conference on Spoken Language Processing, 1992

1991

Spoken language systems for human/machine interfaces.

[BibT_eX]

[DOI]

Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 1991, 3rd International Conference, Universitad Autonoma de Barcelona, Spain, April 2, 1991

Development and Preliminary Evaluation of the MIT ATIS System.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language, 1991

Modelling Context Dependency in Acoustic-Phonetic and Lexical Representations.

[BibT_eX]

[DOI]

Michael S. Phillips

Victor Zue

Proceedings of the Speech and Natural Language, 1991

The MIT ATIS system; preliminary development, spontaneous speech data collection, and performance evaluation.

[BibT_eX]

[DOI]

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Automatic learning of lexical representations for sub-word unit based speech recognition systems.

[BibT_eX]

[DOI]

Michael S. Phillips

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Integration of speech recognition and natural language processing in the MIT VOYAGER system.

[BibT_eX]

[DOI]

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

Speech database development at MIT: Timit and beyond.

[BibT_eX]

[DOI]

Victor Zue

Speech Commun., 1990

From Speech Recognition to Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 3, 1990

Phonetic Classification and Recognition Using the Multi-Layer Perceptron.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 3, 1990

Recent Progress on the SUMMIT System.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Preliminary ATIS Development at MIT.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Recent Progress on the VOYAGER System.

[BibT_eX]

[DOI]

Michael S. Phillips

Joseph Polifroni

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Recent progress on the MIT VOYAGER spoken language system.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

Detection and classification of phonemes using context-independent error back-propagation.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Spoken Language Processing, 1990

The SUMMIT speech recognition system: phonological modelling and lexical access.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

The VOYAGER speech understanding system: preliminary development and evaluation.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

The MIT Summit Speech Recognition System: a Progress Report.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Philadelphia, 1989

Preliminary Evaluation of the Voyager Spoken Language System.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989

The Voyager Speech Understanding System: A Progress Report.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989

The Collection and Preliminary Analysis of a Spontaneous Speech Database.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989

Acoustic segmentation and phonetic classification in the SUMMIT system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1989

1988

Finding acoustic regularities in speech: applications to phonetic recognition.

[BibT_eX]

[DOI]

PhD thesis, 1988

Multi-level acoustic segmentation of continuous speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1988

1986

Detection and recognition of nasal consonants in American English.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1986

1985

Detection of nasalized vowels in American English.

[BibT_eX]

[DOI]