Sanjeev Khudanpur

Dan Povey

Proceedings of the 22nd International Conference on Spoken Language Translation, 2025

CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

HLTCOE Submission to the VoicePrivacy Attacker Challenge.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Target Speaker ASR with Whisper.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Whisper-UT: A Unified Translation Framework for Speech and Text.

Cihan Xiao

Debashish Chakraborty

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Scalable Controllable Accented TTS.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

CASPER: A Large Scale Spontaneous Speech Dataset.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

MADASR 2.0: Multi-Lingual Multi-Dialect ASR Challenge in 8 Indian Languages.

Srikanth S. Narayanan

Howard Lakougna

Prasanta Kumar Ghosh

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Rapidly Adapting to New Voice Spoofing: Few-Shot Detection of Synthesized Speech Under Distribution Shifts.

Ashi Garg

Zexin Cai

Henry Li Xinyuan

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

WST: Weakly Supervised Transducer for Automatic Speech Recognition.

Jian Wu

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

GenVC: Self-Supervised Zero-Shot Voice Conversion.

Zexin Cai

Henry Li Xinyuan

Ashi Garg

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024

HLTCOE JHU Submission to the Voice Privacy Challenge 2024.

CoRR, 2024

Clean Label Attacks Against SLU Systems.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

SpatialEmb: Extract and Encode Spatial Information for 1-Stage Multi-Channel Multi-Speaker ASR on Arbitrary Microphone Arrays.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Privacy Versus Emotion Preservation Trade-Offs in Emotion-Preserving Speaker Anonymization.

Zexin Cai

Henry Li Xinyuan

Ashi Garg

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

On Speaker Attribution with SURT.

Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages.

Nathaniel R. Robinson

Matthew Dean Stutzman

Bismarck Bamfo Odoom

Nathaniel Romney Robinson

Stephen D. Richardson

Kenton Murray

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

JHU IWSLT 2024 Dialectal and Low-resource System Description.

Proceedings of the 21st International Conference on Spoken Language Translation, 2024

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Enhancing Neural Transducer for Multilingual ASR with Synchronized Language Diarization.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Enhancing Code-Switching Speech Recognition With Interactive Language Biases.

Proceedings of the IEEE International Conference on Acoustics, 2024

Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora.

Shammur Absar Chowdhury

Ahmed Ali

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization.

Amir Hussein

Brian Yan

Antonios Anastasopoulos

Proceedings of the IEEE International Conference on Acoustics, 2024

Less Peaky and More Accurate CTC Forced Alignment by Label Priors.

Proceedings of the IEEE International Conference on Acoustics, 2024

ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

SURT 2.0: Advances in Transducer-Based Multi-Talker Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data.

Comput. Speech Lang., 2023

The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios.

CoRR, 2023

JHU IWSLT 2023 Multilingual Speech Translation System Description.

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

JHU IWSLT 2023 Dialect Speech Translation System Description.

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Investigating model performance in language identification: beyond simple error statistics.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

GPU-accelerated Guided Source Separation for Meeting Transcription.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Crosslingual Handwritten Text Generation Using GANs.

Chun Chieh Chang

Proceedings of the Document Analysis and Recognition - ICDAR 2023 Workshops, 2023

Reducing Language Confusion for Code-Switching Speech Recognition with Token-Level Language Diarization.

Proceedings of the IEEE International Conference on Acoustics, 2023

Building Keyword Search System from End-to-End ASR Systems.

Ruizhe Huang

Jan Trmal

Proceedings of the IEEE International Conference on Acoustics, 2023

Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings.

Proceedings of the IEEE International Conference on Acoustics, 2023

EURO: ESPnet Unsupervised ASR Open-Source Toolkit.

Proceedings of the IEEE International Conference on Acoustics, 2023

Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Learning From Flawed Data: Weakly Supervised Automatic Speech Recognition.

Dongji Gao

Hainan Xu

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Efficient Self-Supervised Learning Representations for Spoken Language Identification.

IEEE J. Sel. Top. Signal Process., 2022

Joint speaker diarization and speech recognition based on region proposal networks.

Zili Huang

Marc Delcroix

Comput. Speech Lang., 2022

Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser.

CoRR, 2022

Enhance Language Identification using Dual-mode Model with Knowledge Distillation.

CoRR, 2022

Characterizing the Details of Spatial Construction: Cognitive Constraints and Variability.

Cogn. Sci., 2022

Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition.

Amir Hussein

Shammur Absar Chowdhury

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Enhancing Language Identification Using Dual-Mode Model with Knowledge Distillation.

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

JHU IWSLT 2022 Dialect Speech Translation System Description.

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Chunking Defense for Adversarial Attacks on ASR.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification.

Andy W. H. Khong

Suzy J. Styles

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models.

Proceedings of the IEEE International Conference on Acoustics, 2022

Investigating Self-Supervised Learning for Speech Enhancement and Separation.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation.

IEEE Signal Process. Lett., 2021

Fine-Grained Activity Recognition for Assembly Videos.

IEEE Robotics Autom. Lett., 2021

Lhotse: a speech data representation library for the modern deep learning ecosystem.

CoRR, 2021

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio.

CoRR, 2021

Adversarial Attacks and Defenses for Speech Recognition Systems.

CoRR, 2021

Learning Policies for Multilingual Training of Neural Machine Translation Systems.

Gaurav Kumar

Philipp Koehn

CoRR, 2021

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap.

CoRR, 2021

Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora.

Gaurav Kumar

Philipp Koehn

Proceedings of the Sixth Conference on Machine Translation, 2021

Multi-Class Spectral Clustering with Overlaps for Speaker Diarization.

Zili Huang

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs.

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Learning Curricula for Multilingual Neural Machine Translation Training.

Gaurav Kumar

Philipp Koehn

Proceedings of the 18th Biennial Machine Translation Summit - Volume 1: Research Track, 2021

Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speaker Verification-Based Evaluation of Single-Channel Speech Separation.

Matthew Maciejewski

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

End-to-End Language Diarization for Bilingual Code-Switching Speech.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Wake Word Detection with Streaming Transformers.

Proceedings of the IEEE International Conference on Acoustics, 2021

Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step.

Proceedings of the IEEE International Conference on Acoustics, 2021

A Parallelizable Lattice Rescoring Strategy with Neural Language Models.

Ke Li

Proceedings of the IEEE International Conference on Acoustics, 2021

An Asynchronous WFST-Based Decoder for Automatic Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Frustratingly Easy Noise-aware Training of Acoustic Models.

CoRR, 2020

The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge.

Ashish Arora

Aswin Shanmugam Subramanian

CoRR, 2020

Wake Word Detection with Alignment-Free Lattice-Free MMI.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Neural Language Modeling with Implicit Cache Pointers.

Ke Li

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Efficient MDI Adaptation for n-Gram Language Models.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An Alternative to MFCCs for ASR.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR.

Xiaohui Zhang

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

An Empirical Study of Transformer-Based Neural Language Model Adaptation.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speaker Diarization with Region Proposal Network.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sample Selection for Large-scale MT Discriminative Training.

Yuan Cao

Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

2019

Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains.

Matthew Maciejewski

Gregory Sell

Yusuke Fujita

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Toward Computer Vision Systems That Understand Real-World Assembly Processes.

Jonathan D. Jones

Gregory D. Hager

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Multi-PLDA Diarization on Children's Speech.

Jiamin Xie

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network.

Fei Wu

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings.

Adithya Renduchintala

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The JHU ASR System for VOiCES from a Distance Challenge 2019.

Phani Sankar Nidadavolu

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.

Pedro A. Torres-Carrasquillo

Najim Dehak

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The JHU Speaker Recognition System for the VOiCES 2019 Challenge.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Recognition Benchmark Using the CHiME-5 Corpus.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Optical Character Recognition with Chinese and Korean Character Decomposition.

Chun-Chieh Chang

Ashish Arora

David Etter

Proceedings of the Second International Workshop on Machine Learning, 2019

Using ASR Methods for OCR.

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Speaker Recognition for Multi-speaker Conversations Using X-vectors.

Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Modeling for Overlapping Speech Recognition: JHU CHiME-5 Challenge System.

Proceedings of the IEEE International Conference on Acoustics, 2019

Bottom-Up Unsupervised Word Discovery via Acoustic Units.

Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019

Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Probing the Information Encoded in X-Vectors.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Incremental Lattice Determinization for WFST Decoders.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR.

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs.

IEEE Signal Process. Lett., 2018

Low Resource Multi-modal Data Augmentation for End-to-end ASR.

Adithya Renduchintala

CoRR, 2018

Building Corpora for Single-Channel Speech Separation Across Multiple Domains.

Matthew Maciejewski

Gregory Sell

Suryakanth V. Gangashetty

CoRR, 2018

The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection.

CoRR, 2018

A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Low-Resource Contextual Topic Identification on Speech.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving LF-MMI Using Unconstrained Supervisions for ASR.

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Spoken Language Recognition using X-vectors.

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

End-to-end Speech Recognition Using Lattice-free MMI.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

End-to-end Deep Neural Network Age Estimation.

Pegah Ghahremani

Phani Sankar Nidadavolu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic Modeling from Frequency Domain Representations of Speech.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Output-Gate Projected Gated Recurrent Unit for Speech Recognition.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A GPU-based WFST Decoder with Exact Lattice Generation.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Neural Network Language Modeling with Letter-Based Features and Importance Sampling.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Pruned RNNLM Lattice-Rescoring Algorithm for Automatic Speech Recognition.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

X-Vectors: Robust DNN Embeddings for Speaker Recognition.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Enhancement and Analysis of Conversational Speech: JSALT 2017.

Mahesh Krishnamoorthy

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Time-Restricted Self-Attention Layer for ASR.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Bayesian Models for Unit Discovery on a Very Low Resource Language.

Mark Hasegawa-Johnson

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Constraints and Development in Children's Block Construction.

Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017

A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery.

IEEE Trans. Biomed. Eng., 2017

Using of heterogeneous corpora for training of an ASR system.

CoRR, 2017

Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Backstitch: Counteracting Finite-Sample Bias via Negative Steps.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

The Kaldi OpenKWS System: Improving Low Resource Keyword Search.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Deep Neural Network Embeddings for Text-Independent Speaker Verification.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Topic Identification for Speech Without ASR.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Phone Duration Modeling for LVCSR Using Neural Networks.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An Exploration of Dropout with LSTMs.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An empirical evaluation of zero resource acoustic unit discovery.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A study on data augmentation of reverberant speech for robust speech recognition.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Topic identification of spoken documents using unsupervised acoustic unit discovery.

Santosh Kesiraju

Raghavendra Pappagari

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Characterizing spatial construction processes: Toward computational tools to understand cognition.

Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning.

Vimal Manohar

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Investigation of transfer learning for ASR using LF-MMI trained neural networks.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Getting more from automatic transcripts for semi-supervised language modeling.

Scott Novotney

Richard M. Schwartz

Comput. Speech Lang., 2016

Query-by-example surgical activity detection.

Int. J. Comput. Assist. Radiol. Surg., 2016

Deep neural network-based speaker embeddings for end-to-end speaker verification.

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification.

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Far-Field ASR Without Parallel Data.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Acoustic Modelling from the Signal Domain Using CNNs.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Unsupervised surgical data alignment with application to automatic activity annotation.

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Highway long short-term memory RNNs for distant speech recognition.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Adapting ASR for under-resourced languages using mismatched transcriptions.

Mark Hasegawa-Johnson

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Context-dependent point process models for keyword search and detection-based ASR.

Chunxi Liu

Aren Jansen

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Acoustic data-driven pronunciation lexicon generation for logographic languages.

Guoguo Chen

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging.

Xiaohui Zhang

Proceedings of the 3rd International Conference on Learning Representations, 2015

A diversity-penalizing ensemble training method for deep learning.

Xiaohui Zhang

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Modeling phonetic context with non-random forests for speech recognition.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A time delay neural network architecture for efficient modeling of long temporal contexts.

Vijayaditya Peddinti

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Reverberation robust acoustic modeling using i-vectors with time delay neural networks.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Semi-supervised maximum mutual information training of deep neural network acoustic models.

Vimal Manohar

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Audio augmentation for speech recognition.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Pronunciation and silence probability modeling for ASR.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Structured variability in acoustic realization: a corpus study of voice onset time in American English stops.

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Librispeech: An ASR corpus based on public domain audio books.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.

Sri Harish Reddy Mallidi

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation.

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

JHU ASpIRE system: Robust LVCSR with TDNNs, iVector adaptation and RNN-LMs.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Combining local and broad topic context to improve term detection.

[BibT_eX]

[DOI]

Jonathan Wintrode

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

A keyword search system using open source software.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Translations of the Callhome Egyptian Arabic corpus for conversational speech translation.

[BibT_eX]

[DOI]

Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Low-resource open vocabulary keyword search using point process models.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Improving deep neural network acoustic models using generalized maxout networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Limited resource term detection for effective topic identification of speech.

[BibT_eX]

[DOI]

Jonathan Wintrode

Proceedings of the IEEE International Conference on Acoustics, 2014

Some insights from translating conversational telephone speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

A pitch extraction algorithm tuned for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection.

[BibT_eX]

[DOI]

Jonathan Wintrode

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Online Learning in Tensor Space.

[BibT_eX]

[DOI]

Yuan Cao

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation.

[BibT_eX]

[DOI]

CoRR, 2013

String Motif-Based Description of Tool Motion for Detecting Skill and Gestures in Robotic Surgery.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2013, 2013

Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Quantifying the value of pronunciation lexicons for keyword search in low-resource languages.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Using proxies for OOV keywords in the keyword search task.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Revisiting the Case for Explicit Syntactic Information in Language Models.

[BibT_eX]

[DOI]

Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Hallucinating system outputs for discriminative language modeling.

[BibT_eX]

[DOI]

Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Sparse Hidden Markov Models for Surgical Gesture Classification and Skill Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Information Processing in Computer-Assisted Interventions, 2012

Phrasal Cohort Based Unsupervised Discriminative Language Modeling.

[BibT_eX]

[DOI]

Brian Roark

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Efficient Structured Language Modeling for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Semi-Supervised Methods for Improving Keyword Search of Unseen Terms.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Deriving conversation-based features from unlabeled speech for discriminative language modeling.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Continuous space discriminative language modeling.

[BibT_eX]

[DOI]

Emily Tucker Prud'hommeaux

Maider Lehr

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Hallucinated n-best lists for discriminative language modeling.

[BibT_eX]

[DOI]

Kenji Sagae

Maider Lehr

Emily Tucker Prud'hommeaux

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Semi-supervised discriminative language modeling for Turkish ASR.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, July 8-14, 2012, Jeju Island, Korea, 2012

2011

Stepwise Optimal Subspace Pursuit for Improving Sparse Recovery.

[BibT_eX]

[DOI]

Trac D. Tran

IEEE Signal Process. Lett., 2011

Unsupervised Arabic Dialect Adaptation with Self-Training.

[BibT_eX]

[DOI]

Scott Novotney

Richard M. Schwartz

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Dirichlet Mixture Models of neural net posteriors for HMM-based speech recognition.

[BibT_eX]

[DOI]

Garimella S. V. S. Sivaram

Proceedings of the IEEE International Conference on Acoustics, 2011

Learning and inference algorithms for partially observed structured switching vector autoregressive models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Hill climbing on speech lattices: A new rescoring framework.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Extensions of recurrent neural network language model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Variational approximation of long-span language models for LVCSR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Efficient Subsampling for Training Complex Language Models.

[BibT_eX]

[DOI]

Asela Gunawardana

Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Minimum Imputed-Risk: Unsupervised Discriminative Training for Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Randomized maximum entropy language models.

[BibT_eX]

[DOI]

Asela Gunawardana

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Adapting n-gram maximum entropy language models with conditional entropy regularization.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Efficient discriminative training of long-span language models.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Estimating document frequencies in a speech corpus.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Likelihood-Based Semi-Supervised Model Selection With Applications to Speech Processing.

[BibT_eX]

[DOI]

Christopher M. White

Patrick J. Wolfe

IEEE J. Sel. Top. Signal Process., 2010

Joshua 2.0: A Toolkit for Parsing-Based Machine Translation with Syntax, Semirings, Discriminative Training and Other Goodies.

[BibT_eX]

[DOI]

Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

A Comparative Study of Word Co-occurrence for Term Clustering in Language Model-based Sentence Retrieval.

[BibT_eX]

[DOI]

Saeedeh Momtazi

Dietrich Klakow

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Recurrent neural network based language model.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Hypothesis ranking and two-pass approaches for machine translation system combination.

[BibT_eX]

[DOI]

Jason Smith

Proceedings of the IEEE International Conference on Acoustics, 2010

Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets.

[BibT_eX]

[DOI]

Proceedings of the COLING 2010, 2010

2009

Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education].

[BibT_eX]

[DOI]

Douglas D. O'Shaughnessy

IEEE Signal Process. Mag., 2009

Developments and directions in speech recognition and understanding, Part 1 [DSP Education].

[BibT_eX]

[DOI]

Douglas D. O'Shaughnessy

IEEE Signal Process. Mag., 2009

Decoding in Joshua: Open Source, Parsing-Based Machine Translation.

[BibT_eX]

[DOI]

Prague Bull. Math. Linguistics, 2009

Joshua: An Open Source Toolkit for Parsing-Based Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Web derived pronunciations for spoken term detection.

[BibT_eX]

[DOI]

Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Efficient Extraction of Oracle-best Translations from Hypergraphs.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, May 31, 2009

Data-Derived Models for Segmentation with Application to Surgical Assessment and Training.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2009

Unsupervised estimation of the language model scaling factor.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Impact of novel sources on content-based image and video retrieval.

[BibT_eX]

[DOI]

Arnab Ghoshal

Dietrich Klakow

Proceedings of the IEEE International Conference on Acoustics, 2009

Web-derived pronunciations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Self-supervised discriminative training of statistical language models.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Variational Decoding for Statistical Machine Translation.

[BibT_eX]

[DOI]

Jason Eisner

Proceedings of the ACL 2009, 2009

Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the ACL 2009, 2009

2008

A Scalable Decoder for Parsing-Based Machine Translation with Equivalent Language Model State Maintenance.

[BibT_eX]

[DOI]

Proceedings of the Second Workshop on Syntax and Structure in Statistical Translation, 2008

Sequential system combination for machine translation of speech.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Automatic Recognition of Surgical Motions Using Statistical Modeling for Capturing Variability.

[BibT_eX]

Carol E. Reiley

Henry C. Lin

Proceedings of the Medicine Meets Virtual Reality 16, 2008

Computation of Csiszár's mutual information of order α.

[BibT_eX]

[DOI]

Carey E. Priebe

Proceedings of the 2008 IEEE International Symposium on Information Theory, 2008

An investigation of acoustic models for multilingual code-switching.

[BibT_eX]

[DOI]

Christopher M. White

James K. Baker

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Automatically learning speaker-independent acoustic subword units.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Sample selection for automatic language identification.

[BibT_eX]

[DOI]

David Farris

Christopher M. White

Proceedings of the IEEE International Conference on Acoustics, 2008

Combination of strongly and weakly constrained recognizers for reliable detection of OOVs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Large-scale Discriminative n-gram Language Models for Statistical Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008

Unsupervised Learning of Acoustic Sub-word Units.

[BibT_eX]

[DOI]

Emmanuel Dupoux

Proceedings of the ACL 2008, 2008

Machine Translation System Combination using ITG-based Alignments.

[BibT_eX]

[DOI]

Proceedings of the ACL 2008, 2008

2007

Comparing Reordering Constraints for SMT Using Efficient BLEU Oracle Computation.

[BibT_eX]

[DOI]

Markus Dreyer

Keith B. Hall

Proceedings of the NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure in Statistical Translation, 2007

Cross-Instance Tuning of Unsupervised Document Clustering Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Error Bounds and Improved Probability Estimation using the Maximum Likelihood Set.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Information Theory, 2007

Large-scale random forest language models for speech recognition.

[BibT_eX]

[DOI]

Yi Su

Frederick Jelinek

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Iterative Denoising using Jensen-Renyi Divergences with an Application to Unsupervised Document Categorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Imperial College and Johns Hopkins University at TRECVID.

[BibT_eX]

[DOI]

Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Language Modeling with the Maximum Likelihood Set: Complexity Issues and the Back-off Formula.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Symposium on Information Theory, 2006

Source Adaptation for Improved Content-Based Video Retrieval.

[BibT_eX]

[DOI]

Arnab Ghoshal

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Generative Content Models for Structural Analysis of Medical Abstracts.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Linking Natural Language and Biology, 2006

2005

Maximum Likelihood Set for Estimating a Probability Mass Function.

[BibT_eX]

[DOI]

Bruno Jedynak

Neural Comput., 2005

TRECVID 2005 Experiment at Johns Hopkins University: Using Hidden Markov Models for Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Hidden Markov models for automatic annotation and content-based retrieval of images and video.

[BibT_eX]

[DOI]

Arnab Ghoshal

Pavel Ircing

Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Joint visual-text modeling for automatic retrieval of multimedia documents.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM International Conference on Multimedia, 2005

Unsupervised classification via decision trees: an information-theoretic perspective.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Lexical triggers and latent semantic analysis for cross-lingual language model adaptation.

[BibT_eX]

[DOI]

ACM Trans. Asian Lang. Inf. Process., 2004

Pronunciation change in conversational speech and its implications for automatic speech recognition.

[BibT_eX]

[DOI]

Murat Saraçlar

Comput. Speech Lang., 2004

Mandarin-English Information (MEI): investigating translingual speech retrieval.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2004

Contemporaneous text as side-information in statistical language modeling.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2004

Improving Passage Retrieval Using Interactive Elicitation and Statistical Modeling.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth Text REtrieval Conference, 2004

A Smorgasbord of Features for Statistical Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Cross-lingual latent semantic analysis for language modeling.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Making MIRACLEs: Interactive translingual search for Cebuano and Hindi.

[BibT_eX]

[DOI]

ACM Trans. Asian Lang. Inf. Process., 2003

Transliteration of proper names in cross-language applications.

[BibT_eX]

[DOI]

Paola Virga

Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Desparately Seeking Cebuano.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Latent Semantic Information in Maximum Entropy Language Models for Conversational Speech Recognition.

[BibT_eX]

[DOI]

Yonggang Deng

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Language model adaptation using cross-lingual information.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Cross-Lingual Lexical Triggers in Statistical Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2003

Transliteration of Proper Names in Cross-Lingual Information Retrieval.

[BibT_eX]

[DOI]

Paola Virga

Proceedings of the Workshop on Multilingual and Mixed-language Named Entity Recognition, 2003

2002

Order estimation for a special class of hidden Markov sources and binary renewal processes.

[BibT_eX]

[DOI]

Prakash Narayan

IEEE Trans. Inf. Theory, 2002

Using cross-language cues for story-specific language modeling.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Building a topic-dependent maximum entropy model for very large corpora.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Mandarin-English Information: Investigating Translingual Speech Retrieval.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Human Language Technology Research, 2001

Robust Knowledge Discovery from Parallel Speech and Text Sources.

[BibT_eX]

[DOI]

Proceedings of the First International Conference on Human Language Technology Research, 2001

Smoothing issues in the structured language model.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

On large vocabulary continuous speech recognition of highly inflectional language - Czech.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Maximum entropy techniques for exploiting syntactic, semantic and collocational dependencies in language modeling.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2000

Efficient training methods for maximum entropy language modeling.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Syntactic heads in statistical language modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

Pronunciation ambiguity vs. pronunciation variability in speech recognition.

[BibT_eX]

[DOI]

Murat Saraçlar

Proceedings of the IEEE International Conference on Acoustics, 2000

Towards language independent acoustic modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Stochastic pronunciation modelling from hand-labelled phonetic corpora.

[BibT_eX]

[DOI]

Speech Commun., 1999

Large Vocabulary Speech Recognition for Read and Broadcast Czech.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

Combining nonlocal, syntactic and n-gram dependencies in language modeling.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Pronunciation modeling by sharing gaussian densities across phonetic models.

[BibT_eX]

[DOI]

Murat Saraçlar

Harriet J. Nock

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A maximum entropy language model integrating N-grams and topic dependencies for conversational speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition.

[BibT_eX]

[DOI]

Ashvin Kannan

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Rapid speech recognizer adaptation to new speakers.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

LVCSR rescoring with modified loss functions: a decision theoretic perspective.

[BibT_eX]

[DOI]

Vaibhava Goel

William Byrne