Bhuvana Ramabhadran

Orcid: 0000-0002-8049-2345

According to our database1, Bhuvana Ramabhadran authored at least 232 papers between 1998 and 2024.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2017, "For contributions to speech recognition and language processing".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data.
CoRR, 2024

2023
Twenty-Five Years of Evolution in Speech and Language Processing.
IEEE Signal Process. Mag., July, 2023

O-1: Self-training with Oracle and 1-best Hypothesis.
CoRR, 2023

Using Text Injection to Improve Recognition of Personal Identifiers in Speech.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

Robust Knowledge Distillation from RNN-T Models with Noisy Training Labels Using Full-Sum Loss.
Proceedings of the IEEE International Conference on Acoustics, 2023

Understanding Shared Speech-Text Representations.
Proceedings of the IEEE International Conference on Acoustics, 2023

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Large-Scale Language Model Rescoring on Long-Form Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

Modular Conformer Training for Flexible End-to-End ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Mask-Conformer: Augmenting Conformer with Mask-Predict Decoder.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2022

Ask2Mask: Guided Data Selection for Masked Speech Modeling.
IEEE J. Sel. Top. Signal Process., 2022

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data.
CoRR, 2022

G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Modular Hybrid Autoregressive Transducer.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Non-Parallel Voice Conversion for ASR Augmentation.
Proceedings of the Interspeech 2022, 2022

Improving Rare Word Recognition with LM-aware MWER Training.
Proceedings of the Interspeech 2022, 2022

On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer.
Proceedings of the Interspeech 2022, 2022

MAESTRO: Matched Speech Text Representations through Modality Matching.
Proceedings of the Interspeech 2022, 2022

Reducing Domain mismatch in Self-supervised speech pre-training.
Proceedings of the Interspeech 2022, 2022

Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2022

Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Regularizing Word Segmentation by Creating Misspellings.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Self-Adaptive Distillation for Multilingual Speech Recognition: Leveraging Student Independence.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Mixture Model Attention: Flexible Streaming and Non-Streaming Automatic Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Convolutional Dropout and Wordpiece Augmentation for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Mixture of Informed Experts for Multilingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Extending Parrotron: An End-to-End, Speech Conversion and Speech Recognition Model for Atypical Speech.
Proceedings of the IEEE International Conference on Acoustics, 2021

Injecting Text in Self-Supervised Speech Pretraining.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
LSTM Acoustic Models Learn to Align and Pronounce with Graphemes.
CoRR, 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior.
CoRR, 2020

Multilingual Speech Recognition with Self-Attention Structured Parameterization.
Proceedings of the Interspeech 2020, 2020

SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR.
Proceedings of the Interspeech 2020, 2020

Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection.
Proceedings of the Interspeech 2020, 2020

Improving Speech Recognition Using Consistent Predictions on Synthesized Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Neural Oracle Search on N-BEST Hypotheses.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Language-Agnostic Multilingual Modeling.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning.
Proceedings of the Interspeech 2019, 2019

Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model.
Proceedings of the Interspeech 2019, 2019

Comparison of Data Augmentation and Adaptation Strategies for Code-switched Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Recognition with Augmented Synthesized Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Transliteration Based Approaches to Improve Code-Switched Speech Recognition Performance.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Open Problems in Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Data Augmentation Improves Recognition of Foreign Accented Speech.
Proceedings of the Interspeech 2018, 2018

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Measuring the Effect of Linguistic Resources on Prosody Modeling for Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Whole Sentence Neural Language Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Building Competitive Direct Acoustics-to-Word Models for English Conversational Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q.
IEEE Trans. Parallel Distributed Syst., 2017

Introduction to the Special Issue on End-to-End Speech and Language Processing.
IEEE J. Sel. Top. Signal Process., 2017

End-to-End ASR-Free Keyword Search From Speech.
IEEE J. Sel. Top. Signal Process., 2017

Recent progress in deep end-to-end models for spoken language processing.
IBM J. Res. Dev., 2017

Symbol Sequence Search from Telephone Conversation.
Proceedings of the Interspeech 2017, 2017

English Conversational Telephone Speech Recognition by Humans and Machines.
Proceedings of the Interspeech 2017, 2017

Bias and Statistical Significance in Evaluating Speech Synthesis with Mean Opinion Scores.
Proceedings of the Interspeech 2017, 2017

Weakly-Supervised Phrase Assignment from Text in a Speech-Synthesis System Using Noisy Labels.
Proceedings of the Interspeech 2017, 2017

Empirical Exploration of Novel Architectures and Objectives for Language Models.
Proceedings of the Interspeech 2017, 2017

Fast Neural Network Language Model Lookups at N-Gram Speeds.
Proceedings of the Interspeech 2017, 2017

Efficient Knowledge Distillation from an Ensemble of Teachers.
Proceedings of the Interspeech 2017, 2017

Direct Acoustics-to-Word Models for English Conversational Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Network architectures for multilingual speech representation learning.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

End-to-end speech recognition and keyword search on low-resource languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Harmonic feature fusion for robust neural network-based acoustic modeling.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Effective joint training of denoising feature space transforms and Neural Network based acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Voice-transformation-based data augmentation for prosodic classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Knowledge distillation across ensembles of multilingual models for low-resource languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Training variance and performance evaluation of neural networks in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Language modeling with highway LSTM.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Invariant Representations for Noisy Speech Recognition.
CoRR, 2016

Multilingual Data Selection for Low Resource Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings.
Proceedings of the Interspeech 2016, 2016

Acoustic Modeling Using Bidirectional Gated Recurrent Convolutional Units.
Proceedings of the Interspeech 2016, 2016

Using continuous lexical embeddings to improve symbolic-prosody prediction in a text-to-speech front-end.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Efficient one-vs-one kernel ridge regression for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Semantic word embedding neural network language models for automatic speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep Convolutional Neural Networks for Large-scale Speech Tasks.
Neural Networks, 2015

Diverse Embedding Neural Network Language Models.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Modeling phrasing and prominence using deep recurrent learning.
Proceedings of the INTERSPEECH 2015, 2015

Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system.
Proceedings of the INTERSPEECH 2015, 2015

A multi-region deep neural network model in speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Efficient GPU implementation of convolutional neural networks for speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Unnormalized exponential and neural network language models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Bidirectional recurrent neural network language models for automatic speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Multilingual representations for low resource speech recognition and keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Converting Neural Network Language Models into Back-off Language Models for Efficient Decoding in Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Editorial for the special issue on spoken content retrieval.
Comput. Speech Lang., 2014

Deep scattering spectra with deep neural networks for LVCSR tasks.
Proceedings of the INTERSPEECH 2014, 2014

Parallel deep neural network training for LVCSR tasks using blue gene/Q.
Proceedings of the INTERSPEECH 2014, 2014

Prosody contour prediction with long short-term memory, bi-directional, deep recurrent neural networks.
Proceedings of the INTERSPEECH 2014, 2014

Exploiting vocal-source features to improve ASR accuracy for low-resource languages.
Proceedings of the INTERSPEECH 2014, 2014

Recent improvements in neural network acoustic modeling for LVCSR in low resource languages.
Proceedings of the INTERSPEECH 2014, 2014

Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA babel program.
Proceedings of the INTERSPEECH 2014, 2014

Dictionary-based pitch tracking with dynamic programming.
Proceedings of the INTERSPEECH 2014, 2014

Static interpolation of exponential n-gram models using features of features.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improvements to filterbank and delta learning within a deep neural network framework.
Proceedings of the IEEE International Conference on Acoustics, 2014

Deep Scattering Spectrum with deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Kernel methods match Deep Neural Networks on TIMIT.
Proceedings of the IEEE International Conference on Acoustics, 2014

Automatic keyword selection for keyword search development and tuning.
Proceedings of the IEEE International Conference on Acoustics, 2014

Semi-supervised term-weighted value rescoring for keyword search.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks.
IEEE Trans. Speech Audio Process., 2013

Improving training time of Hessian-free optimization for deep neural networks using preconditioning and sampling.
CoRR, 2013

Generalized Ambiguity Decomposition for Understanding Ensemble Diversity.
CoRR, 2013

Deep convolutional neural networks for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Low-rank matrix factorization for Deep Neural Network training with high-dimensional output targets.
Proceedings of the IEEE International Conference on Acoustics, 2013

An evaluation of posterior modeling techniques for phonetic recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

System combination and score normalization for spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

A high-performance Cantonese keyword search system.
Proceedings of the IEEE International Conference on Acoustics, 2013

F0 contour prediction with a deep belief network-Gaussian process hybrid model.
Proceedings of the IEEE International Conference on Acoustics, 2013

Developing speech recognition systems for corpus indexing under the IARPA Babel program.
Proceedings of the IEEE International Conference on Acoustics, 2013

Joint training of interpolated exponential n-gram models.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

An empirical study of confusion modeling in keyword search for low resource languages.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Learning filter banks within a deep neural network framework.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Improvements to Deep Convolutional Neural Networks for LVCSR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Exemplar-Based Processing for Speech Recognition: An Overview.
IEEE Signal Process. Mag., 2012

Trends in Speech and Language Processing [In the Spotlight].
IEEE Signal Process. Mag., 2012

Acoustically discriminative language model training with pseudo-hypothesis.
Speech Commun., 2012

Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech.
Speech Commun., 2012

Deep Neural Network Language Models.
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Enhancing Exemplar-Based Posteriors for Speech Recognition Tasks.
Proceedings of the INTERSPEECH 2012, 2012

Phrase Boundary Assignment from Text in Multiple Domains.
Proceedings of the INTERSPEECH 2012, 2012

Auto-encoder bottleneck features using deep belief networks.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improved pre-training of Deep Belief Networks using Sparse Encoding Symmetric Machines.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

N-best entropy based data selection for acoustic modeling.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Constructing ensembles of dissimilar acoustic models using hidden attributes of training data.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Prediction of F0 contours from symbolic and numerical variables using continuous conditional random fields.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Creating ensemble of diverse maximum entropy models.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Trends and advances in speech recognition.
IBM J. Res. Dev., 2011

Performance prediction and shrinking language models.
Proceedings of the 2011 Symposium on Machine Learning in Speech and Language Processing, 2011

Shrinkage-Based Features for Natural Language Call Routing.
Proceedings of the INTERSPEECH 2011, 2011

Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

"What is... Dengue Fever?" - Modeling and Predicting Pronunciation Errors in a Text-to-Speech System.
Proceedings of the INTERSPEECH 2011, 2011

Clustering with Modified Cosine Distance Learned from Constraints.
Proceedings of the INTERSPEECH 2011, 2011

Improved Spoken Query Transcription Using Co-Occurrence Information.
Proceedings of the INTERSPEECH 2011, 2011

Convergence of Line Search A-Function Methods.
Proceedings of the INTERSPEECH 2011, 2011

Feature Combination Approaches for Discriminative Language Models.
Proceedings of the INTERSPEECH 2011, 2011

Application specific loss minimization using gradient boosting.
Proceedings of the IEEE International Conference on Acoustics, 2011

Speech processing and retrieval in a personal memory aid system for the elderly.
Proceedings of the IEEE International Conference on Acoustics, 2011

Distributed training of large scale exponential language models.
Proceedings of the IEEE International Conference on Acoustics, 2011

Deep belief nets for natural language call-routing.
Proceedings of the IEEE International Conference on Acoustics, 2011

Exemplar-based Sparse Representation phone identification features.
Proceedings of the IEEE International Conference on Acoustics, 2011

Hill climbing on speech lattices: A new rescoring framework.
Proceedings of the IEEE International Conference on Acoustics, 2011

Deep Belief Networks using discriminative features for phone recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

A-Functions: A generalization of Extended Baum-Welch transformations to convex optimization.
Proceedings of the IEEE International Conference on Acoustics, 2011

Exploiting active-learning strategies for annotating prosodic events with limited labeled data.
Proceedings of the IEEE International Conference on Acoustics, 2011

Frame-level AnyBoost for LVCSR with the MMI Criterion.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

A convex hull approach to sparse representations for exemplar-based speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Making Deep Belief Networks effective for large vocabulary continuous speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Pruning exponential language models.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Unsupervised Model Adaptation using Information-Theoretic Criterion.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Data selection for language modeling using sparse representations.
Proceedings of the INTERSPEECH 2010, 2010

Impact of word classing on shrinkage-based language models.
Proceedings of the INTERSPEECH 2010, 2010

Sparse representation features for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Sparse representations for text categorization.
Proceedings of the INTERSPEECH 2010, 2010

An analysis of sparseness and regularization in exemplar-based methods for speech classification.
Proceedings of the INTERSPEECH 2010, 2010

Incorporating sparse representation phone identification features in automatic speech recognition using exponential families.
Proceedings of the INTERSPEECH 2010, 2010

Discriminative training and unsupervised adaptation for labeling prosodic events with limited training data.
Proceedings of the INTERSPEECH 2010, 2010

Techniques for topic detection based processing in spoken dialog systems.
Proceedings of the INTERSPEECH 2010, 2010

An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

Continuous space language modeling techniques.
Proceedings of the IEEE International Conference on Acoustics, 2010

Bayesian compressive sensing for phonetic classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Balancing false alarms and hits in Spoken Term Detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

Improved language modeling for conversational applications using sentence quality.
Proceedings of the IEEE International Conference on Acoustics, 2010

The Use of isometric transformations and bayesian estimation in compressive sensing for fMRI classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Kalman filtering for compressed sensing.
Proceedings of the 13th Conference on Information Fusion, 2010

2009
An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation.
IEEE Trans. Speech Audio Process., 2009

Web derived pronunciations for spoken term detection.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Fast decoding for open vocabulary spoken term detection.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Cultural voice markers in speech-to-speech machine translation systems.
Proceedings of the 2009 international workshop on Intercultural collaboration, 2009

Multimodal Classification of Activities of Daily Living Inside Smart Homes.
Proceedings of the Distributed Computing, 2009

Iterative sentence-pair extraction from quasi-parallel corpora for machine translation.
Proceedings of the INTERSPEECH 2009, 2009

Towards using hybrid word and fragment units for vocabulary independent LVCSR systems.
Proceedings of the INTERSPEECH 2009, 2009

Unsupervised pronunciation validation.
Proceedings of the IEEE International Conference on Acoustics, 2009

A new method for OOV detection using hybrid word/fragment system.
Proceedings of the IEEE International Conference on Acoustics, 2009

A generalized family of parameter estimation techniques.
Proceedings of the IEEE International Conference on Acoustics, 2009

Effect of pronounciations on OOV queries in spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2009

Map approach to learning sparse Gaussian Markov networks.
Proceedings of the IEEE International Conference on Acoustics, 2009

An exploration of large vocabulary tools for small vocabulary phonetic recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Constrained discriminative training of N-gram language models.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Query-by-example Spoken Term Detection For OOV terms.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Scaling shrinkage-based language models.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Bag-of-word normalized n-gram models.
Proceedings of the INTERSPEECH 2008, 2008

Phonetic query expansion for spoken document retrieval.
Proceedings of the INTERSPEECH 2008, 2008

Generalization of extended baum-welch parameter estimation for discriminative training and decoding.
Proceedings of the INTERSPEECH 2008, 2008

A study of unsupervised clustering techniques for language modeling.
Proceedings of the INTERSPEECH 2008, 2008

Gradient steepness metrics using extended Baum-Welch transformations for universal pattern recognition tasks.
Proceedings of the IEEE International Conference on Acoustics, 2008

Boosted MMI for model and feature-space discriminative training.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Automatic exploration of corpus-specific properties for expressive text-to-speech: a case study in emphasis.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Vocabulary independent spoken term detection.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Data Driven Approach for Language Model Adaptation using Stepwise Relative Entropy Minimization.
Proceedings of the IEEE International Conference on Acoustics, 2007

Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

The IBM 2007 speech transcription system for European parliamentary speeches.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Fast audio search using vector space modelling.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
On the Effect Ofword Error Rate on Automated Quality Monitoring.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Automated Quality Monitoring for Call Centers using Speech and NLP Technologies.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

The IBM 2006 speech transcription system for european parliamentary speeches.
Proceedings of the INTERSPEECH 2006, 2006

Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Exploiting large quantities of spontaneous speech for unsupervised training of acoustic models.
Proceedings of the INTERSPEECH 2005, 2005

Contructing Ensembles of ASR Systems Using Randomized Decision Trees.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Automatic recognition of spontaneous speech for access to multilingual oral history archives.
IEEE Trans. Speech Audio Process., 2004

Building an information retrieval test collection for spontaneous conversational speech.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

Speech recognition error analysis on the English MALACH corpus.
Proceedings of the INTERSPEECH 2004, 2004

Measuring convergence in language model estimation using relative entropy.
Proceedings of the INTERSPEECH 2004, 2004

Use of metadata to improve recognition of spontaneous speech and named entities.
Proceedings of the INTERSPEECH 2004, 2004

2003
Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automated transcription and topic segmentation of large spoken archives.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Towards automatic transcription of large spoken archives - English ASR for the MALACH project.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Cross-Language Access to Recorded Speech in the MALACH Project.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Supporting access to large digital oral history archives.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

Access to large spoken archives: Uses and technology. Sponsored by SIG VIS.
Proceedings of the Information, Connetcitons and Community, 2002

2001
Current status of the IBM Trainable Speech Synthesis System.
Proceedings of the 4th ITRW on Speech Synthesis, Perthshire, Scotland, UK, August 29, 2001

Innovative approaches for large vocabulary name recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001


2000
Dynamic selection of feature spaces for robust speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Decision tree based rate of speech modeling for speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Enhanced likelihood computation using regression.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Acoustics-based baseform generation with pronunciation and/or phonotactic models.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Phonological rules for enhancing acoustic enrollment of unknown words.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speech recognition performance on a new voicemail transcription task.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Factor analysis invariant to linear transformations of data.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Acoustics-only based automatic phonetic baseform generation.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Speech recognition performance on a voicemail transcription task.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998


  Loading...