Bhiksha Raj

According to our database1, Bhiksha Raj authored at least 210 papers between 1995 and 2019.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2017, "For contributions to speech recognition".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Sound Event Detection in the DCASE 2017 Challenge.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Learning Sound Events from Webly Labeled Data.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Disjoint Mapping Network for Cross-modal Matching of Voices and Faces.
Proceedings of the 7th International Conference on Learning Representations, 2019

Human Behaviour Recognition Using Wifi Channel State Information.
Proceedings of the IEEE International Conference on Acoustics, 2019

Time Signal Classification Using Random Convolutional Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
AudioPairBank: towards a large-scale tag-pair-based audio content analysis.
EURASIP J. Audio, Speech and Music Processing, 2018

Speech Analytics for Medical Applications.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

Querying Depression Vlogs.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Analysing Speech for Clinical Applications.
Proceedings of the Statistical Language and Speech Processing, 2018

Classifier Risk Estimation Under Limited Labeling Resources.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018

Mining Multimodal Repositories for Speech Affecting Diseases.
Proceedings of the Interspeech 2018, 2018

Interactive Evaluation of Classifiers Under Limited Resources.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

A Corrective Learning Approach for Text-Independent Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Content-Based Representations of Audio Using Siamese Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Voice Impersonation Using Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Framework for Evaluation of Sound Event Detection in Web Videos.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Audition for multimedia computing.
Proceedings of the Frontiers of Multimedia Research, 2018

2017
A two factor transformation for speaker verification through ℓ1 comparison.
Proceedings of the 2017 IEEE Workshop on Information Forensics and Security, 2017

Inferring room semantics using acoustic monitoring.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery.
Proceedings of the Interspeech 2017, 2017

Audio Content Based Geotagging in Multimedia.
Proceedings of the Interspeech 2017, 2017

Audio event and scene recognition: A unified approach using strongly and weakly labeled data.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Supervised monaural source separation based on autoencoders.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Privacy preserving Distance computation using somewhat-trusted third parties.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Discovering sound concepts and acoustic relations in text.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An approach for self-training audio event detectors using web data.
Proceedings of the 25th European Signal Processing Conference, 2017

SphereFace: Deep Hypersphere Embedding for Face Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Topic and Prosodic Modeling for Interruption Management in Multi-User Multitasking Communication Interactions.
Proceedings of the 2017 AAAI Fall Symposia, Arlington, Virginia, USA, November 9-11, 2017, 2017

The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization.
IEEE Trans. Signal Processing, 2016

Learning Model-Based Sparsity via Projected Gradient Descent.
IEEE Trans. Information Theory, 2016

A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.
EURASIP J. Adv. Sig. Proc., 2016

Adaptation of SVM for MIL for inferring the polarity of movies and movie reviews.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Audio Event Detection using Weakly Labeled Data.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Forensic anthropometry from voice: An articulatory-phonetic approach.
Proceedings of the 39th International Convention on Information and Communication Technology, 2016

Short-term analysis for estimating physical parameters of speakers.
Proceedings of the 4th International Conference on Biometrics and Forensics, 2016

Formant manipulations in voice disguise by mimicry.
Proceedings of the 4th International Conference on Biometrics and Forensics, 2016

On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement.
Proceedings of the Interspeech 2016, 2016

Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India.
Proceedings of the Eighth International Conference on Information and Communication Technologies and Development, 2016

Weakly supervised scalable audio content analysis.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

The relationship of voice onset time and Voice Offset Time to physical age.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Crowdsourced Video Subtitling with Adaptation Based on User-Corrected Lattices.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

Detecting Psychological Distress in Adults Through Transcriptions of Clinical Interviews.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

The Best of BothWorlds: Combining Data-Independent and Data-Driven Approaches for Action Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015
Compositional Models for Audio Processing: Uncovering the structure of sound mixtures.
IEEE Signal Process. Mag., 2015

Secure Modular Hashing.
Proceedings of the 2015 IEEE International Workshop on Information Forensics and Security, 2015

Complex recurrent neural networks for denoising speech signals.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Rapid development of public health education systems in low-literacy multilingual environments: combating ebola through voice messaging.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Locality constrained transitive distance clustering on speech data.
Proceedings of the INTERSPEECH 2015, 2015

Privacy-preserving Query-by-Example Speech Search.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Reducing communication overhead in distributed learning by an order of magnitude (almost).
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A novel ranking method for multiple classifier systems.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Efficient autism spectrum disorder prediction with eye movement: A machine learning framework.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Privacy-Preserving Important Passage Retrieval.
Proceedings of the Proceeding of the 1st International Workshop on Privacy-Preserving IR: When Information Retrieval Meets Privacy and Security co-located with 37th Annual International ACM SIGIR conference, 2014

Privacy-preserving speaker verification using secure binary embeddings.
Proceedings of the 37th International Convention on Information and Communication Technology, 2014

Post-masking: a hybrid approach to array processing for speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Active-set newton algorithm for non-negative sparse coding of audio.
Proceedings of the IEEE International Conference on Acoustics, 2014

Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices.
Proceedings of the IEEE International Conference on Acoustics, 2014

Privacy-preserving speaker verification using garbled GMMS.
Proceedings of the 22nd European Signal Processing Conference, 2014

Detecting sound objects in audio recordings.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio.
IEEE Trans. Audio, Speech & Language Processing, 2013

Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models.
IEEE Trans. Audio, Speech & Language Processing, 2013

Privacy-Preserving Speech Processing: Cryptographic and String-Matching Frameworks Show Promise.
IEEE Signal Process. Mag., 2013

Greedy sparsity-constrained optimization.
J. Mach. Learn. Res., 2013

Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation.
I. J. Computer-Supported Collaborative Learning, 2013

Swara Histogram Based Structural Analysis And Identification Of Indian Classical Ragas.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

A Comparative Study Of Indian And Western Music Forms.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Secure binary embeddings of front-end factor analysis for privacy preserving speaker verification.
Proceedings of the INTERSPEECH 2013, 2013

Discriminatively trained dependency language modeling for conversational speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Ensemble approach in speaker verification.
Proceedings of the INTERSPEECH 2013, 2013

Scale independent raga identification using chromagram patterns and swara based features.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Doppler based speed estimation of vehicles using passive sensor.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Speaker tracking with spherical microphone arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013

Optimization of the DET curve in speaker verification under noisy conditions.
Proceedings of the IEEE International Conference on Acoustics, 2013

Unsupervised hierarchical structure induction for deeper semantic analysis of audio.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker verification using Secure Binary Embeddings.
Proceedings of the 21st European Signal Processing Conference, 2013

Event detection in short duration audio using Gaussian Mixture Model and Random Forest Classifier.
Proceedings of the 21st European Signal Processing Conference, 2013

A hierarchical system for word discovery exploiting DTW-based initialization.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Unsupervised word segmentation from noisy input.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Large Margin Gaussian Mixture Models with Differential Privacy.
IEEE Trans. Dependable Sec. Comput., 2012

Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors.
IEEE Signal Process. Mag., 2012

Ultrasonic Doppler Sensing in HCI.
IEEE Pervasive Computing, 2012

The Markov selection model for concurrent speech recognition.
Neurocomputing, 2012

Optimization of the DET curve in speaker verification.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Unsupervised Structure Discovery for Semantic Analysis of Audio.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Privacy-Preserving Speaker Authentication.
Proceedings of the Information Security - 15th International Conference, 2012

Language identification using spectro-temporal patch features.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Plagiarism Detection in Polyphonic Music using Monaural Signal Separation.
Proceedings of the INTERSPEECH 2012, 2012

Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia.
Proceedings of the INTERSPEECH 2012, 2012

Structured sparse coding for microphone array location calibration.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Attacking a privacy preserving music matching algorithm.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Privacy-preserving speaker verification as password matching.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Audio event detection from acoustic unit occurrence patterns.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Spectrographic seam patterns for discriminative word spotting.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

An Unsupervised Dynamic Bayesian Network Approach to Measuring Speech Style Accommodation.
Proceedings of the EACL 2012, 2012

Microphone array processing for distant speech recognition: Spherical arrays.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Microphone array processing for distant speech recognition: Towards real-world deployment.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Introduction.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

The Basics of Automatic Speech Recognition.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

The Problem of Robustness in Automatic Speech Recognition.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
Missing Data Imputation for Time-Frequency Representations of Audio Signals.
Signal Processing Systems, 2011

Efficient Protocols for Principal Eigenvector Computation over Private Data.
Trans. Data Privacy, 2011

Preface.
Speech Communication, 2011

On the combination of voice prompt suppression with maximum kurtosis beamforming.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Block-wise incremental adaptation algorithm for maximum kurtosis beamforming.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

Learning contextual relevance of audio segments using discriminative models over AUD sequences.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2011

A Comparison of Latent Variable Models For Conversation Analysis.
Proceedings of the SIGDIAL 2011 Conference, 2011

Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures.
Proceedings of the INTERSPEECH 2011, 2011

Privacy Preserving Speaker Verification Using Adapted GMMs.
Proceedings of the INTERSPEECH 2011, 2011

A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets.
Proceedings of the INTERSPEECH 2011, 2011

Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification.
Proceedings of the INTERSPEECH 2011, 2011

A paired test for recognizer selection with untranscribed data.
Proceedings of the IEEE International Conference on Acoustics, 2011

Privacy preserving probabilistic inference with Hidden Markov Models.
Proceedings of the IEEE International Conference on Acoustics, 2011

Gammatone sub-band magnitude-domain dereverberation for ASR.
Proceedings of the IEEE International Conference on Acoustics, 2011

An iterative least-squares technique for dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2011

On the implementation of a secure musical database matching.
Proceedings of the 19th European Signal Processing Conference, 2011

Maximum kurtosis beamforming with a subspace filter for distant speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

An information filter for voice prompt suppression.
Proceedings of the Conference Record of the Forty Fifth Asilomar Conference on Signals, 2011

Greedy sparsity-constrained optimization.
Proceedings of the Conference Record of the Forty Fifth Asilomar Conference on Signals, 2011

Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011

2010
Scalable Audio-Content Analysis.
EURASIP J. Audio, Speech and Music Processing, 2010

Privacy Preserving Protocols for Eigenvector Computation.
Proceedings of the Privacy and Security Issues in Data Mining and Machine Learning, 2010

Large Margin Multiclass Gaussian Classification with Differential Privacy.
Proceedings of the Privacy and Security Issues in Data Mining and Machine Learning, 2010

Multiparty Differential Privacy via Aggregation of Locally Trained Classifiers.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

The use of sense in unsupervised training of acoustic models for ASR systems.
Proceedings of the INTERSPEECH 2010, 2010

Ungrounded independent non-negative factor analysis.
Proceedings of the INTERSPEECH 2010, 2010

Non-negative matrix factorization based compensation of music for automatic speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Creating a linguistic plausibility dataset with non-expert annotators.
Proceedings of the INTERSPEECH 2010, 2010

Spectrogram dimensionality reductionwith independence constraints.
Proceedings of the IEEE International Conference on Acoustics, 2010

Synthesizing speech from Doppler signals.
Proceedings of the IEEE International Conference on Acoustics, 2010

Ultrasonic sensing for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Latent-variable decomposition based dereverberation of monaural and multi-channel signals.
Proceedings of the IEEE International Conference on Acoustics, 2010

Learning-based auditory encoding for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation.
Proceedings of the Latent Variable Analysis and Signal Separation, 2010

2009
A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain.
Proceedings of the INTERSPEECH 2009, 2009

Towards fusion of feature extraction and acoustic model training: a top down process for robust speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Deriving vocal tract shapes from electromagnetic articulograph data via geometric adaptation and matching.
Proceedings of the INTERSPEECH 2009, 2009

Probabilistic Factorization of Non-negative Data with Entropic Co-occurrence Constraints.
Proceedings of the Independent Component Analysis and Signal Separation, 2009

One-handed gesture recognition using ultrasonic Doppler sonar.
Proceedings of the IEEE International Conference on Acoustics, 2009

A joint decoding algorithm for multiple-example-based addition of words to a pronunciation lexicon.
Proceedings of the IEEE International Conference on Acoustics, 2009

Word Particles Applied to Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2009

2008
Probabilistic Latent Variable Models as Nonnegative Factorizations.
Comp. Int. and Neurosc., 2008

Regularized non-negative matrix factorization with temporal dependencies for speech denoising.
Proceedings of the INTERSPEECH 2008, 2008

Speech denoising using nonnegative matrix factorization with priors.
Proceedings of the IEEE International Conference on Acoustics, 2008

Sparse and shift-invariant feature extraction from non-negative data.
Proceedings of the IEEE International Conference on Acoustics, 2008

Ultrasonic Doppler sensor for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Analysis-by-synthesis features for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Recognizing talking faces from acoustic Doppler reflections.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

2007
Soft Mask Methods for Single-Channel Speaker Separation.
IEEE Trans. Audio, Speech & Language Processing, 2007

Ultrasonic Doppler Sensor for Voice Activity Detection.
IEEE Signal Process. Lett., 2007

An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition.
EURASIP J. Audio, Speech and Music Processing, 2007

Sparse Overcomplete Latent Variable Decomposition of Counts Data.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Probabilistic deduction of symbol mappings for extension of lexicons.
Proceedings of the INTERSPEECH 2007, 2007

Sparse Overcomplete Decomposition for Single Channel Speaker Separation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Bandwidth Expansionwith a pólya URN Model.
Proceedings of the IEEE International Conference on Acoustics, 2007

Supervised and Semi-supervised Separation of Sounds from Single-Channel Mixtures.
Proceedings of the Independent Component Analysis and Signal Separation, 2007

Sensor and Data Systems, Audio-Assisted Cameras and Acoustic Doppler Sensors.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Acoustic Doppler sonar for gait recogination.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

2006
An acoustic Doppler-Based Front End for Hands Free spoken User Interfaces.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

An integrated approach to improve speech recognition rate for non-native speakers.
Proceedings of the INTERSPEECH 2006, 2006

Latent Dirichlet Decomposition for Single Channel Speaker Separation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Voice driven applications in non-stationary and chaotic environment.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2005

Recognizing speech from simultaneous speakers.
Proceedings of the INTERSPEECH 2005, 2005

Bandwidth expansion of narrowband speech using non-negative matrix factorization.
Proceedings of the INTERSPEECH 2005, 2005

A Comparison Between Spoken Queries and Menu-Based Interfaces for In-car Digital Music Selection.
Proceedings of the Human-Computer Interaction, 2005

A Companding Front End for Noise-Robust Automatic Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Feature compensation with secondary sensor measurements for robust speech recognition.
Proceedings of the 13th European Signal Processing Conference, 2005

Speech Recognizer Based Maximum Likelihood Beamforming.
Proceedings of the Speech Separation by Humans and Machines, 2005

2004
Classification in Likelihood Spaces.
Technometrics, 2004

Likelihood-maximizing beamforming for robust hands-free speech recognition.
IEEE Trans. Speech and Audio Processing, 2004

A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition.
Speech Communication, 2004

Reconstruction of missing features for robust speech recognition.
Speech Communication, 2004

A Speech-in List-out Approach to Spoken User Interfaces.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Spokenquery: an alternate approach to chosing items with speech.
Proceedings of the INTERSPEECH 2004, 2004

Soft mask estimation for single channel speaker separation.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

A minimum mean squared error estimator for single channel speaker separation.
Proceedings of the INTERSPEECH 2004, 2004

On tracking noise with linear dynamical system models.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Classifier-based non-linear projection for adaptive endpointing of continuous speech.
Computer Speech & Language, 2003

Classification with free energy at raised temperatures.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Design of the CMU sphinx-4 decoder.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Tracking noise via dynamical systems with a continuum of states.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Lossless compression of language model structure and word identifiers.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Multi-channel source separation by factorial HMMs.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Automatic generation of subword units for speech recognition systems.
IEEE Trans. Speech and Audio Processing, 2002

The MERL SpokenQuery information retrieval system a system for retrieving pertinent documents from a spoken query.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Speech recognizer-based microphone array processing for robust hands-free speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Comparison of width-wise and length-wise language model compression.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Quantization-based language model compression.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Calibration of microphone arrays for improved speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A boosting approach for confidence scoring.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Structured redefinition of sound units by merging and splitting for improved speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Classifier-based mask estimation for missing feature methods of robust speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Reconstruction of damaged spectrographic features for robust speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Automatic generation of phone sets and lexical transcriptions.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Domain adduced state tying for cross-domain acoustic modelling.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Automatic clustering and generation of contextual questions for tied states in hidden Markov models.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Data-driven environmental compensation for speech recognition: A unified approach.
Speech Communication, 1998

Inference of missing spectrographic features for robust speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
The effects of background music on speech recognition accuracy.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Cepstral compensation by polynomial approximation for environment-independent speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A vector Taylor series approach for environment-independent speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
A unified approach for robust speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Multivariate-Gaussian-based cepstral normalization for robust speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995


  Loading...