Mathew Magimai-Doss

According to our database1, Mathew Magimai-Doss authored at least 106 papers between 2002 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition.
Speech Communication, 2019

2018
Towards weakly supervised acoustic subword unit discovery and lexicon development using hidden Markov models.
Speech Communication, 2018

SMILE Swiss German Sign Language Dataset.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Implementing Fusion Techniques for the Classification of Paralinguistic Information.
Proceedings of the Interspeech 2018, 2018

Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech.
Proceedings of the Interspeech 2018, 2018

On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs.
Proceedings of the Interspeech 2018, 2018

On Learning to Identify Genders from Raw Speech Signal Using CNNs.
Proceedings of the Interspeech 2018, 2018

Towards Directly Modeling Raw Speech Signal for Speaker Verification Using CNNS.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Long-Term Spectral Statistics for Voice Presentation Attack Detection.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

A Posterior-Based Multistream Formulation for G2P Conversion.
IEEE Signal Process. Lett., 2017

End-to-End convolutional neural network-based voice presentation attack detection.
Proceedings of the 2017 IEEE International Joint Conference on Biometrics, 2017

2016
Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework.
Speech Communication, 2016

Articulatory feature based continuous speech recognition using probabilistic lexical modeling.
Computer Speech & Language, 2016

Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery.
Proceedings of the Interspeech 2016, 2016

HMM-Based Non-Native Accent Assessment Using Posterior Features.
Proceedings of the Interspeech 2016, 2016

Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification.
Proceedings of the 2016 International Conference of the Biometrics Special Interest Group, 2016

2015
Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model.
Speech Communication, 2015

Learning linearly separable features for speech recognition using convolutional neural networks.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Objective intelligibility assessment of text-to-speech systems through utterance verification.
Proceedings of the INTERSPEECH 2015, 2015

Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities.
Proceedings of the INTERSPEECH 2015, 2015

Analysis of CNN-based speech recognition system using raw speech as input.
Proceedings of the INTERSPEECH 2015, 2015

Objective speech intelligibility assessment through comparison of phoneme class conditional probability sequences.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

An HMM-based formalism for automatic subword unit derivation and pronunciation generation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Integrated pronunciation learning for automatic speech recognition using probabilistic lexical modeling.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Convolutional Neural Networks-based continuous speech recognition using raw speech signal.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

On recognition of non-native speech using probabilistic lexical model.
Proceedings of the INTERSPEECH 2014, 2014

On modeling context-dependent clustered states: Comparing HMM/GMM, hybrid HMM/ANN and KL-HMM approaches.
Proceedings of the IEEE International Conference on Acoustics, 2014

Joint phoneme segmentation inference and classification using CRFs.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2013

A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation.
IEEE Signal Process. Lett., 2013

Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks
CoRR, 2013

End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks.
CoRR, 2013

Improving grapheme-based ASR by probabilistic lexical modeling approach.
Proceedings of the INTERSPEECH 2013, 2013

Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks.
Proceedings of the INTERSPEECH 2013, 2013

Grapheme and multilingual posterior features for under-resourced speech recognition: A study on Scottish Gaelic.
Proceedings of the IEEE International Conference on Acoustics, 2013

A probabilistic framework for multiple speaker localization.
Proceedings of the IEEE International Conference on Acoustics, 2013

Probabilistic lexical modeling and unsupervised training for zero-resourced ASR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
A Fast Parts-Based Approach to Speaker Verification Using Boosted Slice Classifiers.
IEEE Trans. Information Forensics and Security, 2012

Phase AutoCorrelation (PAC) features for noise robust speech recognition.
Speech Communication, 2012

Boosting localized binary features for speech recognition.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Combination of Sparse Classification and Multilayer Perceptron for Noise-robust ASR.
Proceedings of the INTERSPEECH 2012, 2012

Using Sparse Classification Outputs as Feature Observations for Noise-robust ASR.
Proceedings of the INTERSPEECH 2012, 2012

Template-based ASR using posterior features and synthetic references: comparing different TTS systems.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Synthetic References for Template-based ASR using posterior features.
Proceedings of the INTERSPEECH 2012, 2012

Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation.
Proceedings of the INTERSPEECH 2012, 2012

Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A TDOA Gaussian mixture model for improving acoustic source tracking.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator.
IEEE Trans. Audio, Speech & Language Processing, 2011

Analysis and Comparison of Recent MLP Features for LVCSR Systems.
Proceedings of the INTERSPEECH 2011, 2011

Hierarchical Tandem Features for ASR in Mandarin.
Proceedings of the INTERSPEECH 2011, 2011

Grapheme-Based Automatic Speech Recognition Using KL-HMM.
Proceedings of the INTERSPEECH 2011, 2011

Improving Non-Native ASR Through Stochastic Multilingual Phoneme Space Transformations.
Proceedings of the INTERSPEECH 2011, 2011

Fast speaker verification on mobile phone data using boosted slice classifiers.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

Posterior features for template-based ASR.
Proceedings of the IEEE International Conference on Acoustics, 2011

Phoneme recognition using Boosted Binary Features.
Proceedings of the IEEE International Conference on Acoustics, 2011

Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Language dependent universal phoneme posterior estimation for mixed language speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Improving Articulatory Feature and Phoneme Recognition Using Multitask Learning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011

Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
A comparative large scale study of MLP features for Mandarin ASR.
Proceedings of the INTERSPEECH 2010, 2010

Hierarchical multilayer perceptron based language identification.
Proceedings of the INTERSPEECH 2010, 2010

Towards mixed language speech recognition systems.
Proceedings of the INTERSPEECH 2010, 2010

Boosted binary features for noise-robust speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Hierarchical processing of the modulation spectrum for GALE Mandarin LVCSR system.
Proceedings of the INTERSPEECH 2009, 2009

Investigating privacy-sensitive features for speech detection in multiparty conversations.
Proceedings of the INTERSPEECH 2009, 2009

Speaker change detection with privacy-preserving audio cues.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

Volterra series for analyzing MLP based phoneme posterior estimator.
Proceedings of the IEEE International Conference on Acoustics, 2009

Non-linear mapping for multi-channel speech separation and robust overlapping spech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Posterior features applied to speech recognition tasks with user-defined vocabulary.
Proceedings of the IEEE International Conference on Acoustics, 2009

MLP based hierarchical system for task adaptation in ASR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
A Neural Network Based Regression Approach for Recognizing Simultaneous Speech.
Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

Neural network based regression for robust overlapping speech recognition using microphone arrays.
Proceedings of the INTERSPEECH 2008, 2008

Using KL-based acoustic models in a large vocabulary recognition task.
Proceedings of the INTERSPEECH 2008, 2008

Exploiting contextual information for improved phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Using comparison of parallel phoneme probability streams for OOV word detection.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

MLP-based log spectral energy mapping for robust overlapping speech recognition.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007
A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Improving speech translation with automatic boundary prediction.
Proceedings of the INTERSPEECH 2007, 2007

Cross-linguistic analysis of prosodic features for sentence segmentation.
Proceedings of the INTERSPEECH 2007, 2007

Articulatory feature classifiers trained on 2000 hours of telephone speech.
Proceedings of the INTERSPEECH 2007, 2007

Entropy Based Classifier Combination for Sentence Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop.
Proceedings of the IEEE International Conference on Acoustics, 2007

Manual Transcription of Conversational Speech at the Articulatory Feature Level.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2007

The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Juicer: A Weighted Finite-State Transducer Speech Decoder.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Threshold Selection for Unsupervised Detection, With an Application to Microphone Arrays.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
A spectrogram model for enhanced source localization and noise-robust ASR.
Proceedings of the INTERSPEECH 2005, 2005

A sector-based, frequency-domain approach to detection and localization of multiple speakers.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Speech recognition with auxiliary information.
IEEE Trans. Speech and Audio Processing, 2004

On the Adequacy of Baseform Pronunciations and Pronunciation Variants.
Proceedings of the Machine Learning for Multimodal Interaction, 2004

Modeling auxiliary features in tandem systems.
Proceedings of the INTERSPEECH 2004, 2004

Spectro-temporal activity pattern (STAP) features for noise robust ASR.
Proceedings of the INTERSPEECH 2004, 2004

Joint decoding for phoneme-grapheme continuous speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Enhancement of speech in multispeaker environment.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Using pitch frequency information in speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables.
Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing, 2002

Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition.
Proceedings of the 16th International Conference on Pattern Recognition, 2002


  Loading...