Vikramjit Mitra

Orcid: 0000-0002-2721-3976

According to our database1, Vikramjit Mitra authored at least 79 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Model-driven Heart Rate Estimation and Heart Murmur Detection based on Phonocardiogram.
CoRR, 2024

Pre-Trained Foundation Model representations to uncover Breathing patterns in Speech.
CoRR, 2024

Investigating Salient Representations and Label Variance in Dimensional Speech Emotion Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Pre-Trained Model Representations and Their Robustness Against Noise for Speech Emotion Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Analysis and Tuning of a Voice Assistant System for Dysfluent Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

SEP-28k: A Dataset for Stuttering Event Detection from Podcasts with People Who Stutter.
Proceedings of the IEEE International Conference on Acoustics, 2021

Estimating Respiratory Rate From Breath Audio Obtained Through Wearable Microphones.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

2020
Investigation and analysis of hyper and hypo neuron pruning to selectively update neurons during unsupervised adaptation.
Digit. Signal Process., 2020

Detecting Emotion Primitives from Speech and Their Use in Discerning Categorical Emotions.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech.
Comput. Speech Lang., 2019

Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Articulatory Features for ASR of Pathological Speech.
CoRR, 2018

Articulatory Features for ASR of Pathological Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Noise Robust Acoustic to Articulatory Speech Inversion.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Interpreting DNN Output Layer Activations: A Strategy to Cope with Unseen Data in Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Articulatory Information and Multiview Features for Large Vocabulary Continuous Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition.
Speech Commun., 2017

Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend.
Comput. Speech Lang., 2017

Leveraging Deep Neural Network Activation Entropy to cope with Unseen Data in Speech Recognition.
CoRR, 2017

Joint modeling of articulatory and acoustic spaces for continuous speech recognition tasks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Speech recognition in unseen and noisy channel conditions.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Tackling unseen acoustic conditions in query-by-example search using time and frequency convolution for multilingual deep bottleneck features.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Robust Features in Deep-Learning-Based Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Toward human-assisted lexical unit discovery without text resources.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Vocal Tract Length Normalization for Speaker Independent Acoustic-to-Articulatory Speech Inversion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Unsupervised Learning of Acoustic Units Using Autoencoders and Kohonen Nets.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Speech Transcription for Low-Resource Languages - The Case of Yoloxóchitl Mixtec (Mexico).
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Coping with Unseen Data Conditions: Investigating Neural Net Architectures, Robust Features, and Information Fusion for Robust Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The SRI System for the NIST OpenSAD 2015 Speech Activity Detection Evaluation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Noise and reverberation effects on depression detection from speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A phonetically aware system for speech activity detection.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Speech-based assessment of PTSD in a military population using diverse feature classes.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Analysis of coarticulated speech using estimated articulatory trajectories.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Combating reverberation in large vocabulary continuous speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Cross-corpus depression prediction from speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Effects of feature type, learning algorithm and speaking style for depression detection from speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving robustness against reverberation for automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Time-frequency convolutional networks for robust speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Deep convolutional nets and robust features for reverberation-robust speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The SRI AVEC-2014 Evaluation System.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

Evaluating robust features on deep neural networks for speech recognition in noisy and channel mismatched conditions.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Recent improvements in SRI's keyword detection system for noisy audio.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Highly accurate phonetic segmentation using boundary correction models and system fusion.
Proceedings of the IEEE International Conference on Acoustics, 2014

Articulatory features from deep neural networks and their role in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Feature fusion for high-accuracy keyword spotting.
Proceedings of the IEEE International Conference on Acoustics, 2014

Medium-duration modulation cepstral feature for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Calibration and multiple system fusion for spoken term detection using linear logistic regression.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Automatic phonetic segmentation using boundary models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Modulation features for noise robust speaker identification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Damped oscillator cepstral coefficients for robust speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Strategies for high accuracy keyword detection in noisy channels.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improving language identification robustness to highly channel-degraded speech through multiple system fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

All for one: feature combination for highly channel-degraded speech activity detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A noise-robust system for NIST 2012 speaker recognition evaluation.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Articulatory trajectories for large-vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Using multiple versions of speech input in phone recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Normalized amplitude modulation features for large vocabulary noise-robust speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Articulatory Information for Noise Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2011

Speech inversion: Benefits of tract variables over pellet trajectories.
Proceedings of the IEEE International Conference on Acoustics, 2011

Gesture-based Dynamic Bayesian Network for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Robust speech recognition using articulatory gestures in a Dynamic Bayesian Network framework.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Retrieving Tract Variables From Acoustics: A Comparison of Different Machine Learning Strategies.
IEEE J. Sel. Top. Signal Process., 2010

A procedure for estimating gestural scores from natural speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Robust word recognition using articulatory trajectories and gestures.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Noise robustness of tract variables and their application to speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A noise-type and level-dependent MPO-based speech enhancement architecture with variable frame analysis for noise-robust speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

From acoustics to Vocal Tract time functions.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Content based audio classification: a neural network approach.
Soft Comput., 2008

Language and genre detection in audio content analysis.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Language detection in audio content analysis.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Text classification: A least square support vector machine approach.
Appl. Soft Comput., 2007

A Neural Network based Audio Content Classification.
Proceedings of the International Joint Conference on Neural Networks, 2007

2006
Lidar detection of underwater objects using a neuro-SVM-based architecture.
IEEE Trans. Neural Networks, 2006

Prior-shape-based segmentation of various objects in ultrasound images after speckle-reduction using level-set based curvature evolution.
Proceedings of the Medical Imaging 2006: Image Processing, 2006

2005
Lidar Signal Processing for Under-Water Object Detection.
Proceedings of the Advances in Neural Networks - ISNN 2005, Second International Symposium on Neural Networks, Chongqing, China, May 30, 2005


  Loading...