Prasanta Kumar Ghosh

Proceedings of the National Conference on Communications, 2025

A Correlation Profile-Based Adaptive Weighing in Mel-DCT Filter Banks for Voice Activity Detection.

Proceedings of the National Conference on Communications, 2025

Comparison of Acoustic and Textual Features for Dysarthria Severity Classification in Amyotrophic Lateral Sclerosis.

Y. S. Upendra Vishwanath

Deekshitha G

Madassu Keerthipriya

Darshan Chikktimmegowda

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Boosting StoRM Convergence with Metric Guidance and Non-uniform State-Sampling for Optimal Dereverberation.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

A real-time MRI study on asymmetry in velum dynamics during VCV production with nasal sounds.

Chetan Sharma

Vaishnavi Chandwanshi

Shreya Shrikant Karkun

Aditya Anand Gupta

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

An approach to measuring the performance of Automatic Speech Recognition(ASR) models in the context of Large Language Model(LLM) powered applications.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Jointly Improving Dialect Identification and ASR in Indian Languages using Multimodal Feature Fusion.

Saurabh Kumar

Amartyaveer

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Enhancing Acoustic-to-Articulatory Inversion with Multi-Target Pretraining for Low-Resource Settings.

Jesuraj Bandekar

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Role of the Pretraining and the Adaptation data sizes for low-resource real-time MRI video segmentation.

Masoud Thajudeen Tholan

Vinayaka Hegde

Chetan Sharma

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Improving Dialect Identification in Indian Languages Using Multimodal Features from Dialect Informed ASR.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MADASR 2.0: Multi-Lingual Multi-Dialect ASR Challenge in 8 Indian Languages.

Srikanth S. Narayanan

Howard Lakougna

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

Bottleneck Transformer-Based Approach for Improved Automatic STOI Score Prediction.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024

Low Complexity Model with Single Dimensional Feature for Speech Based Classification of Amyotrophic Lateral Sclerosis Patients and Healthy Individuals.

Proceedings of the International Conference on Signal Processing and Communications, 2024

IndicMOS: Multilingual MOS Prediction for 7 Indian languages.

Soumi Maiti

Abhayjeet Singh

Savitha Murthy

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Adapter pre-training for improved speech recognition in unseen domains using low resource adapter tuning of self-supervised models.

Priyanka Pai

Raoul Nanavati

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

A comparative study of the impact of voiceless alveolar and palato-alveolar sibilants in English on lip aperture and protrusion during VCV production.

Chetan Sharma

Vaishnavi Chandwanshi

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Exploring Syllable Discriminability during Diadochokinetic Task with Increasing Dysarthria Severity for Patients with Amyotrophic Lateral Sclerosis.

Neelesh Samptur

Anirudh Chakravarty K

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Articulatory synthesis using representations learnt through phonetic label-aware contrastive loss.

Jesuraj Bandekar

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

An Unsupervised Segmentation of Vocal Breath Sounds.

Proceedings of the IEEE International Conference on Acoustics, 2024

LIMMITS'24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning.

Mark Hasegawa-Johnson

Philipp Olbrich

Proceedings of the IEEE International Conference on Acoustics, 2024

Spectral Analysis of Vowels and Fricatives at Varied Levels of Dysarthria Severity for Amyotrophic Lateral Sclerosis.

Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Wav2vec 2.0 Model for Heart Sound Analysis.

Alex Paul Kamson

Akshay V. Sawant

Satish S. Jeevannavar

Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024

2023

Model Adaptation for ASR in low-resource Indian Languages.

CoRR, 2023

An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language.

Abhayjeet Singh

Arjun Singh Mehta

Ashish Khuraishi K. S

Sai Praneeth Reddy Mora

Proceedings of the Speech and Computer - 25th International Conference, 2023

An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language.

Proceedings of the Speech and Computer - 25th International Conference, 2023

Study of Indian English Pronunciation Variabilities Relative to Received Pronunciation.

Proceedings of the Speech and Computer - 25th International Conference, 2023

Curriculum Learning Based Approach for Faster Convergence of TTS Model.

Navneet Kaur

Proceedings of the Speech and Computer - 25th International Conference, 2023

Can the decoded text from automatic speech recognition effectively detect spoken grammar errors?

Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

SPIRE-SIES: A Spontaneous Indian English Speech Corpus.

Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Do Vocal Breath Sounds Encode Gender Cues for Automatic Gender Classification?

Mohammad Shaique Solanki

Ashutosh Bharadwaj

Jeevan Kylash

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Classification of Multi-class Vowels and Fricatives From Patients Having Amyotrophic Lateral Sclerosis with Varied Levels of Dysarthria Severity.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence.

Siddarth Chandrasekar

Arvind Ramesh

Tilak Purohit

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels.

Varun Belagali

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion.

Jesuraja Bandekar

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improved Acoustic-to-Articulatory Inversion Using Representations from Pretrained Self-Supervised Learning Models.

C. Siddarth

Proceedings of the IEEE International Conference on Acoustics, 2023

Real-Time MRI Video Synthesis from Time Aligned Phonemes with Sequence-to-Sequence Networks.

Proceedings of the IEEE International Conference on Acoustics, 2023

Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech.

Mark Hasegawa-Johnson

Philipp Olbrich

Proceedings of the IEEE International Conference on Acoustics, 2023

Static and Dynamic Source and Filter Cues for Classification of Amyotrophic Lateral Sclerosis Patients and Healthy Subjects.

Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring the Role of Fricatives in Classifying Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis and Parkinson's Disease.

Proceedings of the IEEE International Conference on Acoustics, 2023

Gated Multi Encoders and Multitask Objectives for Dialectal Speech Recognition in Indian Languages.

Raoul Nanavati

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Automatic syllable stress detection under non-parallel label and data condition.

Speech Commun., 2022

A deteriorating food preservation supply chain model with downstream delayed payment and upstream partial prepayment.

RAIRO Oper. Res., 2022

Study of Indian English Pronunciation Variabilities relative to Received Pronunciation.

CoRR, 2022

Voistutor 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners.

Priyanshi Pal

Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

Whisper to Neutral Mapping Using I-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification.

Proceedings of the 27th National Conference on Communications, 2022

Streaming model for Acoustic to Articulatory Inversion with transformer networks.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Watch Me Speak: 2D Visualization of Human Mouth during Speech.

C. Siddarth

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production.

Anwesha Roy

Varun Belagali

Lodagala V. S. V. Durga Prasad

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi.

Adithya Raj Kolladath

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SegNet-Based Deep Representation Learning for Dysphagia Classification.

Siddharth Subramani

Anwesha Roy

Prasanna Suresh Hegde

Proceedings of the IEEE International Conference on Acoustics, 2022

An Error Correction Scheme for Improved Air-Tissue Boundary in Real-Time MRI Video for Speech Production.

Anwesha Roy

Varun Belagali

Proceedings of the IEEE International Conference on Acoustics, 2022

Dual Attention Pooling Network for Recording Device Classification Using Neutral and Whispered Speech.

Bhavuk Singhal

Proceedings of the IEEE International Conference on Acoustics, 2022

The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis.

Aanish Nair

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

A Robust Speaking Rate Estimator Using a CNN-BLSTM Network.

Circuits Syst. Signal Process., 2021

A deep neural network based correction scheme for improved air-tissue boundary prediction in real-time magnetic resonance imaging video.

Comput. Speech Lang., 2021

Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms.

Srikanth Raj Chetupalli

CoRR, 2021

Multilingual and code-switching ASR challenges for low resource Indian languages.

CoRR, 2021

wSPIRE: A Parallel Multi-Device Corpus in Neutral and Whispered Speech.

Bhavuk Singhal

Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

A Study on Native American English Speech Recognition by Indian Listeners with Varying Word Familiarity Level.

Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

SPIRE VCV: An Acoustic-Articulatory Corpus with Three Different Speaking Rates.

Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

Convolutional Dense Neural Network Based Spirometry Variable FVC Prediction Using Sustained Phonations.

Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Noise Robust Pitch Stylization Using Minimum Mean Absolute Error Criterion.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Web Interface for Estimating Articulatory Movements in Speech Production from Acoustics and Text.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Estimating Articulatory Movements in Speech Production with Transformer Networks.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

DiCOVA Challenge: Dataset, Task, and Baseline System for COVID-19 Diagnosis Using Acoustics.

Srikanth Raj Chetupalli

Sriram Ganapathy

Shreyas Ramoji

Viral Nanda

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Source and Vocal Tract Cues for Speech-Based Classification of Patients with Parkinson's Disease and Healthy Subjects.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Impact of Speaking Rate on the Source Filter Interaction in Speech: A Study.

Tilak Purohit

Preethish-Kumar Veeramani

Proceedings of the IEEE International Conference on Acoustics, 2021

Acoustic-to-Articulatory Inversion for Dysarthric Speech by Using Cross-Corpus Acoustic-Articulatory Data.

Sarthak Kumar Maharana

Proceedings of the IEEE International Conference on Acoustics, 2021

Effect of Noise and Model Complexity on Detection of Amyotrophic Lateral Sclerosis and Parkinson's Disease Using Pitch and MFCC.

Proceedings of the IEEE International Conference on Acoustics, 2021

Role of breath phase and breath boundaries for the classification between asthmatic and healthy subjects.

Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Noise Robust Detection of Fundamental Heart Sound using Parametric Mixture Gaussian and Dynamic Programming.

Shailesh BG

Drishti Ramesh Megalmani

Satish S. Jeevannavar

Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Unsegmented Heart Sound Classification Using Hybrid CNN-LSTM Neural Networks.

Drishti Ramesh Megalmani

Shailesh B. G

Satish S. Jeevannavar

Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

2020

SFNet: A Computationally Efficient Source Filter Model Based Neural Speech Synthesis.

IEEE Signal Process. Lett., 2020

The impact of speaking rate on acoustic-to-articulatory inversion.

Comput. Speech Lang., 2020

Speech task based automatic classification of ALS and Parkinson's Disease and their severity using log Mel spectrograms.

Proceedings of the International Conference on Signal Processing and Communications, 2020

Speech rate estimation using representations learned from speech with convolutional neural network.

Proceedings of the International Conference on Signal Processing and Communications, 2020

Attention and Encoder-Decoder Based Models for Transforming Articulatory Movements at Different Speaking Rates.

Abhayjeet Singh

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Coswara - A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis.

Srikanth Raj Chetupalli

Nirmala R.

Sriram Ganapathy

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An Investigation of the Virtual Lip Trajectories During the Production of Bilabial Stops and Nasal at Different Speaking Rates.

Tilak Purohit

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Whisper Activity Detection Using CNN-LSTM Based Attention Pooling Network Trained for a Speaker Identification Task.

Malla Satyapriya

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speech Rate Task-Specific Representation Learning from Acoustic-Articulatory Data.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Air-Tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using 3-D Convolutional Neural Network.

Navaneetha Gaddam

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Raw Speech Waveform Based Classification of Patients with ALS, Parkinson's Disease and Healthy Controls Using CNN-BLSTM.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker Conditioned Acoustic-to-Articulatory Inversion Using x-Vectors.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks.

Veeramani Priyadharshini

Prakash T. K.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Analysis of Acoustic Features for Speech Sound Based Classification of Asthmatic and Healthy Subjects.

Merugu Keerthana

Sanjeev Kadagathur Vadiraj

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Automatic Identification of Speakers From Head Gestures in a Narration.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Automatic Classification of Volumes of Water Using Swallow Sounds from Cervical Auscultation.

Siddharth Subramani

Divya Giridhar

Prasanna Suresh Hegde

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Comparative Study of Estimating Articulatory Movements from Phoneme Sequences and Acoustic Features.

Abhayjeet Singh

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Pseudo Likelihood Correction Technique for Low Resource Accented ASR.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Voice based classification of patients with Amyotrophic Lateral Sclerosis, Parkinson's Disease and Healthy Controls with CNN-LSTM using transfer learning.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Glottal Inverse Filtering Using Probabilistic Weighted Linear Prediction.

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing.

Anurendra Kumar

Tanaya Guha

IEEE ACM Trans. Audio Speech Lang. Process., 2019

P- and T-wave delineation in ECG signals using parametric mixture Gaussian and dynamic programming.

Prakhar Gupta

Biomed. Signal Process. Control., 2019

Comparison of automatic syllable stress detection quality with time-aligned boundaries and context dependencies.

Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

voisTUTOR: Virtual Operator for Interactive Spoken English TUTORing.

Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

Noise robust goodness of pronunciation measures using teacher's utterance.

Sweekar Sudhakara

Anurag Das

Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

Automatic assessment of pronunciation and its dependent factors by exploring their interdependencies using DNN and LSTM.

Aparna Srinivasan

Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019

voisTUTOR corpus: A speech corpus of Indian L2 English learners for pronunciation assessment.

Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations.

Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

An acoustic-articulatory database of VCV sequences and words in Toda at different speaking rates.

Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

A SegNet Based Image Enhancement Technique for Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video.

Valliappan Ca

Proceedings of the National Conference on Communications, 2019

SPIRE-fluent: A Self-Learning App for Tutoring Oral Fluency to Second Language English Learners.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Improved Goodness of Pronunciation (GoP) Measure for Pronunciation Evaluation with DNN-HMM System Considering HMM Transition Probabilities.

Sweekar Sudhakara

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Low Resource Automatic Intonation Classification Using Gated Recurrent Unit (GRU) Networks Pre-Trained with Synthesized Pitch Patterns.

Atreyee Saha

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

ASR Inspired Syllable Stress Detection for Pronunciation Evaluation Without Using a Supervised Classifier and Syllable Level Features.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Whisper to Neutral Mapping Using Cosine Similarity Maximization in i-Vector Space for Speaker Verification.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic and Articulatory Feature Based Speech Rate Estimation Using a Convolutional Dense Neural Network.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Investigation on Speaker Specific Articulatory Synthesis with Speaker Independent Articulatory Inversion.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Improved Air Tissue Boundary Segmentation Technique for Real Time Magnetic Resonance Imaging Video Using Segnet.

C. A. Valliappan

Avinash Kumar

Proceedings of the IEEE International Conference on Acoustics, 2019

A Study on Robustness of Articulatory Features for Automatic Speech Recognition of Neutral and Whispered Speech.

Gokul Srinivasan

Proceedings of the IEEE International Conference on Acoustics, 2019

Formant-gaps Features for Speaker Verification Using Whispered Speech.

Proceedings of the IEEE International Conference on Acoustics, 2019

Air-tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using a Convolutional Encoder-decoder Network.

Proceedings of the IEEE International Conference on Acoustics, 2019

Representation Learning Using Convolution Neural Network for Acoustic-to-articulatory Inversion.

Proceedings of the IEEE International Conference on Acoustics, 2019

Trend Statistics Network and Channel invariant EEG Network for sleep arousal study.

Anirban Dutta Choudhury

Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

2018

PSFM - A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection.

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Optimal sensor placement in electromagnetic articulography recording for speech production study.

Comput. Speech Lang., 2018

Classification of story-telling and poem recitation using head gesture of the talker.

C. A. Valliappan

Anurag Das

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Broad Phoneme Class Specific Deep Neural Network Based Speech Enhancement.

Pavan Karjol

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

SPIRE-SST: An Automatic Web-based Self-learning Tool for Syllable Stress Tutoring (SST) to the Second Language Learners.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video Using Semantic Segmentation with Fully Convolutional Networks.

C. A. Valliappan

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Relating Articulatory Motions in Different Speaking Rates.

Astha Singh

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Automatic Visual Augmentation for Concatenation Based Synthesized Articulatory Videos from Real-time MRI Data for Spoken Language Training.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Reconstructing Neutral Speech from Tracheoesophageal Speech.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network.

Rahul Krishnamurthy

Pebbili Gopikishore

Veeramani Priyadharshini

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Subband Weighting for Binaural Speech Source Localization.

Parth Suresh

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech Enhancement Using Deep Mixture of Experts Based on Hard Expectation Maximization.

Pavan Karjol

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Supervised Air-Tissue Boundary Segmentation Technique in Real-Time Magnetic Resonance Imaging Video Using a Novel Measure of Contrast and Dynamic Programming.

Advait Koparkar

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Binaural Speech Source Localization Using Template Matching of Interaural Time Difference Patterns.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech Enhancement Using Multiple Deep Neural Networks.

Pavan Karjol

Ajay Kumar M

Preethish-Kumar Veeramani

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Comparison of Speech Tasks for Automatic Classification of Patients with Amyotrophic Lateral Sclerosis and Healthy Subjects.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Concatenative Articulatory Video Synthesis Using Real-Time MRI Data for Spoken Language Training.

Urvish Desai

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Comparison of Cough, Wheeze and Sustained Phonations for Automatic Classification Between Healthy Subjects and Asthmatic Patients.

Kausthubha NK

Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

A Maximum Likelihood Formulation To Exploit Heart Rate Variability for Robust Heart Rate Estimation From Facial Video.

[BibT_eX]

[DOI]

Raseena K. T

Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

A Heart Rate Driven Kalman Filter for Continuous Arousal Trend Monitoring.

[BibT_eX]

[DOI]

Shreyasi Datta

Deepan Das

Anirban Dutta Choudhury

Arpan Pal

Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

SleepTight: Identifying Sleep Arousals Using Inter and Intra-Relation of Multimodal Signals.

[BibT_eX]

[DOI]

Anirban Dutta Choudhury

Arpan Pal

Proceedings of the Computing in Cardiology, 2018

2017

Spectrogram Enhancement Using Multiple Window Savitzky-Golay (MWSG) Filter for Robust Bird Sound Detection.

[BibT_eX]

[DOI]

Nithin Rao Koluguri

Priyadarshini Savan Roshan

IEEE ACM Trans. Audio Speech Lang. Process., 2017

A high resolution ENF based multi-stage classifier for location forensics of media recordings.

[BibT_eX]

[DOI]

Pradyumna B. Suresha

Supriya Nagesh

Aditya Gaonkar P.

Proceedings of the Twenty-third National Conference on Communications, 2017

Pitch prediction from Mel-frequency cepstral coefficients using sparse spectrum recovery.

[BibT_eX]

[DOI]

Yamini Belur Keshavaprasad

Proceedings of the Twenty-third National Conference on Communications, 2017

Classification of healthy subjects and patients with essential vocal tremor using empirical mode decomposition of high resolution pitch contour.

[BibT_eX]

[DOI]

H. S. Mekhala

Proceedings of the Twenty-third National Conference on Communications, 2017

A comparative study on the effect of different codecs on speech recognition accuracy using various acoustic modeling techniques.

[BibT_eX]

[DOI]

Proceedings of the Twenty-third National Conference on Communications, 2017

Phoneme State Posteriorgram Features for Speech Based Automatic Classification of Speakers in Cold and Healthy Condition.

[BibT_eX]

[DOI]

Akshay Kalkunte Suresh

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Dual Source-Filter Model of Snore Audio for Snorer Group Classification.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

PRAV: A Phonetically Rich Audio Visual Corpus.

[BibT_eX]

[DOI]

Abhishek Narwekar

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Robust Voiced/Unvoiced Phoneme Classification from Whispered Speech Using the 'Color' of Whispered Phonemes and Deep Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Subband Selection for Binaural Speech Source Localization.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An Information Theoretic Analysis of the Temporal Synchrony Between Head Gestures and Prosodic Patterns in Spontaneous Speech.

[BibT_eX]

[DOI]

Gaurav Fotedar

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation.

[BibT_eX]

[DOI]

Om D. Deshmukh

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A comparative study of acoustic-to-articulatory inversion for neutral and whispered speech.

[BibT_eX]

[DOI]

Nisha Meenakshi

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Low resource point process models for keyword spotting using unsupervised online learning.

[BibT_eX]

[DOI]

Samik Sadhu

Proceedings of the 25th European Signal Processing Conference, 2017

Automatic prediction of spirometry readings from cough and wheeze for monitoring of asthma severity.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

Pitch prediction from Mel-generalized cepstrum - a computationally efficient pitch modeling approach for speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

2016

Cumulative Impulse Strength for Epoch Extraction.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2016

A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection.

[BibT_eX]

[DOI]

Om D. Deshmukh

Speech Commun., 2016

Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Robust real-time pulse rate estimation from facial video using sparse spectral peak tracking.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

A comparative study of articulatory features from facial video and acoustic-to-articulatory inversion for phonetic discrimination.

[BibT_eX]

[DOI]

Abhishek Narwekar

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

A Class-Specific Speech Enhancement for Phoneme Recognition: A Dictionary Learning Approach.

[BibT_eX]

[DOI]

Nazreen P. M.

A. G. Ramakrishnan

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Recognition of Social Roles Using Long Term Role Transitions in Small Group Interactions.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A robust speech rate estimation based on the activation profile from the selected acoustic unit dictionary.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Better acoustic normalization in subject independent acoustic-to-articulatory inversion: Benefit to recognition.

[BibT_eX]

[DOI]

Amber Afshan

Navaneet K. Lakshminarasimha Murthy

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Multiple Spectral Peak Tracking for Heart Rate Monitoring from Photoplethysmography Signal During Intensive Physical Exercise.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2015

Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal.

[BibT_eX]

[DOI]

Nisha Meenakshi

IEEE Signal Process. Lett., 2015

Improved subject-independent acoustic-to-articulatory inversion.

[BibT_eX]

[DOI]

Amber Afshan

Speech Commun., 2015

Automatic gender classification using the Mel frequency cepstrum of neutral and whispered speech: A comparative study.

[BibT_eX]

[DOI]

Proceedings of the Twenty First National Conference on Communications, 2015

An error correction scheme for GCI detection algorithms using pitch smoothness criterion.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data.

[BibT_eX]

[DOI]

Satyabrata Parida

Ashok Kumar Pattem

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification.

[BibT_eX]

[DOI]

Vijitha Periyasamy

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Bayesian learning for time-varying linear prediction of speech.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Missing samples estimation in electromagnetic articulography data using equality constrained Kalman smoother.

[BibT_eX]

[DOI]

P. Sujith

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: benefit to speech recognition.

[BibT_eX]

[DOI]

Prasad Sudhakar

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Comparison of speech quality with and without sensors in electromagnetic articulograph AG 501 recording.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Maximum a-posteriori estimation of missing samples with continuity constraint in Electromagnetic Articulography data.

[BibT_eX]

[DOI]

P. Sujith

Proceedings of the IEEE International Conference on Acoustics, 2014

A sparse smoothing approach for Gaussian Mixture Model based Acoustic-to-Articulatory Inversion.

[BibT_eX]

[DOI]

Prasad Sudhakar

Laurent Jacques

Proceedings of the IEEE International Conference on Acoustics, 2014

Multi-pitch tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform.

[BibT_eX]

[DOI]

M. N. Abhijith

K. Rajgopal

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

High-quality bilingual subtitle document alignments with application to spontaneous speech translation.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2013

Multi-band long-term signal variability features for robust voice activity detection.

[BibT_eX]

[DOI]

Maarten Van Segbroeck

Alexandros Potamianos

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speaker verification based on fusion of acoustic and articulatory information.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Information theoretic acoustic feature selection for acoustic-to-articulatory inversion.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Spatial and temporal alignment of multimodal human speech production data: Real time imaging, flesh point tracking and audio.

[BibT_eX]

[DOI]

Jangwon Kim

Adam C. Lammert

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities.

[BibT_eX]

[DOI]

Vikram Ramanarayanan

Adam C. Lammert

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion.

[BibT_eX]

[DOI]

Jangwon Kim

Sungbok Lee

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Robust Voice Activity Detection Using Long-Term Signal Variability.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter.

[BibT_eX]

[DOI]

Speech Commun., 2011

A Multimodal Real-Time MRI Articulatory Corpus for Speech Research.

[BibT_eX]

[DOI]

Erik Bresch

Louis Goldstein

Athanasios Katsamanis

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Analysis of Inter-Articulator Correlation in Acoustic-to-Articulatory Inversion Using Generalized Smoothness Criterion.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Overlapped speech detection using long-term spectro-temporal similarity in stereo recording.

[BibT_eX]

[DOI]

Bo Xiao

Proceedings of the IEEE International Conference on Acoustics, 2011

Bilingual audio-subtitle extraction using automatic segmentation of movie audio.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

A subject-independent acoustic-to-articulatory inversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Bark Frequency Transform Using an Arbitrary Order Allpass Filter.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2010

Robust voice activity detection in stereo recording with crosstalk.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Pitch Contour Stylization Using an Optimal Piecewise Polynomial Approximation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2009

Context-driven automatic bilingual movie subtitle alignment.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Estimation of articulatory gesture patterns from speech acoustics.

[BibT_eX]

[DOI]

Pierre L. Divenyi

Louis Goldstein

Elliot Saltzman

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust word boundary detection in spontaneous speech using acoustic and lexical cues.

[BibT_eX]

[DOI]

Sankaranarayanan Ananthakrishnan

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Automatic classification of question turns in spontaneous speech using lexical and prosodic evidence.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Pitch period estimation using multipulse model and wavelet transform.

[BibT_eX]

[DOI]

Antonio Ortega

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Speech Segmentation using Extrema-Based Signal Track Length Measure.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Time-varying filter interpretation of Fourier transform and its variants.

[BibT_eX]

[DOI]

T. V. Sreenivas

Signal Process., 2006

Dynamic Programming Based Optimum Non-Uniform Samples For Speech Reconstruction and Coding.

[BibT_eX]

[DOI]