Anil Kumar Vuppala

Orcid: 0000-0002-1313-7917

According to our database1, Anil Kumar Vuppala authored at least 84 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Attempt Towards Stress Transfer in Speech-to-Speech Machine Translation.
CoRR, 2024

2023
IIITH-CSTD Corpus: Crowdsourced Strategies for the Collection of a Large-scale Telugu Speech Corpus.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2023

Enhancing Stutter Detection in Speech Using Zero Time Windowing Cepstral Coefficients and Phase Information.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Enhancing Language Identification in Indian Context Through Exploiting Learned Features with Wav2Vec2.0.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Hardware Accelerator for Transformer based End-to-End Automatic Speech Recognition System.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
Novel feature representation using single frequency filtering and nonlinear energy operator for speech emotion recognition.
Digit. Signal Process., 2022

An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations.
CoRR, 2022

Study of Indian English Pronunciation Variabilities relative to Received Pronunciation.
CoRR, 2022

Decoding self-automated and motivated finger movements using novel single-frequency filtering method - An EEG study.
Biomed. Signal Process. Control., 2022

Exploring High Spectro-Temporal Resolution for Alzheimer's Dementia Detection.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

How do Phonological Properties Affect Bilingual Automatic Speech Recognition?
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022

Multi-Task End-to-End Model for Telugu Dialect and Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Investigation of Subword-Based Bilingual Automatic Speech Recognition for Indian Languages.
Proceedings of the 2022 Fourteenth International Conference on Contemporary Computing, 2022

Towards improving Disfluency Detection from Speech using Shifted Delta Cepstral Coefficients.
Proceedings of the 2022 Fourteenth International Conference on Contemporary Computing, 2022

Shifted Delta Cepstral Coefficients with RNN to Improve the Detection of Parkinson's Disease from the Speech.
Proceedings of the 2022 Fourteenth International Conference on Contemporary Computing, 2022

Implementation of Zero-Phase Zero Frequency Resonator Algorithm on FPGA.
Proceedings of the 2022 Fourteenth International Conference on Contemporary Computing, 2022

2021
Detection of Fricative Landmarks Using Spectral Weighting: A Temporal Approach.
Circuits Syst. Signal Process., 2021

Toward Improving the Performance of Epoch Extraction from Telephonic Speech.
Circuits Syst. Signal Process., 2021

Reed: An Approach Towards Quickly Bootstrapping Multilingual Acoustic Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

An Investigation of Hybrid architectures for Low Resource Multilingual Speech Recognition system in Indian context.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

IE-CPS Lexicon: An Automatic Speech Recognition Oriented Indian-English Pronunciation Dictionary.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

Comparative Study of Different Epoch Extraction Methods for Speech Associated with Voice Disorders.
Proceedings of the IEEE International Conference on Acoustics, 2021

Acoustic Features, Bert Model and their complementary Nature for Alzheimer's Dementia Detection.
Proceedings of the IC3 2021: Thirteenth International Conference on Contemporary Computing, Noida, India, August 5, 2021

Outcomes of Speech to Speech Translation for Broadcast Speeches and Crowd Source Based Speech Data Collection Pilot Projects.
Proceedings of the Big Data Analytics - 9th International Conference, 2021

Detecting Multiple Disfluencies from Speech using Pre-linguistic Automatic Syllabification with Acoustic and Prosody Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

CSTD-Telugu Corpus: Crowd-Sourced Approach for Large-Scale Speech data collection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Comparative Study of Filter Banks to Improve the Performance of Voice Disorder Assessment Systems using LTAS Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Duration of the rhotic approximant /ɹ/ in spastic dysarthria of different severity levels.
Speech Commun., 2020

Analytic phase features for dysarthric speech detection and intelligibility assessment.
Speech Commun., 2020

Towards Emotion Independent Language Identification System.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Study on the Effect of Emotional Speech on Language Identification.
Proceedings of the 2020 National Conference on Communications, 2020

Towards Automatic Assessment of Voice Disorders: A Clinical Approach.
Proceedings of the Interspeech 2020, 2020

Single Frequency Filter Bank Based Long-Term Average Spectra for Hypernasality Detection and Assessment in Cleft Lip and Palate Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Stable Implementation of Zero Frequency Filtering of Speech Signals for Efficient Epoch Extraction.
IEEE Signal Process. Lett., 2019

Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition.
Mob. Networks Appl., 2019

Replay spoofing countermeasures using high spectro-temporal resolution features.
Int. J. Speech Technol., 2019

Sound Privacy: A Conversational Speech Corpus for Quantifying the Experience of Privacy.
Proceedings of the Interspeech 2019, 2019

IIIT-H Spoofing Countermeasures for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2019.
Proceedings of the Interspeech 2019, 2019

Perceptually Enhanced Single Frequency Filtering for Dysarthric Speech Detection and Intelligibility Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-Head Self-Attention Networks for Language Identification.
Proceedings of the 2019 Twelfth International Conference on Contemporary Computing, 2019

Attention based Residual-Time Delay Neural Network for Indian Language Identification.
Proceedings of the 2019 Twelfth International Conference on Contemporary Computing, 2019

An Investigation of LSTM-CTC based Joint Acoustic Model for Indian Language Identification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points.
Multim. Tools Appl., 2018

Prosody modification for speech recognition in emotionally mismatched conditions.
Int. J. Speech Technol., 2018

Combining evidences from excitation source and vocal tract system features for Indian language identification using deep neural networks.
Int. J. Speech Technol., 2018

Application of non-negative frequency-weighted energy operator for vowel region detection.
Int. J. Speech Technol., 2018

Curriculum learning based approach for noise robust language identification using DNN with attention.
Expert Syst. Appl., 2018

Automatic Detection of Retroflex Approximants in a Continuous Tamil Speech.
Circuits Syst. Signal Process., 2018

Improved Language Identification Using Stacked SDC Features and Residual Neural Network.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

IIITH-ILSC Speech Database for Indain Language Identification.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Automatic Detection of Palatalized Consonants in Kashmiri.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Incorporating Speaker Normalizing Capabilities to an End-to-End Speech Recognition System.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

An Exploration towards Joint Acoustic Modeling for Indian Languages: IIIT-H Submission for Low Resource Speech Recognition Challenge for Indian Languages, INTERSPEECH 2018.
Proceedings of the Interspeech 2018, 2018

2017
Investigative study of various activation functions for speech recognition.
Proceedings of the Twenty-third National Conference on Communications, 2017

DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2017

Detection of Replay Attacks Using Single Frequency Filtering Cepstral Coefficients.
Proceedings of the Interspeech 2017, 2017

SFF Anti-Spoofer: IIIT-H Submission for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2017.
Proceedings of the Interspeech 2017, 2017

Significance of neural phonotactic models for large-scale spoken language identification.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Sentiment analysis using relative prosody features.
Proceedings of the Tenth International Conference on Contemporary Computing, 2017

Residual neural networks for speech recognition.
Proceedings of the 25th European Signal Processing Conference, 2017

Importance of non-uniform prosody modification for speech recognition in emotion conditions.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Vowel-Based Non-uniform Prosody Modification for Emotion Conversion.
Circuits Syst. Signal Process., 2016

A Study on Vowel Region Detection from a Continuous Speech.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

A Study on Text-Independent Speaker Recognition Systems in Emotional Conditions Using Different Pattern Recognition Models.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

Significance of automatic detection of vowel regions for automatic shout detection in continuous speech.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

An Investigation of Deep Neural Network Architectures for Language Recognition in Indian Languages.
Proceedings of the Interspeech 2016, 2016

2015
A language model based approach towards large scale and lightweight language identification systems.
CoRR, 2015

Significance of Emotionally Significant Regions of Speech for Emotive to Neutral Conversion.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2015

Improved Language Identification in Presence of Speech Coding.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2015

2014
Speech Processing in Mobile Environments
Springer Briefs in Electrical and Computer Engineering, Springer, ISBN: 978-3-319-03116-3, 2014

Automatic detection of breathy voiced vowels in Gujarati speech.
Int. J. Speech Technol., 2014

Application of Zero-Frequency Filtering for Vowel Onset Point Detection.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2014

2013
Non-uniform time scale modification using instants of significant excitation and vowel onset points.
Speech Commun., 2013

Vowel onset point detection for noisy speech using spectral energy at formant frequencies.
Int. J. Speech Technol., 2013

Neutral Speech to Anger Speech Conversion Using Prosody Modification.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2013

2012
Vowel Onset Point Detection for Low Bit Rate Coded Speech.
IEEE Trans. Speech Audio Process., 2012

Neural network based feature transformation for emotion independent speaker identification.
Int. J. Speech Technol., 2012

Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points.
Circuits Syst. Signal Process., 2012

2011
Recognition of consonant-vowel (CV) units under background noise using combined temporal and spectral preprocessing.
Int. J. Speech Technol., 2011

Effect of Noise on Vowel Onset Point Detection.
Proceedings of the Contemporary Computing - 4th International Conference, 2011

Effect of Noise on Recognition of Consonant-Vowel (CV) Units.
Proceedings of the Contemporary Computing - 4th International Conference, 2011

2010
Effect of Speech Coding on Recognition of Consonant-Vowel (CV) Units.
Proceedings of the Contemporary Computing - Third International Conference, 2010

2009
IITKGP-SESC: Speech Database for Emotion Analysis.
Proceedings of the Contemporary Computing - Second International Conference, 2009


  Loading...