Sudarsana Reddy Kadiri

Orcid: 0000-0001-5806-3053

According to our database1, Sudarsana Reddy Kadiri authored at least 86 papers between 2013 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Can Layer-wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech?
CoRR, August, 2025

Layer-Wise Analysis of Self-Supervised Representations for Age and Gender Classification in Children's Speech.
CoRR, August, 2025

Decoding Neural Signatures of Semantic Evaluations in Depression and Suicidality.
CoRR, July, 2025

Neural Responses to Affective Sentences Reveal Signatures of Depression.
CoRR, June, 2025

Towards disentangling the contributions of articulation and acoustics in multimodal phoneme recognition.
CoRR, May, 2025

Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements.
CoRR, April, 2025

Towards robust heart failure detection in digital telephony environments by utilizing transformer-based codec inversion.
Speech Commun., 2025

Automatic classification of vocal intensity categories from amplitude-normalized speech signals by comparing acoustic features and classifier models.
Speech Commun., 2025

Zero-shot KWS for children's speech using layer-wise features from SSL models.
Pattern Recognit. Lett., 2025

Enhancing Traditional Kaldi Dysarthric Speech Recognition Using SSL-Features.
Proceedings of the National Conference on Communications, 2025

Enhancing Listened Speech Decoding from EEG via Parallel Phoneme Sequence Prediction.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Wavelet Scattering Network Features for Intensity Category Classification and Prediction of SPL from Speech.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Exploring the Impact of Fine-Tuning the Wav2vec2 Model in Database-Independent Detection of Dysarthric Speech.
IEEE J. Biomed. Health Informatics, August, 2024

Spectral warping based data augmentation for low resource children's speaker verification.
Multim. Tools Appl., May, 2024

Automatic classification of the severity level of Parkinson's disease: A comparison of speaking tasks, features, and classifiers.
Comput. Speech Lang., January, 2024

Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals.
Comput. Speech Lang., January, 2024

A comparison of data augmentation methods in voice pathology detection.
Comput. Speech Lang., January, 2024

Pre-trained models for detection and severity level classification of dysarthria from speech.
Speech Commun., 2024

AVID: A speech database for machine learning studies on vocal intensity.
Speech Commun., 2024

Can a Machine Distinguish High and Low Amount of Social Creak in Speech?
CoRR, 2024

Evaluation of state-of-the-art ASR Models in Child-Adult Interactions.
CoRR, 2024

Towards Child-Inclusive Clinical Video Understanding for Autism Spectrum Disorder.
CoRR, 2024

Effect of Speech Modification on Wav2Vec2 Models for Children Speech Recognition.
Proceedings of the International Conference on Signal Processing and Communications, 2024

MMSD-Net: Towards Multi-modal Stuttering Detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Toward Fully-End-to-End Listened Speech Decoding from EEG Signals.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Fine-tuning of Pre-trained Models for Classification of Vocal Intensity Category from Speech Signals.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Improving End-to-End Speech Recognition for Dysarthric Speech through In-Domain Data Augmentation.
Proceedings of the 58th Asilomar Conference on Signals, 2024

Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models.
Proceedings of the 58th Asilomar Conference on Signals, 2024

2023
Refining a deep learning-based formant tracker using linear prediction methods.
Comput. Speech Lang., June, 2023

Analysis of Instantaneous Frequency Components of Speech Signals for Epoch Extraction.
Comput. Speech Lang., 2023

Classification of Phonation Modes in Classical Singing Using Modulation Power Spectral Features.
IEEE Access, 2023

Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Utilizing Wav2Vec In Database-Independent Voice Disorder Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Automatic Classification of Vocal Intensity Category from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

Wav2vec-Based Detection and Severity Level Classification of Dysarthria From Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Data Augmentation Using Spectral Warping for Low Resource Children ASR.
J. Signal Process. Syst., December, 2022

A formant modification method for improved ASR of children's speech.
Speech Commun., 2022

Subjective Evaluation of Basic Emotions from Audio-Visual Data.
Sensors, 2022

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks.
CoRR, 2022

Wav2vec2-based Paralinguistic Systems to Recognise Vocalised Emotions and Stuttering.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Extraction and Utilization of Excitation Information of Speech: A Review.
Proc. IEEE, 2021

Glottal features for classification of phonation type from speech and neck surface accelerometer signals.
Comput. Speech Lang., 2021

Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks.
IEEE Access, 2021

Spectral modification for recognition of children's speech undermismatched conditions.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

2020
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Detection of glottal closure instant and glottal open region from speech signals using spectral flatness measure.
Speech Commun., 2020

Analysis and classification of phonation types in speech and singing voice.
Speech Commun., 2020

Analysis and Detection of Pathological Voice Using Glottal Source Features.
IEEE J. Sel. Top. Signal Process., 2020

Time Delay Estimation from Mixed Multispeaker Speech Signals Using Single Frequency Filtering.
Circuits Syst. Signal Process., 2020

Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference.
Circuits Syst. Signal Process., 2020

Determination of glottal closure instants from clean and telephone quality speech signals using single frequency filtering.
Comput. Speech Lang., 2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge.
CoRR, 2020

Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification.
IEEE Access, 2020

Excitation Features of Speech for Speaker-Specific Emotion Detection.
IEEE Access, 2020

Zero-Time Windowing Cepstral Coefficients for Dialect Classification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Spectral Features derived from Single Frequency Filter for Multispeaker Localization.
Proceedings of the 2020 National Conference on Communications, 2020

Parkinson's Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Learning Filterbanks from Raw Waveform for Accent Classification.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Study of Formant Modification for Children ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Comparison of Glottal Closure Instants Detection Algorithms for Emotional Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise.
Comput. Speech Lang., 2019

Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Quantitative Comparison of Epoch Extraction Algorithms for Telephone Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Significance of phase in single frequency filtering outputs of speech signals.
Speech Commun., 2018

Discriminating Nasals and Approximants in English Language Using Zero Time Windowing.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Analysis and Detection of Phonation Modes in Singing Voice using Excitation Source Features and Single Frequency Filtering Cepstral Coefficients (SFFCC).
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Breathy to Tense Voice Discrimination using Zero-Time Windowing Cepstral Coefficients (ZTWCCs).
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Detection of Glottal Closure Instants in Degraded Speech Using Single Frequency Filtering Analysis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Epoch extraction from emotional speech using single frequency filtering approach.
Speech Commun., 2017

Locating Burst Onsets Using SFF Envelope and Phase Information.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Detection of Replay Attacks Using Single Frequency Filtering Cepstral Coefficients.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

SFF Anti-Spoofer: IIIT-H Submission for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2017.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speech polarity detection using strength of impulse-like excitation extracted from speech epochs.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Analysis of Emotional Speech - A Review.
Proceedings of the Toward Robotic Socially Believable Behaving Systems - Volume I, 2016

Vowel-Based Non-uniform Prosody Modification for Emotion Conversion.
Circuits Syst. Signal Process., 2016

Robust Estimation of Fundamental Frequency Using Single Frequency Filtering Approach.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Analysis of excitation source features of speech for emotion recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Analysis of singing voice for epoch extraction using Zero Frequency Filtering method.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Excitation source features for discrimination of anger and happy emotions.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Discriminating Neutral and Emotional Speech using Neural Networks.
Proceedings of the 11th International Conference on Natural Language Processing, 2014

Naturalistic Audio-Visual Emotion Database.
Proceedings of the 11th International Conference on Natural Language Processing, 2014

2013
Analysis of emotional speech at subsegmental level.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013


  Loading...