Hemant Kumar Kathania

Bhanuja Karumuru

J. Signal Process. Syst., June, 2026

A study on the layer-wise transferability of self-supervised learning features for children's speech processing tasks.

Speech Commun., 2026

Role of SSL models: Finetuning and feature optimization for dysarthric speech recognition and keyword spotting.

Subham Kutum

Comput. Electr. Eng., 2026

Autoencoder Based Optimized SSL Representations: Complexity Minimization and Improved Dysarthric ASR.

Proceedings of the National Conference on Communications, 2026

2025

Empowering dysarthric communication: a self-supervised approach to keyword spotting.

Subham Kutum

Mahesh Chandra Govil

Signal Image Video Process., December, 2025

Enhancing Speaker-Independent Dysarthric Speech Severity Classification with DSSCNet and Cross-Corpus Adaptation.

Arnab Kumar Roy

CoRR, September, 2025

Layer-Wise Analysis of Self-Supervised Representations for Age and Gender Classification in Children's Speech.

Harishankar Kumar

Mohit Joshi

CoRR, August, 2025

Can Layer-Wise SSL Features Improve Zero-Shot ASR Performance for Children's Speech?

IEEE Signal Process. Lett., 2025

ResEmoteNet: Bridging Accuracy and Loss Reduction in Facial Emotion Recognition.

Arnab Kumar Roy

Adhitiya Sharma

Abhishek Dey

Md. Sarfaraj Alam Ansari

IEEE Signal Process. Lett., 2025

Do all features matter? Layer-wise feature probing of self-supervised speech models for dysarthria severity classification.

Harsh Srivastava

Speech Commun., 2025

Zero-shot KWS for children's speech using layer-wise features from SSL models.

Subham Kutum

Mahesh Chandra Govil

Pattern Recognit. Lett., 2025

Enhancing Traditional Kaldi Dysarthric Speech Recognition Using SSL-Features.

Proceedings of the National Conference on Communications, 2025

Beyond Traditional Speech Modifications: Utilizing Self Supervised Features for Enhanced Zero-Shot Children ASR.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Bridging the Gap in Children's Speech Recognition: Zero-Speech Approaches with Speech Modifications and ASR Architectures.

Proceedings of the 33rd European Signal Processing Conference, 2025

2024

Spectral warping based data augmentation for low resource children's speaker verification.

Virender Kadyan

Multim. Tools Appl., May, 2024

Improvement in Facial Emotion Recognition using Synthetic Data Generated by Diffusion Model.

Arnab Kumar Roy

Adhitiya Sharma

CoRR, 2024

Effect of Speech Modification on Wav2Vec2 Models for Children Speech Recognition.

Proceedings of the International Conference on Signal Processing and Communications, 2024

Role of Acoustics and Prosodic Features for Children's Age Classification.

Vishakha Kumari

Proceedings of the International Conference on Signal Processing and Communications, 2024

In-Domain Data Augmentation to Enhance Severity Level Classification of Dysarthria from Speech.

Bhanuja Karumuru

Proceedings of the International Conference on Signal Processing and Communications, 2024

Improving End-to-End Speech Recognition for Dysarthric Speech through In-Domain Data Augmentation.

Proceedings of the 58th Asilomar Conference on Signals, Systems, and Computers, 2024

Systematic Study of Dysarthric Speech Recognition: Spectral Features and Acoustic Models.

Proceedings of the 58th Asilomar Conference on Signals, Systems, and Computers, 2024

2023

Gammatone-Filterbank Based Pitch-Normalized Cepstral Coefficients for Zero-Resource Children's ASR.

Ankita

Avinash Kumar

Proceedings of the Speech and Computer - 25th International Conference, 2023

Effect of Linear Prediction Order to Modify Formant Locations for Children Speech Recognition.

Udara Laxman Kumar

Proceedings of the Speech and Computer - 25th International Conference, 2023

2022

Data Augmentation Using Spectral Warping for Low Resource Children ASR.

Virender Kadyan

J. Signal Process. Syst., December, 2022

A formant modification method for improved ASR of children's speech.

Paavo Alku

Speech Commun., 2022

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks.

Tamás Grósz

CoRR, 2022

2021

Synthesis Speech Based Data Augmentation for Low Resource Children ASR.

Virender Kadyan

Prajjval Govil

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Spectral modification for recognition of children's speech under mismatched conditions.

Paavo Alku

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces.

Tuomas Kaseva

Aku Rouhe

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Vowel Non-Vowel Based Spectral Warping and Time Scale Modification for Improvement in Children's ASR.

Avinash Kumar

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

Creating speaker independent ASR system through prosody modification based data augmentation.

B. Tarun Sai

Pattern Recognit. Lett., 2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge.

Tamás Grósz

CoRR, 2020

Data Augmentation Using Prosody and False Starts to Recognize Non-Native Children's Speech.

Tamás Grósz

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Study of Formant Modification for Children ASR.

Paavo Alku

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019

Developing speaker independent ASR system using limited data through prosody modification based on fuzzy classification of spectral bins.

Digit. Signal Process., 2019

Role of Linear, Mel and Inverse-Mel Filterbanks in Automatic Recognition of Speech from High-Pitched Speakers.

Circuits Syst. Signal Process., 2019

Speaking-Rate Adaptation of Automatic Speech Recognition System through Fuzzy Classification based Time-Scale Modification.

B. Tarun Sai

Proceedings of the National Conference on Communications, 2019

On the Role of Linear, Mel and Inverse-Mel Filterbank in the Context of Automatic Speech Recognition.

Proceedings of the National Conference on Communications, 2019

2018

Improving children's mismatched ASR using structured low-rank feature projection.

Abhishek Dey

Rohit Sinha

Speech Commun., 2018

Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition.

Rohit Sinha

Digit. Signal Process., 2018

An Experimental Study on the Significance of Variable Frame-Length and Overlap in the Context of Children's Speech Recognition.

Chaman Singh

Circuits Syst. Signal Process., 2018

Explicit Pitch Mapping for Improved Children's Speech Recognition.

Arun B. Samaddar

Circuits Syst. Signal Process., 2018

Exploring the Role of Speaking-Rate Adaptation on Children's Speech Recognition.

Chaman Singh

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Improving Children's Speech Recognition Through Time Scale Modification Based Speaking Rate Adaptation.

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Role of Prosodic Features on Children's Speech Recognition.

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017

Effect of Prosody Modification on Children's ASR.

IEEE Signal Process. Lett., 2017

Improving children speech recognition in acoustically mismatched condition using eigenvoices and feature projections.

Rohit Sinha

Proceedings of the Twenty-third National Conference on Communications, 2017

Improving Children's Speech Recognition Through Explicit Pitch Scaling Based on Iterative Spectrogram Inversion.
