S. R. Mahadeva Prasanna

Proceedings of the National Conference on Communications, 2025

Development of Resource Efficient Direct Speech to Speech Translation System using Transformer, Reformer, and Linformer.

[BibT_eX]

[DOI]

Proceedings of the National Conference on Communications, 2025

Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment.

[BibT_eX]

[DOI]

Vishwanath Pratap Singh

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Tone recognition in low-resource languages of North-East India: peeling the layers of SSL-based speech models.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Cross-lingual Evaluation Of Hypernasality Using Wav2Vec2 Features.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Fusion of Modulation Spectrogram and ssl with Multi-Head Attention for Fake Speech Detection.

[BibT_eX]

[DOI]

Abhishek Bedge

Saisha Suresh Bore

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification.

[BibT_eX]

[DOI]

Susmita Bhattacharjee

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

Cross-linguistic rhythm analysis of Mising and Assamese.

[BibT_eX]

[DOI]

ACM Trans. Asian Low Resour. Lang. Inf. Process., October, 2024

Exploration of Speech and Music Information for Movie Genre Classification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., August, 2024

NeaSource Localization and Beamforming in the Spherical Sector Harmonics Domain.

[BibT_eX]

[DOI]

Shekhar Kumar Yadav

Nithin V. George

IEEE J. Sel. Top. Signal Process., May, 2024

Milestones in speaker recognition.

[BibT_eX]

[DOI]

Artif. Intell. Rev., March, 2024

Implicit Self-Supervised Language Representation for Spoken Language Diarization.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Spectro-Temporally Compressed Source Features for Replay Attack Detection.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2024

Generative attention based framework for implicit language change detection.

[BibT_eX]

[DOI]

Digit. Signal Process., 2024

Biometrics in extended reality: a review.

[BibT_eX]

[DOI]

Raghavendra Ramachandra

Sushma Venkatesh

Discov. Artif. Intell., 2024

Spoken Language Change Detection Inspired by Speaker Change Detection.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2024

Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2024

Depression Classification Using Token Merging-Based Speech Spectrotemporal Transformer.

[BibT_eX]

[DOI]

Lokesh Kumar

Kumar Kaustubh

Proceedings of the Speech and Computer - 26th International Conference, 2024

Speaker and Digit Representation based Voice OTP System.

[BibT_eX]

[DOI]

Sahaja Nandyala

Pavanitha Manche

Proceedings of the International Conference on Signal Processing and Communications, 2024

Depression Classification Using Log-Mel Spectrograms: A Comparative Analysis of Window Size-Based Data Augmentation and Deep Learning Models.

[BibT_eX]

[DOI]

Lokesh Kumar

Kumar Kaustubh

Shashaank Aswatha Mattur

Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024

Fake Speech Detection in Domain Variability Scenario.

[BibT_eX]

[DOI]

Proceedings of the National Conference on Communications, 2024

TM-PATHVQA: 90000+ Textless Multilingual Questions for Medical Visual Question Answering.

[BibT_eX]

[DOI]

Tonmoy Rajkhowa

Amartya Roy Chowdhury

Sankalp Nagaonkar

Achyut Mani Tripathi

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments.

[BibT_eX]

[DOI]

Hrishikesh Ravindra Karande

Deepu Vijayasenan

Sriram Ganapathy

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Evaluating the Efficacy of Large Acoustic Model for Documenting Non-Orthographic Tribal Languages in India.

[BibT_eX]

[DOI]

Tonmoy Rajkhowa

Amartya Roy Chowdhury

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Multi-cultural speech emotion recognition using language and speaker cues.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., May, 2023

Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Implicit spoken language diarization.

[BibT_eX]

[DOI]

CoRR, 2023

Dialect Identification in Ao Using Modulation-Based Representation.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Significance of Audio Quality in Speech-to-Text Translation Systems.

[BibT_eX]

[DOI]

Tonmoy Rajkhowa

Proceedings of the Speech and Computer - 25th International Conference, 2023

Source and System-Based Modulation Approach for Fake Speech Detection.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Significance of Indic Self-supervised Speech Representations for Indic Under-Resourced ASR.

[BibT_eX]

[DOI]

Sougata Mukherjee

Proceedings of the Speech and Computer - 25th International Conference, 2023

I-MSV 2022: Indic-Multilingual and Multi-sensor Speaker Verification Challenge.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Design and Development of Voice OTP Authentication System.

[BibT_eX]

[DOI]

Pavanitha Manche

Sahaja Nandyala

Gayathri Ananthanarayanan

Proceedings of the Speech and Computer - 25th International Conference, 2023

Rhythm Formant Analysis for Automatic Depression Classification.

[BibT_eX]

[DOI]

Kumar Kaustubh

Proceedings of the Speech and Computer - 25th International Conference, 2023

Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language.

[BibT_eX]

[DOI]

Ashwini Dasare

Aditya Srinivas Menon

Konjengbam Anand

Proceedings of the Speech and Computer - 25th International Conference, 2023

Driver Speech Detection in Real Driving Scenario.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Direct Vs Cascaded Speech-to-Speech Translation Using Transformer.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

Optimizing Direct Speech-to-Text Translation for un-orthographic low-resource tribal languages using source transliterations.

[BibT_eX]

[DOI]

Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Leveraging Cross Lingual Speech Representations To Build ASR For Under-resourced Languages.

[BibT_eX]

[DOI]

Sougata Mukherjee

Prashant Bannulmath

Deepak K. T.

Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Comparative Analysis of Direct Speech-to-Speech Translation and Voice Conversion Using Bi-LSTM.

[BibT_eX]

[DOI]

Sai Naga Venu Gopal Bhamidi

Shashi Prabha

Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Challenges in spoken language diarization in code-switched scenario.

[BibT_eX]

[DOI]

Proceedings of the 28th National Conference on Communications, 2023

Investigation Of Data Augmentation Techniques For Bi-LSTM Based Direct Speech To Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 28th National Conference on Communications, 2023

End to End Spoken Language Diarization with Wav2vec Embeddings.

[BibT_eX]

[DOI]

Jayadev N. Patil

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Classification of Cleft Lip and Palate Speech Using Fine-Tuned Transformer Pretrained Models.

[BibT_eX]

[DOI]

Susmita Bhattacharjee

Proceedings of the Intelligent Human Computer Interaction - 15th International Conference, 2023

2022

Speech/music classification using phase-based and magnitude-based features.

[BibT_eX]

[DOI]

Speech Commun., 2022

Attention gated tensor neural network architectures for speech emotion recognition.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2022

Importance of Supra-Segmental Information and Self-Supervised Framework for Spoken Language Diarization Task.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Fake Speech Detection Using Modulation Spectrogram.

[BibT_eX]

[DOI]

Raghav Magazine

Anand Hedge

Proceedings of the Speech and Computer - 24th International Conference, 2022

Fake Speech Detection Using OpenSMILE Features.

[BibT_eX]

[DOI]

Devesh Kumar

Pavan Kumar V. Patil

Proceedings of the Speech and Computer - 24th International Conference, 2022

Automatic Rhythm and Speech Rate Analysis of Mising Spontaneous Speech.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Speech Music Overlap Detection Using Spectral Peak Evolutions.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Overlapped Speech Detection Using AM-FM Based Time-Frequency Representations.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Issues in Sub-Utterance Level Language Identification in a Code Switched Bilingual Scenario.

[BibT_eX]

[DOI]

Joshitha Gandra

Vaishnavi Patil

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Foreground-Background Audio Separation using Spectral Peaks based Time-Frequency Masks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Speaker Anonymization for Machines using Sinusoidal Model.

[BibT_eX]

[DOI]

Amitabh Swain

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Significance of excitation source sequence information for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Text to Speech System for Lambani - A Zero Resource, Tribal Language of India.

[BibT_eX]

[DOI]

Ashwini Dasare

K. Samudravijaya

Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

Analysis of Layer-Wise Training in Direct Speech to Speech Translation Using BI-LSTM.

[BibT_eX]

[DOI]

Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

Analyzing RMFCC Feature for Dialect Identification in Ao, an Under-Resourced Language.

[BibT_eX]

[DOI]

Proceedings of the 27th National Conference on Communications, 2022

Low-Resource Dialect Identification in Ao Using Noise Robust Mean Hilbert Envelope Coefficients.

[BibT_eX]

[DOI]

Proceedings of the 27th National Conference on Communications, 2022

Importance of excitation source and sequence learning towards spoken language identification task.

[BibT_eX]

[DOI]

Soma Siddhartha

Proceedings of the 27th National Conference on Communications, 2022

Significance of Prosody Modification in Privacy Preservation on speaker verification.

[BibT_eX]

[DOI]

Amitabh Swain

Proceedings of the 27th National Conference on Communications, 2022

Machine Translation for a Very Low-Resource Language - Layer Freezing Approach on Transfer Learning.

[BibT_eX]

[DOI]

Deepak K. T.

Samudra Vijaya K

Proceedings of the Fifth Workshop on Technologies for Machine Translation of Low-Resource Languages, 2022

Prosodic Information in Dialect Identification of a Tonal Language: The case of Ao.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Event-Based Transformation of Misarticulated Stops in Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2021

Exploration of Visual Features and their weighted-additive fusion for Video Captioning.

[BibT_eX]

[DOI]

CoRR, 2021

Modification of misarticulated fricative /s/ in cleft lip and palate speech.

[BibT_eX]

[DOI]

Krothapalli Sreenivasa Rao

Biomed. Signal Process. Control., 2021

Multilingual Audio-Visual Smartphone Dataset and Evaluation.

[BibT_eX]

[DOI]

Hareesh Mandalapu

Aravinda Reddy P. N.

Raghavendra Ramachandra

Pabitra Mitra

Krothapalli Sreenivasa Rao

Christoph Busch

IEEE Access, 2021

Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey.

[BibT_eX]

[DOI]

Hareesh Mandalapu

Aravinda Reddy P. N.

Raghavendra Ramachandra

Pabitra Mitra

Christoph Busch

IEEE Access, 2021

Learning Mizo Tones from F0 Contours Using 1D-CNN.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Excitation Source Feature Based Dialect Identification in Ao - A Low Resource Language.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Automatic Detection of Shouted Speech Segments in Indian News Debates.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Alzheimer's Dementia Recognition Using Multimodal Fusion of Speech and Text Embeddings.

[BibT_eX]

[DOI]

Shalendar Bhasin

Ravi Jasuja

Proceedings of the Intelligent Human Computer Interaction, 2021

Exploring Multimodal Features and Fusion for Time-Continuous Prediction of Emotional Valence and Arousal.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Human Computer Interaction, 2021

Processing Phoneme Specific Segments for Cleft Lip and Palate Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Vowel Onset Point Based Screening of Misarticulated Stops in Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Vikram C. Mathad

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Speech/Music Classification Using Features From Spectral Peaks.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Enhancement of cleft palate speech using temporal and spectral processing.

[BibT_eX]

[DOI]

Speech Commun., 2020

Sinusoidal model-based hypernasality detection in cleft palate speech using CVCV sequence.

[BibT_eX]

[DOI]

Speech Commun., 2020

Classification of Speech vs. Speech with Background Music.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Signal Processing and Communications, 2020

Overlapped/Non-Overlapped Speech Transition Point Detection Using Bag-of-Audio-Words.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Signal Processing and Communications, 2020

Language Specific Information from LP Residual Signal Using Linear Sub Band Filters.

[BibT_eX]

[DOI]

Soma Siddhartha

Proceedings of the 2020 National Conference on Communications, 2020

Analysis of Excitation Source Characteristics for Shouted and Normal Speech Classification.

[BibT_eX]

[DOI]

Proceedings of the 2020 National Conference on Communications, 2020

Lexical Tone Recognition in Mizo using Acoustic-Prosodic Features.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

VOP Detection in Variable Speech Rate Condition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Spectral Moment and Duration of Burst of Plosives in Speech of Children with Hearing Impairment and Typically Developing Children - A Comparative Study.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Detection of Nasalized Voiced Stops in Cleft Palate Speech Using Epoch-Synchronous Features.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Handwriting recognition using sinusoidal model parameters.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2019

Representation of online handwriting using multi-component sinusoidal model.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

Speech synthesis for glottal activity region processing.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2019

An improved discriminative region selection methodology for online handwriting recognition.

[BibT_eX]

[DOI]

Int. J. Document Anal. Recognit., 2019

Exploiting forced alignment of time-reversed data for improving HMM-based handwriting segmentation.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2019

Speech Enhancement Using Source Information for Phoneme Recognition of Speech with Background Music.

[BibT_eX]

[DOI]

Abhishek Dey

Circuits Syst. Signal Process., 2019

Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2019

Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2019

Robust Methods for Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2019

Acoustic Correlates of Aspiration in Fricatives and Nasals.

[BibT_eX]

[DOI]

Saswati Rabha

Proceedings of the TENCON 2019, 2019

Emotion Recognition from Raw Speech using Wavenet.

[BibT_eX]

[DOI]

Proceedings of the TENCON 2019, 2019

Shouted and Normal Speech Classification Using 1D CNN.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Machine Intelligence, 2019

RSL2019: A Realistic Speech Localization Corpus.

[BibT_eX]

[DOI]

Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Modelling Glottal Flow Derivative Signal for Detection of Replay Speech Samples.

[BibT_eX]

[DOI]

Proceedings of the National Conference on Communications, 2019

Development of Assamese Text-to-speech System using Deep Neural Network.

[BibT_eX]

[DOI]

Proceedings of the National Conference on Communications, 2019

Modification of Devoicing Error in Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Nasal Air Emission in Sibilant Fricatives of Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

SpeechMarker: A Voice Based Multi-Level Attendance Application.

[BibT_eX]

[DOI]

Abhishek Shrivastava

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Hypernasality Severity Detection Using Constant Q Cepstral Coefficients.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Exploration of CNN Features for Online Handwriting Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Synthesis of Handwriting Dynamics using Sinusoidal Model.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Glottal Instants Extraction from Speech Signal Using Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Significance of sonority information for voiced/unvoiced decision in speech synthesis.

[BibT_eX]

[DOI]

Speech Commun., 2018

Analysis of the Hilbert Spectrum for Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Speech Commun., 2018

Automatic syllabification of speech signal using short time energy and vowel onset points.

[BibT_eX]

[DOI]

Leena Mary

Anil P. Antony

Ben P. Babu

Int. J. Speech Technol., 2018

Significance of duration modification for speaker verification under mismatch speech tempo condition.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2018

Multi-style speaker recognition database in practical conditions.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2018

GMM posterior features for improving online handwriting recognition.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2018

Detection of the Glottal Closure Instants Using Empirical Mode Decomposition.

[BibT_eX]

[DOI]

Hugo Leonardo Rufiner

Gastón Schlotthauer

Circuits Syst. Signal Process., 2018

End Point Detection Using Speech-Specific Knowledge for Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2018

Time-Frequency Audio Features for Speech-Music Classification.

[BibT_eX]

[DOI]

CoRR, 2018

Speaker Identification Using Tensor Decomposition of Acoustic Models.

[BibT_eX]

[DOI]

Proceedings of the TENCON 2018, 2018

Hypernasality Detection Using Zero Time Windowing.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Excitation Source Feature for Discriminating Shouted and Normal Speech.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Dialect Identification Using Tonal and Spectral Features in Two Dialects of Ao.

[BibT_eX]

[DOI]

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Processing Transition Regions of Glottal Stop Substituted /S/ for Intelligibility Enhancement of Cleft Palate Speech.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Spoken Keyword Detection Using Joint DTW-CNN.

[BibT_eX]

[DOI]

Ravi Shankar

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Estimation of Hypernasality Scores from Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Ayush Tripathi

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Detection of Glottal Activity Errors in Production of Stop Consonants in Children with Cleft Lip and Palate.

[BibT_eX]

[DOI]

Ajish K. Abraham

Pushpavathi M

Girish K. S

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Self-similarity Matrix Based Intelligibility Assessment of Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exploration of Compressed ILPR Features for Replay Attack Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Analysis of Breathiness in Contextual Vowel of Voiceless Nasals in Mizo.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Pitch-Adaptive Front-end Feature for Hypernasality Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Robust Mizo Continuous Speech Recognition.

[BibT_eX]

[DOI]

S. R. Nirmala

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

AGROASSAM: A Web Based Assamese Speech Recognition Application for Retrieving Agricultural Commodity Price and Weather Information.

[BibT_eX]

[DOI]

K. Samudravijaya

S. R. Nirmala

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Glotto Vibrato Graph: A Device and Method for Recording, Analysis and Visualization of Glottal Activity.

[BibT_eX]

[DOI]

Kishalay Chakraborty

Senjam Shantirani Devi

Sanjeevan Devnath

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exploring Discriminative HMM States for Improved Recognition of Online Handwriting.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Exploring Sparse Representation for Improved Online Handwriting Recognition.

[BibT_eX]

[DOI]

Syed Shahnawazuddin

Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

DNN-HMM Based Large Vocabulary Online Handwritten Assamese Word Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Investigating Text-independent Speaker Verification from Practically Realizable System Perspective.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Improvements in IITG Assamese Spoken Query System: Background Noise Suppression and Alternate Acoustic Modeling.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2017

Development of Multi-Level Speech based Person Authentication System.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2017

Sonority Measurement Using System, Source, and Suprasegmental Information.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Epoch Extraction From Telephone Quality Speech Using Single Pole Filter.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Enhancement of Spectral Tilt in Synthesized Speech.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2017

Empirical Mode Decomposition for adaptive AM-FM analysis of Speech: A Review.

[BibT_eX]

[DOI]

Marcelo Alejandro Colominas

Leandro Daniel Vignolo

Gastón Schlotthauer

Hugo Leonardo Rufiner

Speech Commun., 2017

Analysis of the Intrinsic Mode Functions for Speaker Information.

[BibT_eX]

[DOI]

Speech Commun., 2017

Consonant-vowel unit recognition using dominant aperiodic and transition region detection.

[BibT_eX]

[DOI]

Speech Commun., 2017

Exploring kernel discriminant analysis for speaker verification with limited test data.

[BibT_eX]

[DOI]

Akhil Babu Manam

Pattern Recognit. Lett., 2017

Glottal opening instants detection using zero frequency resonator.

[BibT_eX]

[DOI]

K. Ramesh

Int. J. Speech Technol., 2017

Clean speech/speech with background music classification using HNGD spectrum.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2017

Processing degraded speech for text dependent speaker verification.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2017

Improved voicing decision using glottal activity features for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Digit. Signal Process., 2017

Vowel onset point based characterization of velopharyngeal activity using imaging techniques.

[BibT_eX]

[DOI]

Proceedings of the Twenty-third National Conference on Communications, 2017

Pause insertion in assamese synthesized speech using speech specific features.

[BibT_eX]

[DOI]

Proceedings of the Twenty-third National Conference on Communications, 2017

Role of voice activity detection methods for the speakers in the wild challenge.

[BibT_eX]

[DOI]

Proceedings of the Twenty-third National Conference on Communications, 2017

Vowel Onset Point Detection Using Sonority Information.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

IITG-Indigo System for NIST 2016 SRE Challenge.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Indoor/Outdoor Audio Classification Using Foreground Speech Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Acoustic Characterization of Word-Final Glottal Stops in Mizo and Assam Sora.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Hypernasality Severity Analysis in Cleft Lip and Palate Speech Using Vowel Space Area.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Phase Modeling Using Integrated Linear Prediction Residual for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Zero Frequency Filter Based Analysis of Voice Disorders.

[BibT_eX]

[DOI]

Keerthi Pullela

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Foreground Speech Segmentation and Enhancement Using Glottal Closure Instants and Mel Cepstral Coefficients.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Feature optimisation for stress recognition in speech.

[BibT_eX]

[DOI]

Leandro Daniel Vignolo

Hugo Leonardo Rufiner

Diego H. Milone

Pattern Recognit. Lett., 2016

A better decomposition of speech obtained using modified Empirical Mode Decomposition.

[BibT_eX]

[DOI]

Digit. Signal Process., 2016

Speech / music classification using speech-specific features.

[BibT_eX]

[DOI]

Digit. Signal Process., 2016

A Subspace Projection Approach for Analysis of Speech Under Stressed Condition.

[BibT_eX]

[DOI]

Sumitra Shukla

Circuits Syst. Signal Process., 2016

Countermeasure to handle replay attacks in practical speaker verification systems.

[BibT_eX]

[DOI]

Anupama Paul

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Frequency count based two stage classification for online handwritten character recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Significance of constraining text in limited data text-independent speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Spectral Enhancement of Cleft Lip and Palate Speech.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Analysis of Glottal Stop in Assam Sora Language.

[BibT_eX]

[DOI]

Luke Horo

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Source modeling for HMM based speech synthesis using integrated LP residual.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Low Complexity On-Line Adaptation Techniques in Context of Assamese Spoken Query System.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2015

Detection of Glottal Activity Using Different Attributes of Source Information.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2015

Processing of linear prediction residual in spectral and cepstral domains for speaker information.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2015

Epoch Extraction Using Zero Band Filtering from Speech Signal.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2015

Characterizing glottal activity from speech using Empirical Mode Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Twenty First National Conference on Communications, 2015

Speaker change detection using excitation source and vocal tract system information.

[BibT_eX]

[DOI]

Mousmita Sarma

Sree Nilendra Gadre

Proceedings of the Twenty First National Conference on Communications, 2015

Curvature point based HMM state prediction for online handwritten assamese strokes recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty First National Conference on Communications, 2015

Speech vs music discrimination using Empirical Mode Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Twenty First National Conference on Communications, 2015

Different aspects of source information for limited data speaker verification.

[BibT_eX]

[DOI]

Proceedings of the Twenty First National Conference on Communications, 2015

Comparison of assamese character recognizer using stroke level and character level engines.

[BibT_eX]

[DOI]

Proceedings of the Twenty First National Conference on Communications, 2015

Detection of mizo tones.

[BibT_eX]

[DOI]

Wendy Lalhminghlui

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Analysis of Vocal Tract Constrictions using Zero Frequency Filtering.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2014

Online Stroke and Akshara Recognition GUI in Assamese Language Using Hidden Markov Model.

[BibT_eX]

[DOI]

CoRR, 2014

Speech biometric based attendance system.

[BibT_eX]

[DOI]

Proceedings of the Twentieth National Conference on Communications, 2014

Epochs based compression of LP residual for source modeling in text-to-speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Twentieth National Conference on Communications, 2014

Detection of vowel onset points in voiced aspirated sounds of indian languages.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Combining source and system information for limited data speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

The Blizzard Challenge 2014.

[BibT_eX]

[DOI]

Kishore Prahallad

Anandaswarup Vadapalli

Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

2013

Speaker Verification by Vowel and Nonvowel Like Segmentation.

[BibT_eX]

[DOI]

Gayadhar Pradhan

IEEE Trans. Speech Audio Process., 2013

Expressive speech synthesis: a review.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2013

Dynamic prosody modification using zero frequency filtered signal.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2013

Development and evaluation of online text-independent speaker verification system for remote person authentication.

[BibT_eX]

[DOI]

Debmalya Chakrabarty

Int. J. Speech Technol., 2013

A syllable-based framework for unit selection synthesis in 13 Indian languages.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Detection of glottal opening instants using Hilbert envelope.

[BibT_eX]

[DOI]

K. Ramesh

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Significance of instants of significant excitation for source modeling.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

The IITG speaker verification systems for NIST SRE 2012.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Speaker verification in sensor and acoustic environment mismatch conditions.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2012

Speaker verification using excitation source information.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2012

Multivariability speaker recognition database in Indian scenario.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2012

Foreground Speech Segmentation using Zero Frequency Filtered Signal.

[BibT_eX]

[DOI]

Deepak K. T.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Assamese online handwritten digit recognition system using hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the Proceeding of the workshop on Document Analysis and Recognition, 2012

2011

Significance of Vowel-Like Regions for Speaker Verification Under Degraded Conditions.

[BibT_eX]

[DOI]

Gayadhar Pradhan

IEEE ACM Trans. Audio Speech Lang. Process., 2011

Enhancement of noisy speech by temporal and spectral processing.

[BibT_eX]

[DOI]

Speech Commun., 2011

Recognition of consonant-vowel (CV) units under background noise using combined temporal and spectral preprocessing.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2011

Spectral slope based analysis and classification of stressed speech.

[BibT_eX]

[DOI]

Sumitra Shukla

Int. J. Speech Technol., 2011

Speaker verification under degraded condition: a perceptual study.

[BibT_eX]

[DOI]

Gayadhar Pradhan

Int. J. Speech Technol., 2011

Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2011

Speaker recognition under limited data condition by noise addition.

[BibT_eX]

[DOI]

H. S. Jayanna

Expert Syst. Appl., 2011

Neutral to Target Emotion Conversion Using Source and Suprasegmental Information.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Epoch Extraction in High Pass Filtered Speech Using Hilbert Envelope.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Study of robustness of zero frequency resonator method for extraction of fundamental frequency.

[BibT_eX]

[DOI]

Sunitha Guruprasad

Proceedings of the IEEE International Conference on Acoustics, 2011

Chain Code Histogram Based Facial Image Feature Extraction under Degraded Conditions.

[BibT_eX]

[DOI]

Soyuj Kumar Sahoo

Jitendra Jain

Proceedings of the Advances in Computing and Communications, 2011

2010

Two speaker speech separation by LP residual weighting and harmonics enhancement.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2010

Analysis of excitation source information in emotional speech.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Analysis of instantaneous F0 contours from two speakers mixed signal using zero frequency filtering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies.

[BibT_eX]

[DOI]

B. V. Sandeep Reddy

IEEE Trans. Speech Audio Process., 2009

Speaker Recognition under Limited Data Condition using LVQ and GMM-UBM.

[BibT_eX]

H. S. Jayanna

Proceedings of the 4th Indian International Conference on Artificial Intelligence, 2009

Significance of Word and Syllable Level Information for Expressive Speech Processing.

[BibT_eX]

[DOI]

T. V. Sagar

Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

2007

Determination of Instants of Significant Excitation in Speech Using Hilbert Envelope and Group Delay Function.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

MRASTA and PLP in automatic speech recognition.

[BibT_eX]

[DOI]

Hynek Hermansky

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Significance of Multimodal Biometric Systems.

[BibT_eX]

Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007

Speaker Recognition in Limited Data Conditions using Self-Organizing Map.

[BibT_eX]

H. S. Jayanna

Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007

Resonant Recognition Model for Prediction of 'Lead-In Regions' in Proteins.

[BibT_eX]

[DOI]

Mohenish Jaiswal

Latha Rangan

Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies 2007, 2007

2006

Extraction of speaker-specific excitation information from linear prediction residual of speech.

[BibT_eX]

[DOI]

Cheedella S. Gupta

Speech Commun., 2006

A Partial Image Encryption Method with Pseudo Random Sequences.

[BibT_eX]

[DOI]

Y. V. Subba Rao

Abhijit Mitra

Proceedings of the Information Systems Security, Second International Conference, 2006

2005

Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system.

[BibT_eX]

[DOI]

Jinu Mariam Zachariah

Cheedella S. Gupta

IEEE Trans. Speech Audio Process., 2005

Processing of reverberant speech for time-delay estimation.

[BibT_eX]

[DOI]

Ramani Duraiswami

Dmitry N. Zotkin

IEEE Trans. Speech Audio Process., 2005

Speaker Localization Using Excitation Source Information in Speech.

[BibT_eX]

[DOI]

Vikas C. Raykar

Ramani Duraiswami

IEEE Trans. Speech Audio Process., 2005

Detection of vowel onset point events using excitation information.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Text-Dependent Writer Identification using Word Length Analysis.

[BibT_eX]

Ankur Goel

Ankush Samant

Proceedings of the 2nd Indian International Conference on Artificial Intelligence, 2005

2004

Features for speaker and language identification.

[BibT_eX]

[DOI]

Leena Mary

K. Sri Rama Murty

Proceedings of the Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Enhancement of reverberant speech using excitation source information.

[BibT_eX]

[DOI]

M. Chaitanya

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Two-Stage Duration Model for Indian Languages Using Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing, 11th International Conference, 2004

Extraction of pitch in adverse conditions.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Enhancement of speech in multispeaker environment.

[BibT_eX]

[DOI]

Mathew Magimai-Doss

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Tracking a moving speaker using excitation source information.

[BibT_eX]

[DOI]

Vikas C. Raykar

Ramani Duraiswami

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Speech enhancement using excitation source information.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

Detection of vowel onset point in speech.

[BibT_eX]

[DOI]

Suryakanth V. Gangashetty

Jinu Mariam Zachariah

Proceedings of the IEEE International Conference on Acoustics, 2002

Linear and nonlinear compression of feature vectors for speech recognition.

[BibT_eX]

[DOI]