S. R. Mahadeva Prasanna

Orcid: 0000-0002-8135-7938

According to our database1, S. R. Mahadeva Prasanna authored at least 225 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Milestones in speaker recognition.
Artif. Intell. Rev., March, 2024

Spectro-Temporally Compressed Source Features for Replay Attack Detection.
IEEE Signal Process. Lett., 2024

Fake Speech Detection in Domain Variability Scenario.
Proceedings of the National Conference on Communications, 2024

2023
Multi-cultural speech emotion recognition using language and speaker cues.
Biomed. Signal Process. Control., May, 2023

Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Implicit Self-supervised Language Representation for Spoken Language Diarization.
CoRR, 2023

Implicit spoken language diarization.
CoRR, 2023

Spoken language change detection inspired by speaker change detection.
CoRR, 2023

Dialect Identification in Ao Using Modulation-Based Representation.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Significance of Audio Quality in Speech-to-Text Translation Systems.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Source and System-Based Modulation Approach for Fake Speech Detection.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Significance of Indic Self-supervised Speech Representations for Indic Under-Resourced ASR.
Proceedings of the Speech and Computer - 25th International Conference, 2023

I-MSV 2022: Indic-Multilingual and Multi-sensor Speaker Verification Challenge.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Design and Development of Voice OTP Authentication System.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Rhythm Formant Analysis for Automatic Depression Classification.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Driver Speech Detection in Real Driving Scenario.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Direct Vs Cascaded Speech-to-Speech Translation Using Transformer.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Optimizing Direct Speech-to-Text Translation for un-orthographic low-resource tribal languages using source transliterations.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Leveraging Cross Lingual Speech Representations To Build ASR For Under-resourced Languages.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Comparative Analysis of Direct Speech-to-Speech Translation and Voice Conversion Using Bi-LSTM.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Challenges in spoken language diarization in code-switched scenario.
Proceedings of the 28th National Conference on Communications, 2023

Investigation Of Data Augmentation Techniques For Bi-LSTM Based Direct Speech To Speech Translation.
Proceedings of the 28th National Conference on Communications, 2023

Classification of Cleft Lip and Palate Speech Using Fine-Tuned Transformer Pretrained Models.
Proceedings of the Intelligent Human Computer Interaction - 15th International Conference, 2023

2022
Speech/music classification using phase-based and magnitude-based features.
Speech Commun., 2022

Attention gated tensor neural network architectures for speech emotion recognition.
Biomed. Signal Process. Control., 2022

Importance of Supra-Segmental Information and Self-Supervised Framework for Spoken Language Diarization Task.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Fake Speech Detection Using Modulation Spectrogram.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Fake Speech Detection Using OpenSMILE Features.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Automatic Rhythm and Speech Rate Analysis of Mising Spontaneous Speech.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Speech Music Overlap Detection Using Spectral Peak Evolutions.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Overlapped Speech Detection Using AM-FM Based Time-Frequency Representations.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Issues in Sub-Utterance Level Language Identification in a Code Switched Bilingual Scenario.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Foreground-Background Audio Separation using Spectral Peaks based Time-Frequency Masks.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Speaker Anonymization for Machines using Sinusoidal Model.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Significance of excitation source sequence information for Speaker Verification.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Text to Speech System for Lambani - A Zero Resource, Tribal Language of India.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

Analysis of Layer-Wise Training in Direct Speech to Speech Translation Using BI-LSTM.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

Analyzing RMFCC Feature for Dialect Identification in Ao, an Under-Resourced Language.
Proceedings of the 27th National Conference on Communications, 2022

Low-Resource Dialect Identification in Ao Using Noise Robust Mean Hilbert Envelope Coefficients.
Proceedings of the 27th National Conference on Communications, 2022

Importance of excitation source and sequence learning towards spoken language identification task.
Proceedings of the 27th National Conference on Communications, 2022

Significance of Prosody Modification in Privacy Preservation on speaker verification.
Proceedings of the 27th National Conference on Communications, 2022

Machine Translation for a Very Low-Resource Language - Layer Freezing Approach on Transfer Learning.
Proceedings of the Fifth Workshop on Technologies for Machine Translation of Low-Resource Languages, 2022

Prosodic Information in Dialect Identification of a Tonal Language: The case of Ao.
Proceedings of the Interspeech 2022, 2022

2021
Event-Based Transformation of Misarticulated Stops in Cleft Lip and Palate Speech.
Circuits Syst. Signal Process., 2021

Exploration of Visual Features and their weighted-additive fusion for Video Captioning.
CoRR, 2021

Modification of misarticulated fricative /s/ in cleft lip and palate speech.
Biomed. Signal Process. Control., 2021

Multilingual Audio-Visual Smartphone Dataset and Evaluation.
IEEE Access, 2021

Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey.
IEEE Access, 2021

Learning Mizo Tones from F0 Contours Using 1D-CNN.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Excitation Source Feature Based Dialect Identification in Ao - A Low Resource Language.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Automatic Detection of Shouted Speech Segments in Indian News Debates.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Alzheimer's Dementia Recognition Using Multimodal Fusion of Speech and Text Embeddings.
Proceedings of the Intelligent Human Computer Interaction, 2021

Exploring Multimodal Features and Fusion for Time-Continuous Prediction of Emotional Valence and Arousal.
Proceedings of the Intelligent Human Computer Interaction, 2021

Processing Phoneme Specific Segments for Cleft Lip and Palate Speech Enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Vowel Onset Point Based Screening of Misarticulated Stops in Cleft Lip and Palate Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Speech/Music Classification Using Features From Spectral Peaks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Enhancement of cleft palate speech using temporal and spectral processing.
Speech Commun., 2020

Sinusoidal model-based hypernasality detection in cleft palate speech using CVCV sequence.
Speech Commun., 2020

Classification of Speech vs. Speech with Background Music.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Overlapped/Non-Overlapped Speech Transition Point Detection Using Bag-of-Audio-Words.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Language Specific Information from LP Residual Signal Using Linear Sub Band Filters.
Proceedings of the 2020 National Conference on Communications, 2020

Analysis of Excitation Source Characteristics for Shouted and Normal Speech Classification.
Proceedings of the 2020 National Conference on Communications, 2020

Lexical Tone Recognition in Mizo using Acoustic-Prosodic Features.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

VOP Detection in Variable Speech Rate Condition.
Proceedings of the Interspeech 2020, 2020

Spectral Moment and Duration of Burst of Plosives in Speech of Children with Hearing Impairment and Typically Developing Children - A Comparative Study.
Proceedings of the Interspeech 2020, 2020

2019
Detection of Nasalized Voiced Stops in Cleft Palate Speech Using Epoch-Synchronous Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Handwriting recognition using sinusoidal model parameters.
Pattern Recognit. Lett., 2019

Representation of online handwriting using multi-component sinusoidal model.
Pattern Recognit., 2019

Speech synthesis for glottal activity region processing.
Int. J. Speech Technol., 2019

An improved discriminative region selection methodology for online handwriting recognition.
Int. J. Document Anal. Recognit., 2019

Exploiting forced alignment of time-reversed data for improving HMM-based handwriting segmentation.
Expert Syst. Appl., 2019

Speech Enhancement Using Source Information for Phoneme Recognition of Speech with Background Music.
Circuits Syst. Signal Process., 2019

Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions.
Circuits Syst. Signal Process., 2019

Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification.
Circuits Syst. Signal Process., 2019

Robust Methods for Text-Dependent Speaker Verification.
Circuits Syst. Signal Process., 2019

Acoustic Correlates of Aspiration in Fricatives and Nasals.
Proceedings of the TENCON 2019, 2019

Emotion Recognition from Raw Speech using Wavenet.
Proceedings of the TENCON 2019, 2019

Shouted and Normal Speech Classification Using 1D CNN.
Proceedings of the Pattern Recognition and Machine Intelligence, 2019

RSL2019: A Realistic Speech Localization Corpus.
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Modelling Glottal Flow Derivative Signal for Detection of Replay Speech Samples.
Proceedings of the National Conference on Communications, 2019

Development of Assamese Text-to-speech System using Deep Neural Network.
Proceedings of the National Conference on Communications, 2019

Modification of Devoicing Error in Cleft Lip and Palate Speech.
Proceedings of the Interspeech 2019, 2019

Nasal Air Emission in Sibilant Fricatives of Cleft Lip and Palate Speech.
Proceedings of the Interspeech 2019, 2019

SpeechMarker: A Voice Based Multi-Level Attendance Application.
Proceedings of the Interspeech 2019, 2019

Hypernasality Severity Detection Using Constant Q Cepstral Coefficients.
Proceedings of the Interspeech 2019, 2019

Exploration of CNN Features for Online Handwriting Recognition.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Synthesis of Handwriting Dynamics using Sinusoidal Model.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Glottal Instants Extraction from Speech Signal Using Generative Adversarial Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Significance of sonority information for voiced/unvoiced decision in speech synthesis.
Speech Commun., 2018

Analysis of the Hilbert Spectrum for Text-Dependent Speaker Verification.
Speech Commun., 2018

Automatic syllabification of speech signal using short time energy and vowel onset points.
Int. J. Speech Technol., 2018

Significance of duration modification for speaker verification under mismatch speech tempo condition.
Int. J. Speech Technol., 2018

Multi-style speaker recognition database in practical conditions.
Int. J. Speech Technol., 2018

GMM posterior features for improving online handwriting recognition.
Expert Syst. Appl., 2018

Detection of the Glottal Closure Instants Using Empirical Mode Decomposition.
Circuits Syst. Signal Process., 2018

End Point Detection Using Speech-Specific Knowledge for Text-Dependent Speaker Verification.
Circuits Syst. Signal Process., 2018

Time-Frequency Audio Features for Speech-Music Classification.
CoRR, 2018

Speaker Identification Using Tensor Decomposition of Acoustic Models.
Proceedings of the TENCON 2018, 2018

Hypernasality Detection Using Zero Time Windowing.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Excitation Source Feature for Discriminating Shouted and Normal Speech.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Dialect Identification Using Tonal and Spectral Features in Two Dialects of Ao.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Processing Transition Regions of Glottal Stop Substituted /S/ for Intelligibility Enhancement of Cleft Palate Speech.
Proceedings of the Interspeech 2018, 2018

Spoken Keyword Detection Using Joint DTW-CNN.
Proceedings of the Interspeech 2018, 2018

Estimation of Hypernasality Scores from Cleft Lip and Palate Speech.
Proceedings of the Interspeech 2018, 2018

Detection of Glottal Activity Errors in Production of Stop Consonants in Children with Cleft Lip and Palate.
Proceedings of the Interspeech 2018, 2018

Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach.
Proceedings of the Interspeech 2018, 2018

Self-similarity Matrix Based Intelligibility Assessment of Cleft Lip and Palate Speech.
Proceedings of the Interspeech 2018, 2018

Exploration of Compressed ILPR Features for Replay Attack Detection.
Proceedings of the Interspeech 2018, 2018

Analysis of Breathiness in Contextual Vowel of Voiceless Nasals in Mizo.
Proceedings of the Interspeech 2018, 2018

Pitch-Adaptive Front-end Feature for Hypernasality Detection.
Proceedings of the Interspeech 2018, 2018

Robust Mizo Continuous Speech Recognition.
Proceedings of the Interspeech 2018, 2018

AGROASSAM: A Web Based Assamese Speech Recognition Application for Retrieving Agricultural Commodity Price and Weather Information.
Proceedings of the Interspeech 2018, 2018

Glotto Vibrato Graph: A Device and Method for Recording, Analysis and Visualization of Glottal Activity.
Proceedings of the Interspeech 2018, 2018

Exploring Discriminative HMM States for Improved Recognition of Online Handwriting.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Exploring Sparse Representation for Improved Online Handwriting Recognition.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

DNN-HMM Based Large Vocabulary Online Handwritten Assamese Word Recognition System.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

Investigating Text-independent Speaker Verification from Practically Realizable System Perspective.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Improvements in IITG Assamese Spoken Query System: Background Noise Suppression and Alternate Acoustic Modeling.
J. Signal Process. Syst., 2017

Development of Multi-Level Speech based Person Authentication System.
J. Signal Process. Syst., 2017

Sonority Measurement Using System, Source, and Suprasegmental Information.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Epoch Extraction From Telephone Quality Speech Using Single Pole Filter.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Enhancement of Spectral Tilt in Synthesized Speech.
IEEE Signal Process. Lett., 2017

Empirical Mode Decomposition for adaptive AM-FM analysis of Speech: A Review.
Speech Commun., 2017

Analysis of the Intrinsic Mode Functions for Speaker Information.
Speech Commun., 2017

Consonant-vowel unit recognition using dominant aperiodic and transition region detection.
Speech Commun., 2017

Exploring kernel discriminant analysis for speaker verification with limited test data.
Pattern Recognit. Lett., 2017

Glottal opening instants detection using zero frequency resonator.
Int. J. Speech Technol., 2017

Clean speech/speech with background music classification using HNGD spectrum.
Int. J. Speech Technol., 2017

Processing degraded speech for text dependent speaker verification.
Int. J. Speech Technol., 2017

Improved voicing decision using glottal activity features for statistical parametric speech synthesis.
Digit. Signal Process., 2017

Vowel onset point based characterization of velopharyngeal activity using imaging techniques.
Proceedings of the Twenty-third National Conference on Communications, 2017

Pause insertion in assamese synthesized speech using speech specific features.
Proceedings of the Twenty-third National Conference on Communications, 2017

Role of voice activity detection methods for the speakers in the wild challenge.
Proceedings of the Twenty-third National Conference on Communications, 2017

Vowel Onset Point Detection Using Sonority Information.
Proceedings of the Interspeech 2017, 2017

IITG-Indigo System for NIST 2016 SRE Challenge.
Proceedings of the Interspeech 2017, 2017

Indoor/Outdoor Audio Classification Using Foreground Speech Segmentation.
Proceedings of the Interspeech 2017, 2017

Acoustic Characterization of Word-Final Glottal Stops in Mizo and Assam Sora.
Proceedings of the Interspeech 2017, 2017

Hypernasality Severity Analysis in Cleft Lip and Palate Speech Using Vowel Space Area.
Proceedings of the Interspeech 2017, 2017

Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features.
Proceedings of the Interspeech 2017, 2017

Phase Modeling Using Integrated Linear Prediction Residual for Statistical Parametric Speech Synthesis.
Proceedings of the Interspeech 2017, 2017

Zero Frequency Filter Based Analysis of Voice Disorders.
Proceedings of the Interspeech 2017, 2017

2016
Foreground Speech Segmentation and Enhancement Using Glottal Closure Instants and Mel Cepstral Coefficients.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Feature optimisation for stress recognition in speech.
Pattern Recognit. Lett., 2016

A better decomposition of speech obtained using modified Empirical Mode Decomposition.
Digit. Signal Process., 2016

Speech / music classification using speech-specific features.
Digit. Signal Process., 2016

A Subspace Projection Approach for Analysis of Speech Under Stressed Condition.
Circuits Syst. Signal Process., 2016

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence.
Proceedings of the Interspeech 2016, 2016

Spectral Enhancement of Cleft Lip and Palate Speech.
Proceedings of the Interspeech 2016, 2016

Analysis of Glottal Stop in Assam Sora Language.
Proceedings of the Interspeech 2016, 2016

Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances.
Proceedings of the Interspeech 2016, 2016

Source modeling for HMM based speech synthesis using integrated LP residual.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Low Complexity On-Line Adaptation Techniques in Context of Assamese Spoken Query System.
J. Signal Process. Syst., 2015

Detection of Glottal Activity Using Different Attributes of Source Information.
IEEE Signal Process. Lett., 2015

Processing of linear prediction residual in spectral and cepstral domains for speaker information.
Int. J. Speech Technol., 2015

Epoch Extraction Using Zero Band Filtering from Speech Signal.
Circuits Syst. Signal Process., 2015

Characterizing glottal activity from speech using Empirical Mode Decomposition.
Proceedings of the Twenty First National Conference on Communications, 2015

Speaker change detection using excitation source and vocal tract system information.
Proceedings of the Twenty First National Conference on Communications, 2015

Curvature point based HMM state prediction for online handwritten assamese strokes recognition.
Proceedings of the Twenty First National Conference on Communications, 2015

Speech vs music discrimination using Empirical Mode Decomposition.
Proceedings of the Twenty First National Conference on Communications, 2015

Different aspects of source information for limited data speaker verification.
Proceedings of the Twenty First National Conference on Communications, 2015

Comparison of assamese character recognizer using stroke level and character level engines.
Proceedings of the Twenty First National Conference on Communications, 2015

Detection of mizo tones.
Proceedings of the INTERSPEECH 2015, 2015

Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances.
Proceedings of the INTERSPEECH 2015, 2015

2014
Analysis of Vocal Tract Constrictions using Zero Frequency Filtering.
IEEE Signal Process. Lett., 2014

Online Stroke and Akshara Recognition GUI in Assamese Language Using Hidden Markov Model.
CoRR, 2014

Speech biometric based attendance system.
Proceedings of the Twentieth National Conference on Communications, 2014

Epochs based compression of LP residual for source modeling in text-to-speech synthesis.
Proceedings of the Twentieth National Conference on Communications, 2014

Detection of vowel onset points in voiced aspirated sounds of indian languages.
Proceedings of the INTERSPEECH 2014, 2014

Combining source and system information for limited data speaker verification.
Proceedings of the INTERSPEECH 2014, 2014

2013
Speaker Verification by Vowel and Nonvowel Like Segmentation.
IEEE Trans. Speech Audio Process., 2013

Expressive speech synthesis: a review.
Int. J. Speech Technol., 2013

Dynamic prosody modification using zero frequency filtered signal.
Int. J. Speech Technol., 2013

Development and evaluation of online text-independent speaker verification system for remote person authentication.
Int. J. Speech Technol., 2013

A syllable-based framework for unit selection synthesis in 13 Indian languages.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Detection of glottal opening instants using Hilbert envelope.
Proceedings of the INTERSPEECH 2013, 2013

Significance of instants of significant excitation for source modeling.
Proceedings of the INTERSPEECH 2013, 2013

The IITG speaker verification systems for NIST SRE 2012.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Speaker verification in sensor and acoustic environment mismatch conditions.
Int. J. Speech Technol., 2012

Speaker verification using excitation source information.
Int. J. Speech Technol., 2012

Multivariability speaker recognition database in Indian scenario.
Int. J. Speech Technol., 2012

Foreground Speech Segmentation using Zero Frequency Filtered Signal.
Proceedings of the INTERSPEECH 2012, 2012

Assamese online handwritten digit recognition system using hidden Markov models.
Proceedings of the Proceeding of the workshop on Document Analysis and Recognition, 2012

2011
Significance of Vowel-Like Regions for Speaker Verification Under Degraded Conditions.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Enhancement of noisy speech by temporal and spectral processing.
Speech Commun., 2011

Recognition of consonant-vowel (CV) units under background noise using combined temporal and spectral preprocessing.
Int. J. Speech Technol., 2011

Spectral slope based analysis and classification of stressed speech.
Int. J. Speech Technol., 2011

Speaker verification under degraded condition: a perceptual study.
Int. J. Speech Technol., 2011

Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information.
Int. J. Speech Technol., 2011

Speaker recognition under limited data condition by noise addition.
Expert Syst. Appl., 2011

Neutral to Target Emotion Conversion Using Source and Suprasegmental Information.
Proceedings of the INTERSPEECH 2011, 2011

Epoch Extraction in High Pass Filtered Speech Using Hilbert Envelope.
Proceedings of the INTERSPEECH 2011, 2011

Study of robustness of zero frequency resonator method for extraction of fundamental frequency.
Proceedings of the IEEE International Conference on Acoustics, 2011

Chain Code Histogram Based Facial Image Feature Extraction under Degraded Conditions.
Proceedings of the Advances in Computing and Communications, 2011

2010
Two speaker speech separation by LP residual weighting and harmonics enhancement.
Int. J. Speech Technol., 2010

Analysis of excitation source information in emotional speech.
Proceedings of the INTERSPEECH 2010, 2010

Analysis of instantaneous F0 contours from two speakers mixed signal using zero frequency filtering.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies.
IEEE Trans. Speech Audio Process., 2009

Speaker Recognition under Limited Data Condition using LVQ and GMM-UBM.
Proceedings of the 4th Indian International Conference on Artificial Intelligence, 2009

Significance of Word and Syllable Level Information for Expressive Speech Processing.
Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

2007
Determination of Instants of Significant Excitation in Speech Using Hilbert Envelope and Group Delay Function.
IEEE Signal Process. Lett., 2007

MRASTA and PLP in automatic speech recognition.
Proceedings of the INTERSPEECH 2007, 2007

Significance of Multimodal Biometric Systems.
Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007

Speaker Recognition in Limited Data Conditions using Self-Organizing Map.
Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007

Resonant Recognition Model for Prediction of 'Lead-In Regions' in Proteins.
Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies 2007, 2007

2006
Extraction of speaker-specific excitation information from linear prediction residual of speech.
Speech Commun., 2006

A Partial Image Encryption Method with Pseudo Random Sequences.
Proceedings of the Information Systems Security, Second International Conference, 2006

2005
Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system.
IEEE Trans. Speech Audio Process., 2005

Processing of reverberant speech for time-delay estimation.
IEEE Trans. Speech Audio Process., 2005

Speaker Localization Using Excitation Source Information in Speech.
IEEE Trans. Speech Audio Process., 2005

Detection of vowel onset point events using excitation information.
Proceedings of the INTERSPEECH 2005, 2005

Text-Dependent Writer Identification using Word Length Analysis.
Proceedings of the 2nd Indian International Conference on Artificial Intelligence, 2005

2004
Features for speaker and language identification.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Enhancement of reverberant speech using excitation source information.
Proceedings of the INTERSPEECH 2004, 2004

Two-Stage Duration Model for Indian Languages Using Neural Networks.
Proceedings of the Neural Information Processing, 11th International Conference, 2004

Extraction of pitch in adverse conditions.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Enhancement of speech in multispeaker environment.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Tracking a moving speaker using excitation source information.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Speech enhancement using excitation source information.
Proceedings of the IEEE International Conference on Acoustics, 2002

Detection of vowel onset point in speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

Linear and nonlinear compression of feature vectors for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002


  Loading...