Hemant A. Patil

Comput. Speech Lang., 2022

Improving the potential of Enhanced Teager Energy Cepstral Coefficients (ETECC) for replay attack detection.

[BibT_eX]

[DOI]

Rodrigo Capobianco Guido

Comput. Speech Lang., 2022

Analysis of Time-Averaged Feature Extraction Techniques on Infant Cry Classification.

[BibT_eX]

[DOI]

Aditya Pusuluri

Proceedings of the Speech and Computer - 24th International Conference, 2022

Significance of Energy Features for Severity Classification of Dysarthria.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Continuous Wavelet Transform for Severity-Level Classification of Dysarthria.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Significance of Distance on Pop Noise for Voice Liveness Detection.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Significance of Distance Measures for Speaker Anonymization.

[BibT_eX]

[DOI]

Gauri P. Prajapati

Dipesh K. Singh

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Morse Wavelet Features for Pop Noise Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Robustness of DAS Beamformer Over MVDR for Replay Attack Detection On Voice Assistants.

[BibT_eX]

[DOI]

Shreya S. Chaturvedi

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Noisy Student Teacher Training with Self Supervised Learning for Children ASR.

[BibT_eX]

[DOI]

Shreya S. Chaturvedi

Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

Teager Energy Based-Detection of One-point and Two-point Replay Attacks: Towards Cross-Database Generalization.

[BibT_eX]

[DOI]

Anand Therattil

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

The Impact of Room Acoustics on Replay Speech Signal.

[BibT_eX]

[DOI]

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Data Augmentation for Infant Cry Classification.

[BibT_eX]

[DOI]

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Effect of Speaker-Microphone Proximity on Pop Noise: Continuous Wavelet Transform-Based Approach.

[BibT_eX]

[DOI]

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Constant Q Cepstral coefficients for classification of normal vs. Pathological infant cry.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Subband Teager Energy Representations for Infant Cry Analysis and Classification.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

Voice Liveness Detection using Constant-Q Transform-Based Features.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

Non-Cepstral Uncertainty Vector for Replay Spoofed Speech Detection.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

Features Motivated From Uncertainty Principle for Classification of Normal vs. Pathological Infant Cry.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

Linear Frequency Residual Cepstral Features for Replay Spoof Detection on ASVSpoof 2019.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

Energy Separation Based Instantaneous Frequency Estimation from Quadrature and In-Phase Components for Replay Spoof Detection.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

Morlet Wavelet-Based Voice Liveness Detection using Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

2021

Non-intrusive quality assessment of noise-suppressed speech using unsupervised deep features.

[BibT_eX]

[DOI]

Speech Commun., 2021

Residual Neural Network precisely quantifies dysarthria severity-level based on short-duration speech segments.

[BibT_eX]

[DOI]

Rodrigo Capobianco Guido

Neural Networks, 2021

Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework.

[BibT_eX]

[DOI]

Nirmalya Sen

Md. Sahidullah

Krothapalli Sreenivasa Rao

Shyamal Kumar Das Mandal

Tapan Kumar Basu

Int. J. Speech Technol., 2021

Detection of replay spoof speech using teager energy feature cues.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2021

Modified Group Delay Function Using Different Spectral Smoothing Techniques for Voice Liveness Detection.

[BibT_eX]

[DOI]

Shrishti Singh

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Spectral Root Features for Replay Spoof Detection in Voice Assistants.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Voice Privacy Through Time-Scale and Pitch Modification.

[BibT_eX]

[DOI]

Gauri P. Prajapati

Dipesh K. Singh

Proceedings of the Pattern Recognition and Machine Intelligence, 2021

Voice Liveness Detection Using Bump Wavelet with CNN.

[BibT_eX]

[DOI]

Siddhant Gupta

Proceedings of the Pattern Recognition and Machine Intelligence, 2021

Voice Privacy Through x-Vector and CycleGAN-Based Anonymization.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cross-Teager Energy Cepstral Coefficients for Replay Spoof Detection on Voice Assistants.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Modified Group Delay Cepstral Coefficients for Voice Liveness Detection.

[BibT_eX]

[DOI]

Shrishti Singh

Proceedings of the 29th European Signal Processing Conference, 2021

Data Augmentation Using CycleGAN for End-to-End Children ASR.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

Exploiting Phase-based Features for Whisper vs. Speech Classification.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

Significance of Constant-Q Transform for Voice Liveness Detection.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

Teager Energy Subband Filtered Features for Near and Far-Field Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Deep Convolutional Neural Network for Voice Liveness Detection.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Combination of Amplitude and Frequency Modulation Features for Presentation Attack Detection.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2020

Amplitude and Frequency Modulation-based features for detection of replay Spoof Speech.

[BibT_eX]

[DOI]

Speech Commun., 2020

Effectiveness of Transfer Learning on Singing Voice Conversion in the Presence of Background Music.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Signal Processing and Communications, 2020

Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Signal Processing and Communications, 2020

Analysis of Teager Energy Profiles for Spoof Speech Detection.

[BibT_eX]

[DOI]

Aditya Krishna Sai Pulikonda

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Novel Variable Length Teager Energy Profiles for Replay Spoof Detection.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Mspec-Net : Multi-Domain Speech Conversion Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Weak Speech Supervision: A case study of Dysarthria Severity Classification.

[BibT_eX]

[DOI]

Proceedings of the 28th European Signal Processing Conference, 2020

Energy Separation Based Features for Replay Spoof Detection for Voice Assistant.

[BibT_eX]

[DOI]

Gauri P. Prajapati

Proceedings of the 28th European Signal Processing Conference, 2020

CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion.

[BibT_eX]

[DOI]

Proceedings of the 28th European Signal Processing Conference, 2020

Teager Energy Cepstral Coefficients for Classification of Normal vs. Whisper Speech.

[BibT_eX]

[DOI]

Proceedings of the 28th European Signal Processing Conference, 2020

Query-By-Example Spoken Term Detection Using Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Symmetry In The Structure Of Musical Nodes.

[BibT_eX]

[DOI]

Kirtana Sunil Phatnani

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Significance of CMVN for Replay Spoof Detection.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Subband Channel Selection using TEO for Replay Spoof Detection in Voice Assistants.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Design of Voice Privacy System using Linear Prediction.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

A novel approach to remove outliers for parallel voice conversion.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2019

Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2019

Novel Inception-GAN for Whispered-to-Normal Speech Conversion.

[BibT_eX]

[DOI]

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Novel Teager Energy Based Subband Features for Audio Acoustic Scene Detection and Classification.

[BibT_eX]

[DOI]

Aditya Krishna Sai Pulikonda

Proceedings of the Pattern Recognition and Machine Intelligence, 2019

Whether to Pretrain DNN or not?: An Empirical Analysis for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Energy Separation-Based Instantaneous Frequency Estimation for Cochlear Cepstral Feature for Replay Spoof Detection.

[BibT_eX]

[DOI]

Pulikonda Krishna Aditya Sai

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Novel Metric Learning for Non-parallel Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Analysis of Reverberation via Teager Energy Features for Replay Spoof Speech Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Energy Separation Algorithm Based Spectrum Estimation for Very Short Duration of Speech.

[BibT_eX]

[DOI]

Srikant Viswanath

Proceedings of the 27th European Signal Processing Conference, 2019

Combining Evidences from Variable Teager Energy Source and Mel Cepstral Features for Classification of Normal vs. Pathological Voices.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

Effectiveness of Cross-Domain Architectures for Whisper-to-Normal Speech Conversion.

[BibT_eX]

[DOI]

Proceedings of the 27th European Signal Processing Conference, 2019

Novel Enhanced Teager Energy Based Cepstral Coefficients for Replay Spoof Detection.

[BibT_eX]

[DOI]

Harsh Kotta

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Novel Adaptive Generative Adversarial Network for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speech Demodulation-based Techniques for Replay and Presentation Attack Detection.

[BibT_eX]

[DOI]

Aditya Krishna Sai Pulikonda

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Significance of Higher-Order Spectral Analysis in Infant Cry Classification.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., 2018

Combining evidences from magnitude and phase information using VTEO for person recognition using humming.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2018

Design of mixture of GMMs for Query-by-Example Spoken Term Detection.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2018

Feature Extraction from Temporal Phase for Speaker Recognition.

[BibT_eX]

[DOI]

Ami Gandhi

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Advances in Low Resource ASR: A Deep Learning Perspective.

[BibT_eX]

[DOI]

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Neural Networks-based Automatic Speech Recognition for Agricultural Commodity in Gujarati Language.

[BibT_eX]

[DOI]

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Relative Phase Shift Features for Replay Spoof Detection System.

[BibT_eX]

[DOI]

Srinivas Kantheti

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Combining Phase-based Features for Replay Spoof Detection System.

[BibT_eX]

[DOI]

Srinivas Kantheti

Rohan Kumar Das

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Novel Amplitude Weighted Frequency Modulation Features for Replay Spoof Detection.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Novel Empirical Mode Decomposition Cepstral Features for Replay Spoof Detection.

[BibT_eX]

[DOI]

Prasad Tapkir

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Novel Linear Frequency Residual Cepstral Features for Replay Attack Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion.

[BibT_eX]

[DOI]

Neil Shah

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Effectiveness of Dynamic Features in INCA and Temporal Context-INCA.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Auditory Filterbank Learning Using ConvRBM for Infant Cry Classification.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

DA-IICT/IIITV System for Low Resource Speech Recognition Challenge 2018.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Effectiveness of Speech Demodulation-Based Features for Replay Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Novel Variable Length Energy Separation Algorithm Using Instantaneous Amplitude Features for Replay Detection.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Time-Frequency Masking-Based Speech Enhancement Using Generative Adversarial Network.

[BibT_eX]

[DOI]

Neil Shah

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Novel Spectral Root Cepstral Features for Replay Spoof Detection.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Significance of Teager Energy Operator Phase for Replay Spoof Detection.

[BibT_eX]

[DOI]

Prasad A. Tapkir

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Replay Spoof Detection using Power Function Based Features.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Novel Inter Mixture Weighted GMM Posteriorgram for DNN and GAN-based Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Time-Frequency Mask-based Speech Enhancement using Convolutional Generative Adversarial Network.

[BibT_eX]

[DOI]

Neil Shah

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

A Survey on Replay Attack Detection for Automatic Speaker Verification (ASV) System.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Significance of Source-Filter Interaction for Classification of Natural vs. Spoofed Speech.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2017

Cochlear Filter and Instantaneous Frequency Based Features for Spoofed Speech Detection.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2017

Partial matching and search space reduction for QbE-STD.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Novel Phase Encoded Mel Cepstral Features for Speaker Verification.

[BibT_eX]

[DOI]

Apeksha J. Naik

Rishabh Tak

Proceedings of the Speech and Computer - 19th International Conference, 2017

Novel Linear Prediction Temporal Phase Based Features for Speaker Recognition.

[BibT_eX]

[DOI]

Ami Gandhi

Proceedings of the Speech and Computer - 19th International Conference, 2017

Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker Verification.

[BibT_eX]

[DOI]

Purvi Agrawal

Proceedings of the Speech and Computer - 19th International Conference, 2017

Novel Phase Encoded Mel Filterbank Energies for Environmental Sound Classification.

[BibT_eX]

[DOI]

Rishabh N. Tak

Dharmesh M. Agrawal

Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Novel Gammatone Filterbank Based Spectro-Temporal Features for Robust Phoneme Recognition.

[BibT_eX]

[DOI]

Ankit Nagpal

Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Spoken Keyword Retrieval Using Source and System Features.

[BibT_eX]

[DOI]

Nikhil Bhendawade

Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Effectiveness of Mel Scale-Based ESA-IFCC Features for Classification of Natural vs. Spoofed Speech.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Novel Shifted Real Spectrum for Exact Signal Reconstruction.

[BibT_eX]

[DOI]

Rishabh Tak

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Unsupervised Representation Learning Using Convolutional Restricted Boltzmann Machine for Spoof Speech Detection.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Unsupervised Filterbank Learning Using Convolutional Restricted Boltzmann Machine for Environmental Sound Classification.

[BibT_eX]

[DOI]

Dharmesh M. Agrawal

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Novel Amplitude Scaling method for bilinear frequency Warping-based Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Quality assessment of voice converted speech using articulatory features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Sub-band Autoencoder features for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Advances in Pattern Recognition, 2017

Unsupervised Filterbank Learning for Speech-based Access System for Agricultural Commodity.

[BibT_eX]

[DOI]

Avni Rajpal

Proceedings of the Ninth International Conference on Advances in Pattern Recognition, 2017

Two Stage Zero-resource Approaches for QbE-STD.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Advances in Pattern Recognition, 2017

Novel Energy Separation Based Frequency Modulation Features for Spoofed Speech Classification.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Advances in Pattern Recognition, 2017

Effectiveness of ideal ratio mask for non-intrusive quality assessment of noise suppressed speech.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

VTLN-warped Gaussian posteriorgram for QbE-STD.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

Novel energy separation based instantaneous frequency features for spoof speech detection.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

Novel TEO-based Gammatone features for environmental sound classification.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

On the convergence of INCA algorithm.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A novel filtering-based F0 estimation algorithm with an application to voice conversion.

[BibT_eX]

[DOI]

Pramod B. Bachhav

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Combining evidences from detection sources for query-by-example spoken term detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Newborn infant's cry analysis.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2016

Spectral analysis of infant cries and adult speech.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2016

Non-intrusive Quality Assessment of Synthesized Speech using Spectral Features and Support Vector Regression.

[BibT_eX]

[DOI]

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Novel Pre-processing using Outlier Removal in Voice Conversion.

[BibT_eX]

[DOI]

Sushant V. Rao

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Jerk Minimization for Acoustic-To-Articulatory Inversion.

[BibT_eX]

[DOI]

Avni Rajpal

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Analysis of hierarchical bottleneck framework for improved phoneme recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Modification in sequential dynamic time warping for fast computation of query-by-example spoken term detection task.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

A novel lowpass filtering-based approach for estimating strength of excitation from speech signal.

[BibT_eX]

[DOI]

Deep Gandhi

Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Novel Subband Autoencoder Features for Detection of Spoofed Speech.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Novel Subband Autoencoder Features for Non-Intrusive Quality Assessment of Noise Suppressed Speech.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Unsupervised Deep Auditory Model Using Stack of Convolutional RBMs for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Native Language Identification Using Spectral and Source-Based Features.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Novel Nonlinear Prediction Based Features for Spoofed Speech Detection.

[BibT_eX]

[DOI]

Himanshu N. Bhavsar

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Filterbank learning using Convolutional Restricted Boltzmann Machine for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Analysis of natural and synthetic speech using Fujisaki model.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Effectiveness of fundamental frequency (F0) and strength of excitation (SOE) for spoofed speech detection.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Novel deep autoencoder features for non-intrusive speech quality assessment.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Unsupervised learning of temporal receptive fields using convolutional RBM for ASR task.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

2015

Combining Evidences from Mel Cepstral and Cochlear Cepstral Features for Speaker Recognition Using Whispered Speech.

[BibT_eX]

[DOI]

Aditya Raikar

Ami Gandhi

Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Vocal Tract Length Normalization Features for Audio Search.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Modified Group Delay Based Features for Asthma and HIE Infant Cries Classification.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Significance of Unvoiced Segments and Fundamental Frequency in Infant Cry Analysis.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Combining Evidences from Bark Scale and Mel Scale Warped Features for VTLN.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Perception and Machine Intelligence, 2015

Significance of Phase-based Features for Person Recognition Using Humming.

[BibT_eX]

[DOI]

Proceedings of the 2nd International Conference on Perception and Machine Intelligence, 2015

Classification of Stop Consonants using Modulation Spectrogram-Based Features.

[BibT_eX]

[DOI]

Kewal D. Malde

Proceedings of the 2nd International Conference on Perception and Machine Intelligence, 2015

Fusion of TEO Phase with MFCC Features for Speaker Verification.

[BibT_eX]

[DOI]

Purvi Agrawal

Proceedings of the 2nd International Conference on Perception and Machine Intelligence, 2015

Combining evidences from mel cepstral, cochlear filter cepstral and instantaneous frequency features for detection of natural vs. spoofed speech.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A novel filtering based approach for epoch extraction.

[BibT_eX]

[DOI]

Pramod B. Bachhav

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Effectiveness of multiscale fractal dimension for improvement of frame classification rate.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

Spectral transition measure for detection of obstruents.

[BibT_eX]

[DOI]

Bhavik B. Vachhani

Proceedings of the 23rd European Signal Processing Conference, 2015

Classification of normal and pathological infant cries using bispectrum features.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages.

[BibT_eX]

[DOI]

Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Obstruent classification using modulation spectrogram based features.

[BibT_eX]

[DOI]

Kewal D. Malde

Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Effectiveness of fractal dimension for ASR in low resource language.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Exploiting speech source information for vowel landmark detection for low resource language.

[BibT_eX]

[DOI]

Ankur G. Undhad

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Deterministic annealing EM algorithm for developing TTS system in Gujarati.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Fusion of magnitude and phase-based features for objective evaluation of TTS voice.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Novel approach for estimating length of the vocal folds using Fujisaki model.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Exploiting Variable length Teager Energy Operator in melcepstral features for person recognition from humming.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Classification of pathological infant cries using modulation spectrogram features.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Chaotic mixed excitation source for speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Effectiveness of PLP-based phonetic segmentation for speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

A spectral transition measure based MELCEPSTRAL features for obstruent detection.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Vocal tract length normalization for vowel recognition in low resource languages.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Influence of various asymmetrical contextual factors for TTS in a low resource language.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

A Cepstral Mean Subtraction based features for Singer Identification.

[BibT_eX]

[DOI]

Purushotam G. Radadia

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Nonlinear analysis of natural vs. HTS-based synthetic speech.

[BibT_eX]

[DOI]

S. Adarsa

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Development of language resources for speech application in Gujarati and Marathi.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Use of glottal inverse filtering for asthma and HIE infant cries classification.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Classification of phonemes using modulation spectrogram based features for Gujarati language.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Asian Language Processing, 2014

The Blizzard Challenge 2014.

[BibT_eX]

[DOI]

Kishore Prahallad

Anandaswarup Vadapalli

S. R. Mahadeva Prasanna

Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

2013

Classification of Fricatives Using Novel Modulation Spectrogram Based Features.

[BibT_eX]

[DOI]

Kewal D. Malde

Proceedings of the Pattern Recognition and Machine Intelligence, 2013

Speaker Recognition Using Sparse Representation via Superimposed Features.

[BibT_eX]

[DOI]

Yashesh Gaur

Proceedings of the Pattern Recognition and Machine Intelligence, 2013

Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

A syllable-based framework for unit selection synthesis in 13 Indian languages.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Development of speech corpora in Gujarati and Marathi for phonetic transcription.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Data collection and corpus design for analysis of nonnal and pathological infant cry.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Development of corpora for person recognition using humming, singing and speech.

[BibT_eX]

[DOI]

Nirav H. Chhayani

Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Nirmalya Sen

Shyamal Kr. Das Mandal

K. Sreenivasa Rao

Proceedings of the Mining Intelligence and Knowledge Exploration, 2013

Nonlinear prediction of speech signal using volterra-wiener series.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Use of PLP Cepstral Features for Phonetic Segmentation.

[BibT_eX]

[DOI]

Bhavik B. Vachhani

Proceedings of the 2013 International Conference on Asian Language Processing, 2013

A Novel Gaussian Filter-Based Automatic Labeling of Speech Data for TTS System in Gujarati Language.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference on Asian Language Processing, 2013

2012

Static and dynamic information derived from source and system features for person recognition from humming.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2012

Combining Evidence from Temporal and Spectral Features for Person Recognition Using Humming.

[BibT_eX]

[DOI]

Proceedings of the Perception and Machine Intelligence - First Indo-Japan Conference, 2012

Novel Interleaving Schemes for Speaker Recognition over Lossy Networks.

[BibT_eX]

[DOI]

Parth A. Goswami

Tapan Kumar Basu

Proceedings of the Perception and Machine Intelligence - First Indo-Japan Conference, 2012

Significance of magnitude and phase information via VTEO for humming based biometrics.

[BibT_eX]

[DOI]

Proceedings of the 5th IAPR International Conference on Biometrics, 2012

A comparison of waveform fractal dimension techniques for voice pathology classification.

[BibT_eX]

[DOI]

Pallavi N. Baljekar

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Combining Evidences from Mel Cepstral Features and Cepstral Mean Subtracted Features for Singer Identification.

[BibT_eX]

[DOI]

Purushotam G. Radadia

Proceedings of the 2012 International Conference on Asian Language Processing, 2012

Phonetic Transcription of Fricatives and Plosives for Gujarati and Marathi Languages.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Conference on Asian Language Processing, 2012

Person Recognition Using Humming, Singing and Speech.

[BibT_eX]

[DOI]

Nirav H. Chhayani

Proceedings of the 2012 International Conference on Asian Language Processing, 2012

2011

Effectiveness of Teager energy operator for epoch detection from speech signals.

[BibT_eX]

[DOI]

Viswanath Srikanth

Int. J. Speech Technol., 2011

Combining Evidence from Spectral and Source-Like Features for Person Recognition from Humming.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Novel VTEO Based Mel Cepstral Features for Classification of Normal and Pathological Voices.

[BibT_eX]

[DOI]

Pallavi N. Baljekar

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Novel Temporal and Spectral Features Derived from TEO for Classification Normal and Dysphonic Voices.

[BibT_eX]

[DOI]

Pallavi N. Baljekar

Proceedings of the Frontiers in Computer Education [International Conference on Frontiers in Computer Education, 2011

Design of a Query-by-Humming System for Hindi Songs Using DDTW Based Approach.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Asian Language Processing, 2011

2010

Novel Variable length Teager Energy Based features for person recognition from their hum.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Variable Length Teager Energy Based Mel Cepstral Features for Identification of Twins.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Machine Intelligence, 2009

Design and Implementation of HMM-VQ based Isolated Digit Recognition System.

[BibT_eX]

Mayank Mishra

Proceedings of the 4th Indian International Conference on Artificial Intelligence, 2009

DA-IICT Cross-lingual and Multilingual Corpora for Speaker Recognition.

[BibT_eX]

[DOI]

Sunayana Sitaram

Esha Sharma

Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

A Novel Approach to Identification of Speakers from Their Hum.

[BibT_eX]

[DOI]

Prakhar Kant Jain

Robin Jain

Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

A Novel Modified Polynomial Network Design for Dialect Recognition.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

Infant Identification from Their Cry.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009

2008

A Novel Approach to Language Identification Using Modified Polynomial Networks.

[BibT_eX]

[DOI]

Proceedings of the Speech, 2008

Development of speech corpora for speaker recognition research and evaluation in Indian languages.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2008

LP spectra vs. Mel spectra for identification of professional mimics in Indian languages.

[BibT_eX]

[DOI]

Tapan Kumar Basu

Int. J. Speech Technol., 2008

Identifying Perceptually Similar Languages Using Teager Energy Based Cepstrum.

[BibT_eX]

[DOI]

Eng. Lett., 2008

Identification of Speakers from Their Hum.

[BibT_eX]

[DOI]

Robin Jain

Prakhar Kant Jain

Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008

On the development of variable length Teager energy operator (VTEO).

[BibT_eX]

[DOI]

Vikrant Tomar

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Machine Intelligence, 2007

Advances in Speaker Recognition: A Feature Based Approach.

[BibT_eX]

Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition, 2007

Identifying Phonetically Similar Languages Using Teager Energy Based Cepstrum.

[BibT_eX]

Proceedings of the International Conference on Artificial Intelligence and Pattern Recognition, 2007

2006

Design of Cross-lingual and Multilingual Corpora for Speaker Recognition Research and Evaluation in Indian Languages.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

A New Data Fusion Technique and Performance Measure for Identification of Twins in Marathi.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Design of Cubic Spline Wavelet for Open Set Speaker Classification in Marathi.

[BibT_eX]

[DOI]

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

2005

The Wavelet Packet Based Cepstral Features for Open Set Speaker Classification in Marathi.

[BibT_eX]

[DOI]

Pranab Kumar Dutta

Proceedings of the From Data and Information Analysis to Knowledge Engineering, 2005

2004

The Teager Energy Based Features for Identification of Identical Twins in Multi-lingual Environment.

[BibT_eX]

[DOI]