Hema A. Murthy

Orcid: 0000-0003-3611-6550

According to our database, Hema A. Murthy authored at least 179 papers between 1987 and 2023.

Bibliography

2023
Report on the 23rd International Society for Music Information Retrieval Conference (ISMIR 2022).
SIGIR Forum, June, 2023

Exploring the Role of Language Families for Building Indic Speech Synthesisers.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Enhancing COVID-19 Severity Analysis through Ensemble Methods.
CoRR, 2023

Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages.
CoRR, 2023

Identification and Severity Assessment of COVID-19 Using Lung CT Scans.
IEEE Access, 2023

E-TTS: Expressive Text-to-Speech Synthesis for Hindi Using Data Augmentation.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Segmentation and Analysis of Taniavartanam in Carnatic Music Concerts.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Ensemble Methods For Enhanced Covid-19 CT Scan Severity Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

Towards Developing State-of-The-Art TTS Synthesisers for 13 Indian Languages with Signal Processing Aided Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Structural Segmentation and Labeling of Tabla Solo Performances.
CoRR, 2022

Using Signal Processing in Tandem With Adapted Mixture Models for Classifying Genomic Signals.
CoRR, 2022

Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages.
CoRR, 2022

The Importance of Accurate Alignments in End-to-End Speech Synthesis.
CoRR, 2022

USS Directed E2E Speech Synthesis For Indian Languages.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

2021
Evidence of Task-Independent Person-Specific Signatures in EEG Using Subspace Techniques.
IEEE Trans. Inf. Forensics Secur., 2021

Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Signal-to-signal neural networks for improved spike estimation from calcium imaging data.
PLoS Comput. Biol., 2021

Functional parcellation of mouse visual cortex using statistical techniques reveals response-dependent clustering of cortical processing areas.
PLoS Comput. Biol., 2021

Acoustic unit discovery using transient and steady-state regions in speech and its applications.
J. Phonetics, 2021

Front-end Diarization for Percussion Separation in Taniavartanam of Carnatic Music Concerts.
CoRR, 2021

Towards Zero-Shot Learning with Fewer Seen Class Examples.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Tabla Gharana Recognition from Audio music recordings of Tabla Solo performances.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Dual Script E2E Framework for Multilingual and Code-Switching ASR.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Analysis of Conversational Speech with Application to Voice Adaptation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Importance of Signal Processing Cues in Transcription Correction for Low-Resource Indian Languages.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Significance of spectral cues in automatic speech segmentation for Indian language speech synthesizers.
Speech Commun., 2020

Zero-shot learning for action recognition using synthesized features.
Neurocomputing, 2020

Correlation based Multi-phasal models for improved imagined speech EEG recognition.
CoRR, 2020

Exploration of End-to-end Synthesisers for Zero Resource Speech Challenge 2020.
CoRR, 2020

Neural Speech Decoding During Audition, Imagination and Production.
IEEE Access, 2020

Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Comparison of Feature-Model Variants for coSpeech-EEG Classification.
Proceedings of the 2020 National Conference on Communications, 2020

P300 based Stereo localization of single frequency audio stimulus.
Proceedings of the 2020 National Conference on Communications, 2020

The "Sound of Silence" in EEG - Cognitive Voice Activity Detection.
Proceedings of the Interspeech 2020, 2020

Exploration of End-to-End Synthesisers for Zero Resource Speech Challenge 2020.
Proceedings of the Interspeech 2020, 2020

Generic Indic Text-to-Speech Synthesisers with Rapid Adaptation in an End-to-End Framework.
Proceedings of the Interspeech 2020, 2020

A Hybrid HMM-Waveglow Based Text-to-Speech Synthesizer Using Histogram Equalization for Low Resource Indian Languages.
Proceedings of the Interspeech 2020, 2020

State-Based Transcription of Components of Carnatic Music.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Spike Estimation From Fluorescence Signals Using High-Resolution Property of Group Delay.
IEEE Trans. Signal Process., 2019

Analysis of Inter-Pausal Units in Indian Languages and Its Application to Text-to-Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Spoof detection using x-vector and feature switching.
CoRR, 2019

Indic language computing.
Commun. ACM, 2019

Zero Resource Speech Synthesis Using Transcripts Derived from Perceptual Acoustic Units.
Proceedings of the Interspeech 2019, 2019

An Empirical Study of Speech Processing in the Brain by Analyzing the Temporal Syllable Structure in Speech-input Induced EEG.
Proceedings of the IEEE International Conference on Acoustics, 2019

Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Level-wise Subject adaptation to improve classification of motor and mental EEG tasks.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Time Warping Solutions for Classifying Artifacts in EEG.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Subspace techniques for task-independent EEG person identification.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Spoof Detection Using Time-Delay Shallow Neural Network and Feature Switching.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Transcription Correction Using Group Delay Processing for Continuous Speech Recognition.
Circuits Syst. Signal Process., 2018

Replay Attack Detection in Speaker Verification Using non-voiced segments and Decision Level Feature Switching.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

An SVD Based Approach for Spoken Language Identification.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Signal Processing Cues to Improve Automatic Speech Recognition for Low Resource Indian Languages.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Precision of Sung Notes in Carnatic Music.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Code-switching in Indic Speech Synthesisers.
Proceedings of the Interspeech 2018, 2018

Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech.
Proceedings of the Interspeech 2018, 2018

Decision-level Feature Switching as a Paradigm for Replay Attack Detection.
Proceedings of the Interspeech 2018, 2018

Transcription Correction for Indian Languages Using Acoustic Signatures.
Proceedings of the Interspeech 2018, 2018

Brain-Computer Interface using Electroencephalogram Signatures of Eye Blinks.
Proceedings of the Interspeech 2018, 2018

Resyllabification in Indian Languages and Its Implications in Text-to-speech Systems.
Proceedings of the Interspeech 2018, 2018

Early Vocabulary Development Through Picture-based Software Solutions.
Proceedings of the Interspeech 2018, 2018

Mobile Application for Learning Languages for the Unlettered.
Proceedings of the Interspeech 2018, 2018

Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts.
Proceedings of the Interspeech 2018, 2018

Single Trial P300 Classification Using Convolutional LSTM and Deep Learning Ensembles Method.
Proceedings of the Intelligent Human Computer Interaction - 10th International Conference, 2018

A Common Spatial Pattern Approach for Classification of Mental Counting and Motor Execution EEG.
Proceedings of the Intelligent Human Computer Interaction - 10th International Conference, 2018

A Generative Model for Zero Shot Learning Using Conditional Variational Autoencoders.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Feature-switching: Dynamic feature selection for an i-vector based speaker verification system.
Speech Commun., 2017

Two-pitch tracking in co-channel speech using modified group delay functions.
Speech Commun., 2017

Melody extraction from music using modified group delay functions.
Int. J. Speech Technol., 2017

Non-uniform time-scaling of Carnatic music transients.
CoRR, 2017

A semi-automatic method for transcription error correction for Indian language TTS systems.
Proceedings of the Twenty-third National Conference on Communications, 2017

An approach to transcription of varnams in Carnatic music using hidden Markov models.
Proceedings of the Twenty-third National Conference on Communications, 2017

Music genre classification by fusion of Modified Group Delay and Melodic Features.
Proceedings of the Twenty-third National Conference on Communications, 2017

Raga identification using Locality Sensitive Hashing.
Proceedings of the Twenty-third National Conference on Communications, 2017

A Statistical Analysis of Gamakas in Carnatic Music.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Onset Detection in Composition Items of Carnatic Music.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Discovering Language in Marmoset Vocalization.
Proceedings of the Interspeech 2017, 2017

TBT (Toolkit to Build TTS): A High Performance Framework to Build Multiple Language HTS Voice.
Proceedings of the Interspeech 2017, 2017

Deep Learning Techniques in Tandem with Signal Processing Cues for Phonetic Segmentation for Text to Speech Synthesis in Indian Languages.
Proceedings of the Interspeech 2017, 2017

GDspike: An accurate spike estimation algorithm from noisy calcium fluorescence signals.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
An analysis of the high resolution property of group delay function with applications to audio signal processing.
Speech Commun., 2016

Modified Group Delay Based MultiPitch Estimation in Co-Channel Speech.
CoRR, 2016

A Unified Parser for Developing Indian Language Text to Speech Synthesizers.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Organization-Level Control of Excessive Internet Downloads.
Proceedings of the 41st IEEE Conference on Local Computer Networks, 2016

Acoustic Analysis of Syllables Across Indian Languages.
Proceedings of the Interspeech 2016, 2016

Two-Pass IB Based Speaker Diarization System Using Meeting-Specific ANN Based Features.
Proceedings of the Interspeech 2016, 2016

Significance of Pseudo-syllables in building better acoustic models for Indian English TTS.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Eigen and multimodal analysis for localizing moving sounding objects.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Modified group delay feature based total variability space modelling for speaker recognition.
Int. J. Speech Technol., 2015

Pitch estimation from speech using Grating Compression Transform on Modified Group-Delay-gram.
Proceedings of the Twenty First National Conference on Communications, 2015

Building speech synthesis systems for Indian languages.
Proceedings of the Twenty First National Conference on Communications, 2015

Akshara transcription of mrudangam strokes in Carnatic music.
Proceedings of the Twenty First National Conference on Communications, 2015

Musical onset detection on Carnatic percussion instruments.
Proceedings of the Twenty First National Conference on Communications, 2015

Discovery of Syllabic Percussion Patterns in Tabla Solo Recordings.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Raga Verification in Carnatic Music Using Longest Common Segment Set.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

A multi-level resilience framework for unified networked environments.
Proceedings of the IFIP/IEEE International Symposium on Integrated Network Management, 2015

2014
Tonic-Independent Stroke Transcription of the Mridangam.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014

Group delay based phone segmentation for HTS.
Proceedings of the Twentieth National Conference on Communications, 2014

An approach to building language-independent text-to-speech synthesis for Indian languages.
Proceedings of the Twentieth National Conference on Communications, 2014

Analysis of fricatives, stop consonants and nasals in the automatic segmentation of speech using the group delay algorithm.
Proceedings of the Twentieth National Conference on Communications, 2014

URL classification using non negative matrix factorization.
Proceedings of the Twentieth National Conference on Communications, 2014

A modified rough longest common subsequence algorithm for motif spotting in an Alapana of Carnatic Music.
Proceedings of the Twentieth National Conference on Communications, 2014

A probabilistic approach to selecting units for speech synthesis based on acoustic similarity.
Proceedings of the Twentieth National Conference on Communications, 2014

Discovering Typical Motifs of a Raga from One-Liners of Songs in Carnatic Music.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

A hybrid approach to segmentation of speech using group delay processing and HMM based embedded reestimation.
Proceedings of the INTERSPEECH 2014, 2014

Feature Switching in the i-vector framework for speaker verification.
Proceedings of the INTERSPEECH 2014, 2014

2013
A common attribute based unified HTS framework for speech synthesis in Indian languages.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

A syllable-based framework for unit selection synthesis in 13 Indian languages.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Seamless integration of common framework Indian language TTSes in various applications.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

Inter and Intra Item Segmentation of Continuous Audio Recordings of Carnatic Music for Archival.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Motif Spotting in an Alapana in Carnatic Music.
Proceedings of the 14th International Society for Music Information Retrieval Conference, 2013

Group delay based melody monopitch extraction from music.
Proceedings of the IEEE International Conference on Acoustics, 2013

Modal analysis and transcription of strokes of the mridangam using non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2013

A syllable based statistical text to speech system.
Proceedings of the 21st European Signal Processing Conference, 2013

Analysis of vowel deletion in continuous speech.
Proceedings of the 21st European Signal Processing Conference, 2013

A novel application of group delay function for identifying tonic in Carnatic music.
Proceedings of the 21st European Signal Processing Conference, 2013

Cent Filter-Banks and its Relevance to Identifying the Main Song in Carnatic Music.
Proceedings of the Sound, Music, and Motion - 10th International Symposium, 2013

Is word-to-phone mapping better than phone-phone mapping for handling English words?
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Acoustic Segmentation Using Group Delay Functions and Its Relevance to Spoken Keyword Spotting.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Decoupling non-stationary and stationary components in long range network time series in the context of anomaly detection.
Proceedings of the 37th Annual IEEE Conference on Local Computer Networks, 2012

User traffic classification for proxy-server based internet access control.
Proceedings of the 6th International Conference on Signal Processing and Communication Systems, 2012

Unsupervised clustering of syllables for language identification.
Proceedings of the 20th European Signal Processing Conference, 2012

2011
Robustness of group delay representations for noisy speech signals.
Int. J. Speech Technol., 2011

An Online Inclusive Examination Framework for India.
Proceedings of the 2011 IEEE International Conference on Technology for Education, 2011

Indian Language Screen Readers and Syllable Based Festival Text-to-Speech Synthesis System.
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies, 2011

Time series models and its relevance to modeling TCP SYN based DoS attacks.
Proceedings of the 7th Conference on Next Generation Internet, 2011

On Convergence of Discriminative Training Algorithm for Speaker Recognition.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Personalized Intelligent Tutoring System Using Reinforcement Learning.
Proceedings of the Twenty-Fourth International Florida Artificial Intelligence Research Society Conference, 2011

2010
Feature Selection for Text Classification Based on Gini Coefficient of Inequality.
Proceedings of the Fourth International Workshop on Feature Selection in Data Mining, 2010

Acoustic feature diversity and speaker verification.
Proceedings of the INTERSPEECH 2010, 2010

Inference Based Query Expansion Using User's Real Time Implicit Feedback.
Proceedings of the Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2010

Dynamic Query Expansion based on User's Real Time Implicit Feedback.
Proceedings of the KDIR 2010, 2010

2009
Robustness of phase based features for speaker recognition.
Proceedings of the INTERSPEECH 2009, 2009

Dynamic selection of magnitude and phase based acoustic feature streams for speaker verification.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Determining user's interest in real time.
Proceedings of the 17th International Conference on World Wide Web, 2008

Methods for improving the quality of syllable based speech synthesis.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Significance of group delay based acoustic features in the linguistic search space for robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

Incorporating acoustic feature diversity into the linguistic search space for syllable based speech recognition.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Signal processing based segmentation and HMM based acoustic clustering of syllable segments for low bit rate segment vocoder at 1.4 Kbps.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Effect of word density on measuring words association.
Proceedings of the 1st Bangalore Annual Compute Conference, Compute 2008, 2008

2007
Significance of the Modified Group Delay Feature in Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing.
EURASIP J. Audio Speech Music. Process., 2007

Teaching - Learning Strategies in Interactive Education - A Case Study.
Proceedings of the Home Informatics and Telematics: ICT for The Next Billion, 2007

2006
Language identification using acoustic log-likelihoods of syllable-like units.
Speech Commun., 2006

Traffic Modeling and Classification Using Packet Train Length and Packet Train Size.
Proceedings of the Autonomic Principles of IP Operations and Management, 2006

A syllable based continuous speech recognizer for Tamil.
Proceedings of the INTERSPEECH 2006, 2006

Detection of Syn Flooding Attacks using Linear Prediction Analysis.
Proceedings of the 14th IEEE International Conference on Networks, 2006

Automatic identification of bird calls using Spectral Ensemble Average Voice Prints.
Proceedings of the 14th European Signal Processing Conference, 2006

Natural sounding TTS based on syllable-like units.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Speech Processing Using Joint Features Derived from the Modified Group Delay Function.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Automatic segmentation of continuous speech using minimum phase group delay functions.
Speech Commun., 2004

Subband-Based Group Delay Segmentation of Spontaneous Speech into Syllable-Like Units.
EURASIP J. Adv. Signal Process., 2004

Duration modeling of Indian languages Hindi and Telugu.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

The modified group delay feature: a new spectral representation of speech.
Proceedings of the INTERSPEECH 2004, 2004

A new prosodic phrasing model for Indian language Telugu.
Proceedings of the INTERSPEECH 2004, 2004

Continuous speech recognition using joint features derived from the modified group delay function and MFCC.
Proceedings of the INTERSPEECH 2004, 2004

Automatic transcription of continuous speech using unsupervised and incremental training.
Proceedings of the INTERSPEECH 2004, 2004

Distributed speaker recognition.
Proceedings of the INTERSPEECH 2004, 2004

Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification.
Proceedings of the Neural Information Processing, 11th International Conference, 2004

Language identification using parallel syllable-like unit recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Application of the modified group delay function to speaker identification and discrimination.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Automatic segmentation and labeling of continuous speech without bootstrapping.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Segmentation of speech into syllable-like units.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

The modified group delay function and its application to phoneme recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2000
An automatic algorithm for segmenting and labelling a connected digit sequence.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Language identification from short segments of speech.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Robust text-independent speaker identification over telephone channels.
IEEE Trans. Speech Audio Process., 1999

1995
Transformation of formants for voice conversion using artificial neural networks.
Speech Commun., 1995

1994
Pitch extraction from root cepstrum.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1992
Significance of group delay functions in spectrum estimation.
IEEE Trans. Signal Process., 1992

1991
Formant extraction from group delay function.
Speech Commun., 1991

Speech processing using group delay functions.
Signal Process., 1991

Processing of noisy speech using modified group delay functions.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Speech enhancement using group delay functions.
Proceedings of the First International Conference on Spoken Language Processing, 1990

1989
Formant extraction from Fourier transform phase.
Proceedings of the IEEE International Conference on Acoustics, 1989

A nonparametric method of formant estimation using group delay spectra.
Proceedings of the IEEE International Conference on Acoustics, 1989

1987
Processing of noisy speech using partial phase.
Proceedings of the European Conference on Speech Technology, 1987

Reconstruction from Fourier transform phase with applications to speech analysis.
Proceedings of the IEEE International Conference on Acoustics, 1987
