Srikanth R. Madikeri

Orcid: 0000-0002-4361-784X

According to our database1, Srikanth R. Madikeri authored at least 58 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR.
CoRR, 2024


Normalizing Flows for Speaker and Language Recognition Backend.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Probability-Aware Word-Confusion-Network-To-Text Alignment Approach for Intent Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers.
Proceedings of the IEEE International Conference on Acoustics, 2024

Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding.
CoRR, 2023

Implementing Contextual Biasing in GPU Decoder for Online ASR.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023

Parameter-Efficient Tuning with Adaptive Bottlenecks for Automatic Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Expanded Lattice Embeddings for Spoken Document Retrieval on Informal Meetings.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Speaker Recognition on Mono-Channel Telephony Recordings.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

2021
Novel Architectures for Unsupervised Information Bottleneck Based Speaker Diarization of Meetings.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Comparing CTC and LFMMI for Out-of-Domain Adaptation of wav2vec 2.0 Acoustic Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech Activity Detection Based on Multilingual Speech Recognition System.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multitask Adaptation with Lattice-Free MMI for Multi-Genre Speech Recognition of Low Resource Languages.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Lattice-Free Mmi Adaptation of Self-Supervised Pretrained Acoustic Models.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparison of Methods for OOV-Word Recognition on a New Public Dataset.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Pkwrap: a PyTorch Package for LF-MMI Training of Acoustic Models.
CoRR, 2020

Quantization of Acoustic Model Parameters in Automatic Speech Recognition Framework.
CoRR, 2020

Improving Speaker Identification using Network Knowledge in Criminal Conversational Data.
CoRR, 2020

Supervised Domain Adaptation for Text-Independent Speaker Verification Using Limited Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Incremental Semi-Supervised Learning for Multi-Genre Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Voice Presentation Attack Detection Using Convolutional Neural Networks.
Proceedings of the Handbook of Biometric Anti-Spoofing, 2019

Regularization Advantages of Multilingual Neural Language Models for Low Resource Domains.
CoRR, 2019

A Bayesian Approach to Inter-task Fusion for Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Incremental Transfer Learning in Two-pass Information Bottleneck Based Speaker Diarization System for Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2019

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Analysis of Language Dependent Front-End for Speaker Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

End-to-end Text-dependent Speaker Verification Using Novel Distance Measures.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

DNN Based Speaker Embedding Using Content Information for Text-Dependent Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Template-matching for text-dependent speaker verification.
Speech Commun., 2017

Content Normalization for Text-Dependent Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Intra-class covariance adaptation in PLDA back-ends for speaker verification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Exploiting sequence information for text-dependent Speaker Verification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Towards a Breakthrough Speaker Identification Approach for Law Enforcement Agencies: SIIP.
Proceedings of the European Intelligence and Security Informatics Conference, 2017

2016
Speaker Diarization and Linking of Meeting Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A Large-Scale Open-Source Acoustic Simulator for Speaker Recognition.
IEEE Signal Process. Lett., 2016

Inter-Task System Fusion for Speaker Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Two-Pass IB Based Speaker Diarization System Using Meeting-Specific ANN Based Features.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

System fusion and speaker linking for longitudinal diarization of TV shows.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Information theoretic clustering for unsupervised domain-adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep neural network based posteriors for text-dependent speaker verification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Modified group delay feature based total variability space modelling for speaker recognition.
Int. J. Speech Technol., 2015

Integrating online i-vector extractor with information bottleneck based speaker diarization system.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Employment of Subspace Gaussian Mixture Models in speaker recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Combining SGMM speaker vectors and KL-HMM approach for speaker diarization.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

KL-HMM based speaker diarization system for meetings.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Towards utterance-based neural network adaptation in acoustic modeling.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
A fast and scalable hybrid FA/PPCA-based framework for speaker recognition.
Digit. Signal Process., 2014

Feature Switching in the i-vector framework for speaker verification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Filterbank slope based features for speaker diarization.
Proceedings of the IEEE International Conference on Acoustics, 2014

2012
Acoustic Segmentation Using Group Delay Functions and Its Relevance to Spoken Keyword Spotting.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

A hybrid factor analysis and probabilistic PCA-based system for dictionary learning and encoding for robust speaker recognition.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

2011
On Convergence of Discriminative Training Algorithm for Speaker Recognition.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011


  Loading...