Vimal Manohar

According to our database¹, Vimal Manohar authored at least 35 papers between 2013 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

SemAlignVC: Enhancing zero-shot timbre conversion using semantic alignment.

[BibT_eX]

[DOI]

CoRR, July, 2025

Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Less Peaky and More Accurate CTC Forced Alignment by Label Priors.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Voice-Preserving Zero-Shot Multiple Accent Conversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Supervised Representations for Singing Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders.

[BibT_eX]

[DOI]

CoRR, 2022

2021

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

The JHU ASR System for VOiCES from a Distance Challenge 2019.

[BibT_eX]

[DOI]

Phani Sankar Nidadavolu

Daniel Povey

Sanjeev Khudanpur

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Using ASR Methods for OCR.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Acoustic Modeling for Overlapping Speech Recognition: Jhu Chime-5 Challenge System.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection.

[BibT_eX]

[DOI]

CoRR, 2018

A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

ASR for Under-Resourced Languages From Probabilistic Transcription.

[BibT_eX]

[DOI]

Mark A. Hasegawa-Johnson

Preethi Jyothi

Daniel McCloy

Majid Mirbagheri

Giovanni M. Di Liberto

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Using of heterogeneous corpora for training of an ASR system.

[BibT_eX]

[DOI]

CoRR, 2017

Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

The Kaldi OpenKWS System: Improving Low Resource Keyword Search.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An Exploration of Dropout with LSTMs.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning.

[BibT_eX]

[DOI]

Vimal Manohar

Daniel Povey

Sanjeev Khudanpur

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Investigation of transfer learning for ASR using LF-MMI trained neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Far-Field ASR Without Parallel Data.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Acoustic Modelling from the Signal Domain Using CNNs.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Adapting ASR for under-resourced languages using mismatched transcriptions.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Sanjeev Khudanpur

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Semi-supervised maximum mutual information training of deep neural network acoustic models.

[BibT_eX]

[DOI]

Vimal Manohar

Daniel Povey

Sanjeev Khudanpur

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

JHU ASpIRE system: Robust LVCSR with TDNNS, iVector adaptation and RNN-LMS.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

A keyword search system using open source software.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

2013

Acoustic modeling using transform-based phone-cluster adaptive training.

[BibT_eX]

[DOI]

Vimal Manohar

Srinivas C. Bhargav

Srinivasan Umesh

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Vimal Manohar

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...