Vimal Manohar

According to our database, Vimal Manohar authored at least 33 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Voice-Preserving Zero-Shot Multiple Accent Conversion.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Self-Supervised Representations for Singing Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders.
CoRR, 2022

2021
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models.
CoRR, 2021

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR.
Proceedings of the Interspeech 2020, 2020

2019
The JHU ASR System for VOiCES from a Distance Challenge 2019.
Proceedings of the Interspeech 2019, 2019

Using ASR Methods for OCR.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Acoustic Modeling for Overlapping Speech Recognition: JHU CHiME-5 Challenge System.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

2018
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection.
CoRR, 2018

A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages.
Proceedings of the Interspeech 2018, 2018

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge.
Proceedings of the Interspeech 2018, 2018

Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017
ASR for Under-Resourced Languages From Probabilistic Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Using of heterogeneous corpora for training of an ASR system.
CoRR, 2017

Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework.
Proceedings of the Interspeech 2017, 2017

The Kaldi OpenKWS System: Improving Low Resource Keyword Search.
Proceedings of the Interspeech 2017, 2017

An Exploration of Dropout with LSTMs.
Proceedings of the Interspeech 2017, 2017

JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Investigation of transfer learning for ASR using LF-MMI trained neural networks.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI.
Proceedings of the Interspeech 2016, 2016

Far-Field ASR Without Parallel Data.
Proceedings of the Interspeech 2016, 2016

Acoustic Modelling from the Signal Domain Using CNNs.
Proceedings of the Interspeech 2016, 2016

Adapting ASR for under-resourced languages using mismatched transcriptions.
Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015
Semi-supervised maximum mutual information training of deep neural network acoustic models.
Proceedings of the INTERSPEECH 2015, 2015

JHU ASpIRE system: Robust LVCSR with TDNNs, iVector adaptation and RNN-LMs.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
A keyword search system using open source software.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

2013
Acoustic modeling using transform-based phone-cluster adaptive training.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

