Zhong Meng

Orcid: 0000-0001-7814-5929

According to our database1, Zhong Meng authored at least 61 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models.
CoRR, 2024

2023
SLM: Bridge the thin gap between speech and text foundation models.
CoRR, 2023

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm.
CoRR, 2023

Massive End-to-end Models for Short Search Queries.
CoRR, 2023

Augmenting conformers with structured state space models for online speech recognition.
CoRR, 2023

Text Injection for Capitalization and Turn-Taking Prediction in Speech Models.
CoRR, 2023

Improving Joint Speech-Text Representations Without Alignment.
CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.
CoRR, 2023

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Modular Hybrid Autoregressive Transducer.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Separating Long-Form Speech with Group-wise Permutation Invariant Training.
Proceedings of the Interspeech 2022, 2022

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Streaming Multi-Talker ASR with Token-Level Serialized Output Training.
Proceedings of the Interspeech 2022, 2022

Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.
Proceedings of the Interspeech 2022, 2022

Continuous Speech Separation with Recurrent Selective Attention Network.
Proceedings of the IEEE International Conference on Acoustics, 2022

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

Factorized Neural Transducer for Efficient Language Model Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Exploring End-to-End Multi-Channel ASR with Bias Information for Meeting Transcription.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Investigation of End-to-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving Multilingual Transformer Transducer Models by Reducing Language Confusions.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

End-to-End Speaker-Attributed ASR with Transformer.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Sequence-Level Self-Teaching Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

Hypothesis Stitcher for End-to-End Speaker-Attributed ASR on Long-Form Multi-Talker Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2021

Continuous Speech Separation with Ad Hoc Microphone Arrays.
Proceedings of the 29th European Signal Processing Conference, 2021

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Active voice authentication.
Digit. Signal Process., 2020

Continuous speech separation: dataset and analysis.
CoRR, 2020

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability.
Proceedings of the Interspeech 2020, 2020

Serialized Output Training for End-to-End Overlapped Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers.
Proceedings of the Interspeech 2020, 2020

L-Vector: Neural Label Embedding for Domain Adaptation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Continuous Speech Separation: Dataset and Analysis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Speaker Adaptation for Attention-Based End-to-End Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Acoustic-to-Phrase Models for Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Adversarial Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Conditional Teacher-student Learning.
Proceedings of the IEEE International Conference on Acoustics, 2019

Attentive Adversarial Learning for Domain-invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2019

Adversarial Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Separation Using Speaker Inventory.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Character-Aware Attention-Based End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Discriminative and adaptive training for robust speech recognition and understanding.
PhD thesis, 2018

Speaker-Invariant Training via Adversarial Learning.
CoRR, 2018

Adversarial Feature-Mapping for Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Cycle-Consistent Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speaker-Invariant Training Via Adversarial Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting.
Proceedings of the Interspeech 2017, 2017

Minimum Semantic Error Cost Training of Deep Long Short-Term Memory Networks for Topic Spotting on Conversational Speech.
Proceedings of the Interspeech 2017, 2017

Deep long short-term memory adaptive beamforming networks for multichannel robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Unsupervised adaptation with domain separation networks for robust speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Statistical Modeling of Speaker's Voice with Temporal Co-Location for Active Voice Authentication.
Proceedings of the Interspeech 2016, 2016

Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting.
Proceedings of the Interspeech 2016, 2016


  Loading...