Xiaohui Zhang

Affiliations:
  • Meta AI, New York, NY, USA
  • Johns Hopkins University, Center for Language and Speech Processing, Baltimore, MD, USA (PhD)


According to our database1, Xiaohui Zhang authored at least 28 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch.
CoRR, 2023

Scaling Speech Technology to 1, 000+ Languages.
CoRR, 2023

Anchored Speech Recognition with Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2023

TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

2022
Omni-Sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR Via Supernet.
Proceedings of the IEEE International Conference on Acoustics, 2022

Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
CoRR, 2020

Multilingual Graphemic Hybrid ASR with Massive Data Augmentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
Proceedings of the Interspeech 2020, 2020

OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Deja-vu: Double Feature Presentation in Deep Transformer Networks.
CoRR, 2019

Multilingual ASR with Massive Data Augmentation.
CoRR, 2019

From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2017
Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework.
Proceedings of the Interspeech 2017, 2017

Backstitch: Counteracting Finite-Sample Bias via Negative Steps.
Proceedings of the Interspeech 2017, 2017

The Kaldi OpenKWS System: Improving Low Resource Keyword Search.
Proceedings of the Interspeech 2017, 2017

2015
Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging.
Proceedings of the 3rd International Conference on Learning Representations, 2015

A diversity-penalizing ensemble training method for deep learning.
Proceedings of the INTERSPEECH 2015, 2015

2014
A keyword search system using open source software.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Improving speaker recognition performance in the domain adaptation challenge using deep neural networks.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Improving deep neural network acoustic models using generalized maxout networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

2011
Flaw classification in ultrasonic guided waves signal using Wavelet Transform and PNN classifier.
Proceedings of the 2011 International Conference on Wireless Communications & Signal Processing, 2011


  Loading...