Wei Kang

Affiliations:
  • Xiaomi Corp., Beijing, China


According to our database1, Wei Kang authored at least 14 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching.
CoRR, July, 2025

ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching.
CoRR, June, 2025

CR-CTC: Consistency regularization on CTC for improved speech recognition.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization.
CoRR, 2024

LibriheavyMix: A 20, 000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Zipformer: A faster and better encoder for automatic speech recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PromptASR for Contextualized ASR with Controllable Style.
Proceedings of the IEEE International Conference on Acoustics, 2024

Libriheavy: A 50, 000 Hours ASR Corpus with Punctuation Casing and Context.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Delay-penalized CTC Implemented Based on Finite State Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Blank-regularized CTC for Frame Skipping in Neural Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Delay-Penalized Transducer for Low-Latency Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fast and Parallel Decoding for Transducer.
Proceedings of the IEEE International Conference on Acoustics, 2023

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Pruned RNN-T for fast, memory-efficient ASR training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022


  Loading...