Di Wu
Affiliations:- Horizon Robotics, Beijing, China
- WeNet Open Source Community
- Mobvoi Inc., Beijing, China
According to our database1,
Di Wu authored at least 19 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis.
CoRR, April, 2026
Iterate to Differentiate: Enhancing Discriminability and Reliability in Zero-Shot TTS Evaluation.
CoRR, March, 2026
2025
CoRR, December, 2025
2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
TrimTail: Low-Latency Streaming ASR with Simple But Effective Spectrogram-Level Length Penalty.
Proceedings of the IEEE International Conference on Acoustics, 2023
Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
CoRR, 2021
CoRR, 2021
WeNet: Production Oriented Streaming and Non-Streaming End-to-End Speech Recognition Toolkit.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition.
CoRR, 2020
2019
Design of Gesture Recognition System Based on Multi-Channel Myoelectricity Correlation.
Proceedings of the 2019 IEEE Global Communications Conference, 2019