Yu Zhang
Orcid: 0009-0007-4594-0281Affiliations:
- Zhejiang University, Hangzhou, China
According to our database1,
Yu Zhang authored at least 18 papers
between 2024 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations.
CoRR, October, 2025
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.
CoRR, July, 2025
Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis.
CoRR, February, 2025
A Multimodal Evaluation Framework for Spatial Audio Playback Systems: From Localization to Listener Preference.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Denoising algorithm for medical ultrasound image with improved threshold neighborhood-mean filtering.
Proceedings of the 2024 5th International Symposium on Artificial Intelligence for Medicine Science, 2024
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024