Yu Zhang

Orcid: 0009-0007-4594-0281

Affiliations:
  • Zhejiang University, Hangzhou, China


According to our database, Yu Zhang authored at least 14 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion.
CoRR, July, 2025

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.
CoRR, July, 2025

Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly.
CoRR, May, 2025

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting.
CoRR, April, 2025

Versatile Framework for Song Generation with Prompt-based Control.
CoRR, April, 2025

Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis.
CoRR, February, 2025

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Denoising algorithm for medical ultrasound image with improved threshold neighborhood-mean filtering.
Proceedings of the 2024 5th International Symposium on Artificial Intelligence for Medicine Science, 2024

TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Robust Singing Voice Transcription Serves Synthesis.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

