Yu Zhang

Orcid: 0009-0007-4594-0281

Affiliations:

Zhejiang University, Hangzhou, China

According to our database¹, Yu Zhang authored at least 16 papers between 2024 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations.

[BibT_eX]

[DOI]

CoRR, October, 2025

ASAudio: A Survey of Advanced Spatial Audio Research.

[BibT_eX]

[DOI]

CoRR, August, 2025

Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion.

[BibT_eX]

[DOI]

Yu Zhang

Baotong Tian

Zhiyao Duan

CoRR, July, 2025

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.

[BibT_eX]

[DOI]

CoRR, July, 2025

Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly.

[BibT_eX]

[DOI]

CoRR, May, 2025

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting.

[BibT_eX]

[DOI]

CoRR, April, 2025

Versatile Framework for Song Generation with Prompt-based Control.

[BibT_eX]

[DOI]

CoRR, April, 2025

Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, February, 2025

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Denoising algorithm for medical ultrasound image with improved threshold neighborhood-mean filtering.

[BibT_eX]

[DOI]

Yu Zhang

Ruiqi Li

Proceedings of the 2024 5th International Symposium on Artificial Intelligence for Medicine Science, 2024

TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Robust Singing Voice Transcription Serves Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Yu Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...