Yongyi Zang

According to our database1, Yongyi Zang authored at least 24 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2026
ASVspoof 5: Design, collection and validation of resources for spoofing, deepfake, and adversarial attack detection using crowdsourced speech.
Comput. Speech Lang., 2026

2025
Efficient Vocal Source Separation Through Windowed Sink Attention.
CoRR, October, 2025

Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders.
CoRR, October, 2025

PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching.
CoRR, October, 2025

StylePitcher: Generating Style-Following and Expressive Pitch Curves for Versatile Singing Tasks.
CoRR, October, 2025

FlowSynth: Instrument Generation Through Distributional Flow Matching and Test-Time Search.
CoRR, October, 2025

Smule Renaissance Small: Efficient General-Purpose Vocal Restoration.
CoRR, October, 2025

MSRBench: A Benchmarking Dataset for Music Source Restoration.
CoRR, October, 2025

Music Source Restoration.
CoRR, May, 2025

Training-Free Multi-Step Audio Source Separation.
CoRR, May, 2025

GSound-SIR: A Spatial Impulse Response Ray-Tracing and High-order Ambisonic Auralization Python Toolkit.
CoRR, March, 2025

YuE: Scaling Open Foundation Models for Long-Form Music Generation.
CoRR, March, 2025

Are You Really Listening? Boosting Perceptual Awareness in Music-QA Benchmarks.
Proceedings of the 26th International Society for Music Information Retrieval Conference, 2025

DiffStereo: End-to-End Mono-to-Stereo Audio Generation with Diffusion Transformer.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

The Interpretation Gap in Text-to-Music Generation Models.
CoRR, 2024

Ambisonizer: Neural Upmixing as Spherical Harmonics Generation.
CoRR, 2024

SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan.
CoRR, 2024

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2024

SingFake: Singing Voice Deepfake Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Phase perturbation improves channel robustness for speech spoofing countermeasures.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023


  Loading...