Junchuan Zhao

Orcid: 0009-0008-2616-6590

According to our database1, Junchuan Zhao authored at least 13 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems.
CoRR, May, 2026

PersonaGest: Personalized Co-Speech Gesture Generation with Semantic-Guided Hierarchical Motion Representation.
CoRR, May, 2026

Hierarchical Decoding for Discrete Speech Synthesis with Multi-Resolution Spoof Detection.
CoRR, March, 2026

CodecFlow: Efficient Bandwidth Extension via Conditional Flow Matching in Neural Codec Latent Space.
CoRR, March, 2026

Segment-Aware Conditioning for Training-Free Intra-Utterance Emotion and Duration Control in Text-to-Speech.
CoRR, January, 2026

TED-TTS: Training-Free Intra-Utterance Emotion and Duration Control for Text-to-Speech Synthesis.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Disentangling Score Content and Performance Style for Joint Piano Rendering and Transcription.
CoRR, September, 2025

InconVAD: A Two-Stage Dual-Tower Framework for Multimodal Emotion Inconsistency Detection.
CoRR, September, 2025

KSDiff: Keyframe-Augmented Speech-Aware Dual-Path Diffusion for Facial Animation.
CoRR, September, 2025

CoMelSinger: Discrete Token-Based Zero-Shot Singing Synthesis With Structured Melody Control and Guidance.
CoRR, September, 2025

Prosody-Adaptable Audio Codecs for Zero-Shot Voice Conversion via In-Context Learning.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SPSinger: Multi-Singer Singing Voice Synthesis with Short Reference Prompt.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
SinTechSVS: A Singing Technique Controllable Singing Voice Synthesis System.
IEEE ACM Trans. Audio Speech Lang. Process., 2024


  Loading...