Zehua Chen

Orcid: 0009-0000-2294-7447

Affiliations:
  • Tsinghua University, Beijing, China
  • Shengshu AI, Beijing, China


According to our database, Zehua Chen authored at least 17 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Omni2Sound: Towards Unified Video-Text-to-Audio Generation.
CoRR, January, 2026

2025
RefineBridge: Generative Bridge Models Improve Financial Forecasting by Foundation Models.
CoRR, December, 2025

ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Generation via Progressive Diffusion Modeling.
CoRR, October, 2025

VoiceBridge: Designing Latent Bridge Models for General Speech Restoration at Scale.
CoRR, September, 2025

Versatile Cardiovascular Signal Generation with a Unified Diffusion Transformer.
CoRR, May, 2025

FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FrameBridge: Improving Image-to-Video Generation with Bridge Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

RespDiff: An End-to-End Multi-scale RNN Diffusion Model for Respiratory Waveform Estimation from PPG Signals.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models.
CoRR, 2024

TiVA: Time-Aligned Video-to-Audio Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis.
CoRR, 2023

Improving Diffusion Models for ECG Imputation with an Augmented Template Prior.
CoRR, 2023

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models.
Proceedings of the International Conference on Machine Learning, 2023

2022
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech.
CoRR, 2022

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Infergrad: Improving Diffusion Models for Vocoder by Considering Inference in Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
A Probabilistic Beat-to-Beat Filtering Model for Continuous and Accurate Blood Pressure Estimation.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

