Kaiwen Zheng

Affiliations:
  • Tsinghua University, Department of Computer Science and Technic, THBI Lab, Institute for AI, BNRist Center, China


According to our database1, Kaiwen Zheng authored at least 26 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models.
CoRR, May, 2026

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation.
CoRR, May, 2026

SLA2: Sparse-Linear Attention with Learnable Routing and QAT.
CoRR, February, 2026

2025
Vidarc: Embodied Video Diffusion Model for Closed-loop Control.
CoRR, December, 2025

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times.
CoRR, December, 2025

Data-regularized Reinforcement Learning for Diffusion Models at Scale.
CoRR, December, 2025

Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency.
CoRR, October, 2025

VoiceBridge: Designing Latent Bridge Models for General Speech Restoration at Scale.
CoRR, September, 2025

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention.
CoRR, September, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process.
CoRR, September, 2025

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning.
CoRR, May, 2025

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Visual Generation Without Guidance.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Diffusion Bridge Implicit Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Elucidating the Preconditioning in Consistency Distillation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models.
CoRR, 2024

Consistency Diffusion Bridge Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis.
CoRR, 2023

DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs.
Proceedings of the International Conference on Machine Learning, 2023

PREIM3D: 3D Consistent Precise Image Attribute Editing from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Maximum Likelihood Training for Score-based Diffusion ODEs by High Order Denoising Score Matching.
Proceedings of the International Conference on Machine Learning, 2022


  Loading...