Zhenxiong Tan

According to our database1, Zhenxiong Tan authored at least 21 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers.
CoRR, March, 2026

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer.
CoRR, March, 2026

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation.
Trans. Mach. Learn. Res., 2026

Minute-Long Videos with Dual Parallelisms.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
SpotEdit: Selective Region Editing in Diffusion Transformers.
CoRR, December, 2025

Vision Bridge Transformer at Scale.
CoRR, November, 2025

FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation.
CoRR, November, 2025

Image Editing As Programs with Diffusion Models.
CoRR, June, 2025

OminiControl2: Efficient Conditioning for Diffusion Transformers.
CoRR, March, 2025

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Ultra-Resolution Adaptation with Ease.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

OminiControl: Minimal and Universal Control for Diffusion Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
LinFusion: 1 GPU, 1 Minute, 16K Image.
CoRR, 2024

Video-Infinity: Distributed Long Video Generation.
CoRR, 2024

Implicit Curriculum in Procgen Made Explicit.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

MindBridge: A Cross-Subject Brain Decoding Framework.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
C-Procgen: Empowering Procgen with Controllable Contexts.
CoRR, 2023

2020
Multi-granularity Multimodal Feature Interaction for Referring Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision, Third Chinese Conference, 2020

AdversarialNAS: Adversarial Neural Architecture Search for GANs.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...