Hangjie Yuan

ORCID: 0009-0009-3270-1526

According to our database, Hangjie Yuan authored at least 62 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs.

Int. J. Comput. Vis., June, 2026

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models.

CoRR, May, 2026

Bridging Brain and Semantics: A Hierarchical Framework for Semantically Enhanced fMRI-to-Video Reconstruction.

CoRR, May, 2026

A Faster Path to Continual Learning.

CoRR, April, 2026

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation.

CoRR, March, 2026

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning.

CoRR, March, 2026

Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training.

CoRR, February, 2026

Continual GUI Agents.

CoRR, January, 2026

CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving.

CoRR, January, 2026

OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Adapt Before Continual Learning.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding.

CoRR, December, 2025

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning.

CoRR, December, 2025

RynnVLA-002: A Unified Vision-Language-Action and World Model.

CoRR, November, 2025

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback.

CoRR, November, 2025

Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance.

CoRR, October, 2025

C-Flat++: Towards a More Efficient and Powerful Framework for Continual Learning.

CoRR, August, 2025

Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective.

CoRR, July, 2025

WorldVLA: Towards Autoregressive Action World Model.

CoRR, June, 2025

DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing.

CoRR, June, 2025

VideoMAR: Autoregressive Video Generation with Continuous Tokens.

CoRR, June, 2025

LumosFlow: Motion-Guided Long Video Generation.

CoRR, June, 2025

Taming Consistency Distillation for Accelerated Human Image Animation.

CoRR, April, 2025

MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems.

CoRR, March, 2025

Frequency Autoregressive Image Generation with Continuous Tokens.

CoRR, March, 2025

Generative Artificial Intelligence in Robotic Manipulation: A Survey.

CoRR, March, 2025

Class-Incremental Player Detection With Refined Response-Based Knowledge Distillation.

IEEE Trans. Instrum. Meas., 2025

Rethinking the Stability-Plasticity Trade-off in Continual Learning from an Architectural Perspective.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

DreamRelation: Relation-Centric Video Customization.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

AeroGTO: An Efficient Graph-Transformer Operator for Learning Large-Scale Aerodynamics of 3D Vehicle Geometries.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

UniGrad-FS: Unified Gradient Projection With Flatter Sharpness for Continual Learning.

IEEE Trans. Ind. Informatics, December, 2024

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control.

CoRR, 2024

Make Continual Learning Stronger via C-Flat.

CoRR, 2024

From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Make Continual Learning Stronger via C-Flat.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Revisiting Neural Networks for Continual Learning: An Architectural Perspective.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

PAPM: A Physics-aware Proxy Model for Process Systems.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition.

Lingfeng Liu

Dong Ni

Hangjie Yuan

Proceedings of the Twelfth International Conference on Learning Representations, 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.

CoRR, 2023

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models.

CoRR, 2023

Few-shot Action Recognition with Captioning Foundation Models.

CoRR, 2023

ModelScope Text-to-Video Technical Report.

CoRR, 2023

Refined Response Distillation for Class-Incremental Player Detection.

CoRR, 2023

VideoComposer: Compositional Video Synthesis with Motion Controllability.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLIPv2: Fast Scaling of Relational Language-Image Pre-training.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Progressive Learning without Forgetting.

CoRR, 2022

RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation.

Tao Feng

Mang Wang

Hangjie Yuan

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Towards Mask-robust Face Recognition.

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.

Hangjie Yuan

Dong Ni

Mang Wang

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Visual Context for Group Activity Recognition.

Hangjie Yuan

Dong Ni

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
