Hangjie Yuan

Orcid: 0009-0009-3270-1526

According to our database, Hangjie Yuan authored at least 62 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs.
Int. J. Comput. Vis., June, 2026

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models.
CoRR, May, 2026

Bridging Brain and Semantics: A Hierarchical Framework for Semantically Enhanced fMRI-to-Video Reconstruction.
CoRR, May, 2026

A Faster Path to Continual Learning.
CoRR, April, 2026

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation.
CoRR, March, 2026

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning.
CoRR, March, 2026

Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training.
CoRR, February, 2026

Continual GUI Agents.
CoRR, January, 2026

CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving.
CoRR, January, 2026

OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Adapt Before Continual Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
An Efficient Graph-Transformer Operator for Learning Physical Dynamics with Manifolds Embedding.
CoRR, December, 2025

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning.
CoRR, December, 2025

RynnVLA-002: A Unified Vision-Language-Action and World Model.
CoRR, November, 2025

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback.
CoRR, November, 2025

Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance.
CoRR, October, 2025

C-Flat++: Towards a More Efficient and Powerful Framework for Continual Learning.
CoRR, August, 2025

Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective.
CoRR, July, 2025

WorldVLA: Towards Autoregressive Action World Model.
CoRR, June, 2025

DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing.
CoRR, June, 2025

VideoMAR: Autoregressive Video Generation with Continuous Tokens.
CoRR, June, 2025

LumosFlow: Motion-Guided Long Video Generation.
CoRR, June, 2025

Taming Consistency Distillation for Accelerated Human Image Animation.
CoRR, April, 2025

MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems.
CoRR, March, 2025

Frequency Autoregressive Image Generation with Continuous Tokens.
CoRR, March, 2025

Generative Artificial Intelligence in Robotic Manipulation: A Survey.
CoRR, March, 2025

Class-Incremental Player Detection With Refined Response-Based Knowledge Distillation.
IEEE Trans. Instrum. Meas., 2025

Rethinking the Stability-Plasticity Trade-off in Continual Learning from an Architectural Perspective.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

DreamRelation: Relation-Centric Video Customization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

AeroGTO: An Efficient Graph-Transformer Operator for Learning Large-Scale Aerodynamics of 3D Vehicle Geometries.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
UniGrad-FS: Unified Gradient Projection With Flatter Sharpness for Continual Learning.
IEEE Trans. Ind. Informatics, December, 2024

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control.
CoRR, 2024

Make Continual Learning Stronger via C-Flat.
CoRR, 2024

From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Make Continual Learning Stronger via C-Flat.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Revisiting Neural Networks for Continual Learning: An Architectural Perspective.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

PAPM: A Physics-aware Proxy Model for Process Systems.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.
CoRR, 2023

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models.
CoRR, 2023

Few-shot Action Recognition with Captioning Foundation Models.
CoRR, 2023

ModelScope Text-to-Video Technical Report.
CoRR, 2023

Refined Response Distillation for Class-Incremental Player Detection.
CoRR, 2023

VideoComposer: Compositional Video Synthesis with Motion Controllability.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLIPv2: Fast Scaling of Relational Language-Image Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Progressive Learning without Forgetting.
CoRR, 2022

RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Towards Mask-robust Face Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Visual Context for Group Activity Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
