Hangjie Yuan

Orcid: 0009-0009-3270-1526

According to our database, Hangjie Yuan authored at least 45 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective.
CoRR, July, 2025

WorldVLA: Towards Autoregressive Action World Model.
CoRR, June, 2025

DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing.
CoRR, June, 2025

VideoMAR: Autoregressive Video Generation with Continuous Tokens.
CoRR, June, 2025

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models.
CoRR, June, 2025

Adapt before Continual Learning.
CoRR, June, 2025

Rethinking the Stability-Plasticity Trade-off in Continual Learning from an Architectural Perspective.
CoRR, June, 2025

LumosFlow: Motion-Guided Long Video Generation.
CoRR, June, 2025

Taming Consistency Distillation for Accelerated Human Image Animation.
CoRR, April, 2025

MathFlow: Enhancing the Perceptual Flow of MLLMs for Visual Mathematical Problems.
CoRR, March, 2025

DreamRelation: Relation-Centric Video Customization.
CoRR, March, 2025

Frequency Autoregressive Image Generation with Continuous Tokens.
CoRR, March, 2025

Generative Artificial Intelligence in Robotic Manipulation: A Survey.
CoRR, March, 2025

ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think.
CoRR, January, 2025

Class-Incremental Player Detection With Refined Response-Based Knowledge Distillation.
IEEE Trans. Instrum. Meas., 2025

AeroGTO: An Efficient Graph-Transformer Operator for Learning Large-Scale Aerodynamics of 3D Vehicle Geometries.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
UniGrad-FS: Unified Gradient Projection With Flatter Sharpness for Continual Learning.
IEEE Trans. Ind. Informatics, December, 2024

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion.
CoRR, 2024

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control.
CoRR, 2024

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
CoRR, 2024

Make Continual Learning Stronger via C-Flat.
CoRR, 2024

From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Make Continual Learning Stronger via C-Flat.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Revisiting Neural Networks for Continual Learning: An Architectural Perspective.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

PAPM: A Physics-aware Proxy Model for Process Systems.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.
CoRR, 2023

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models.
CoRR, 2023

Few-shot Action Recognition with Captioning Foundation Models.
CoRR, 2023

ModelScope Text-to-Video Technical Report.
CoRR, 2023

Refined Response Distillation for Class-Incremental Player Detection.
CoRR, 2023

VideoComposer: Compositional Video Synthesis with Motion Controllability.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RLIPv2: Fast Scaling of Relational Language-Image Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Progressive Learning without Forgetting.
CoRR, 2022

RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Towards Mask-robust Face Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Visual Context for Group Activity Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

