Yong-Lu Li

Orcid: 0000-0003-0478-0692

Affiliations:

Shanghai Jiao Tong University, Department of Electrical and Computer Engineering, China
Hong Kong University of Science and Technology, Hong Kong (former)

According to our database¹, Yong-Lu Li authored at least 75 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

LIDEA: Human-to-Robot Imitation Learning via Implicit Feature Distillation and Explicit Geometry Alignment.

[BibT_eX]

[DOI]

CoRR, April, 2026

Beyond Static Vision: Scene Dynamic Field Unlocks Intuitive Physics Understanding in Multi-modal Large Language Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control.

[BibT_eX]

[DOI]

CoRR, February, 2026

A Pragmatic VLA Foundation Model.

[BibT_eX]

[DOI]

CoRR, January, 2026

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents.

[BibT_eX]

[DOI]

CoRR, January, 2026

Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

GarmageNet: A Multimodal Generative Framework for Sewing Pattern Design and Generic Garment Modeling.

[BibT_eX]

[DOI]

ACM Trans. Graph., December, 2025

Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols.

[BibT_eX]

[DOI]

CoRR, December, 2025

Efficient and Scalable Monocular Human-Object Interaction Motion Reconstruction.

[BibT_eX]

[DOI]

CoRR, December, 2025

L1 Sample Flow for Efficient Visuomotor Learning.

[BibT_eX]

[DOI]

CoRR, November, 2025

IPR-1: Interactive Physical Reasoner.

[BibT_eX]

[DOI]

CoRR, November, 2025

RoboHiMan: A Hierarchical Evaluation Paradigm for Compositional Generalization in Long-Horizon Manipulation.

[BibT_eX]

[DOI]

CoRR, October, 2025

exUMI: Extensible Robot Teaching System with Action-aware Task-agnostic Tactile Representation.

[BibT_eX]

[DOI]

CoRR, September, 2025

Motion Before Action: Diffusing Object Motion as Manipulation Condition.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., July, 2025

GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling.

[BibT_eX]

[DOI]

CoRR, April, 2025

Dense Policy: Bidirectional Autoregressive Learning of Actions.

[BibT_eX]

[DOI]

CoRR, March, 2025

SIME: Enhancing Policy Self-Improvement with Modal-level Exploration.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition.

[BibT_eX]

[DOI]

Katherine Rose Driggs-Campbell

Cewu Lu

Yong-Lu Li

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ImDy: Human Inverse Dynamics from Imitated Observations.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dense Policy: Bidirectional Autoregressive Learning of Actions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Homogeneous Dynamics Space for Heterogeneous Humans.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

M<sup>3</sup>-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

General Articulated Objects Manipulation in Real Images via Part-Aware Diffusion Process.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Low-Rank Similarity Mining for Multimodal Dataset Distillation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Take a Step Back: Rethinking the Two Stages in Visual Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-Level Control.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Distill Gold from Massive Ores: Bi-level Data Pruning Towards Efficient Dataset Distillation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Bridging the Gap Between Human Motion and Action Semantics via Kinematic Phrases.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Revisit Human-Scene Interaction via Space Occupancy.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Primitive-Based 3D Human-Object Interaction Modelling and Programming.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Dynamic Context Removal: A General Training Strategy for Robust Models on Video Action Predictive Tasks.

[BibT_eX]

[DOI]

Xinyu Xu

Yong-Lu Li

Cewu Lu

Int. J. Comput. Vis., December, 2023

HAKE: A Knowledge Engine Foundation for Human Activity Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Dancing with Images: Video Distillation via Static-Dynamic Disentanglement.

[BibT_eX]

[DOI]

CoRR, 2023

Distill Gold from Massive Ores: Efficient Dataset Distillation via Critical Samples Selection.

[BibT_eX]

[DOI]

CoRR, 2023

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Beyond Object Recognition: A New Benchmark towards Object Concept Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Learning Single/Multi-Attribute of Object With Symmetry and Group.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Transferable Interactiveness Knowledge for Human-Object Interaction Detection.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions.

[BibT_eX]

[DOI]

CoRR, 2022

Constructing Balance from Imbalance for Long-Tailed Image Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UKPGAN: A General Self-Supervised Keypoint Detector.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Anticipate Future with Dynamic Context Removal.

[BibT_eX]

[DOI]

Xinyu Xu

Yong-Lu Li

Cewu Lu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Interactiveness Field in Human-Object Interactions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Human Trajectory Prediction with Momentary Observation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection.

[BibT_eX]

[DOI]

Xinpeng Liu

Yong-Lu Li

Cewu Lu

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Localization with Sampling-Argmax.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PAL-Net: Predicate-Aware Learning Network for Visual Relationship Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

VVS: Action Recognition With Virtual View Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

DecAug: Augmenting HOI Detection via Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

UKPGAN: Unsupervised KeyPoint GANeration.

[BibT_eX]

[DOI]

CoRR, 2020

HOI Analysis: Integrating and Decomposing Human-Object Interaction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Symmetry and Group in Attribute-Object Compositions.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PaStaNet: Toward Human Activity Knowledge Engine.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Detailed 2D-3D Joint Representation for Human-Object Interaction.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

HAKE: Human Activity Knowledge Engine.

[BibT_eX]

[DOI]

CoRR, 2019

InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Transferable Interactiveness Knowledge for Human-Object Interaction Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Transferable Interactiveness Prior for Human-Object Interaction Detection.

[BibT_eX]

[DOI]

CoRR, 2018

Generating Instance Segmentation Annotation by Geometry-guided GAN.

[BibT_eX]

[DOI]

Wenqiang Xu

Yonglu Li

Cewu Lu

CoRR, 2018

SRDA: Generating Instance Segmentation Annotation via Scanning, Reasoning and Domain Adaptation.

[BibT_eX]

[DOI]

Wenqiang Xu

Yonglu Li

Cewu Lu

Proceedings of the Computer Vision - ECCV 2018, 2018

Beyond Holistic Object Recognition: Enriching Image Understanding With Part States.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Yong-Lu Li

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...