Yong-Lu Li

Orcid: 0000-0003-0478-0692

Affiliations:
  • Shanghai Jiao Tong University, Department of Electrical and Computer Engineering, China
  • Hong Kong University of Science and Technology, Hong Kong (former)


According to our database1, Yong-Lu Li authored at least 74 papers between 2018 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LIDEA: Human-to-Robot Imitation Learning via Implicit Feature Distillation and Explicit Geometry Alignment.
CoRR, April, 2026

OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control.
CoRR, February, 2026

A Pragmatic VLA Foundation Model.
CoRR, January, 2026

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents.
CoRR, January, 2026

Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
GarmageNet: A Multimodal Generative Framework for Sewing Pattern Design and Generic Garment Modeling.
ACM Trans. Graph., December, 2025

Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols.
CoRR, December, 2025

Efficient and Scalable Monocular Human-Object Interaction Motion Reconstruction.
CoRR, December, 2025

L1 Sample Flow for Efficient Visuomotor Learning.
CoRR, November, 2025

IPR-1: Interactive Physical Reasoner.
CoRR, November, 2025

RoboHiMan: A Hierarchical Evaluation Paradigm for Compositional Generalization in Long-Horizon Manipulation.
CoRR, October, 2025

exUMI: Extensible Robot Teaching System with Action-aware Task-agnostic Tactile Representation.
CoRR, September, 2025

Motion Before Action: Diffusing Object Motion as Manipulation Condition.
IEEE Robotics Autom. Lett., July, 2025

GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling.
CoRR, April, 2025

Dense Policy: Bidirectional Autoregressive Learning of Actions.
CoRR, March, 2025

SIME: Enhancing Policy Self-Improvement with Modal-level Exploration.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ImDy: Human Inverse Dynamics from Imitated Observations.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dense Policy: Bidirectional Autoregressive Learning of Actions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Homogeneous Dynamics Space for Heterogeneous Humans.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
M<sup>3</sup>-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation.
CoRR, 2024

HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

General Articulated Objects Manipulation in Real Images via Part-Aware Diffusion Process.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Low-Rank Similarity Mining for Multimodal Dataset Distillation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Take a Step Back: Rethinking the Two Stages in Visual Reasoning.
Proceedings of the Computer Vision - ECCV 2024, 2024

DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-Level Control.
Proceedings of the Computer Vision - ECCV 2024, 2024

Distill Gold from Massive Ores: Bi-level Data Pruning Towards Efficient Dataset Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Bridging the Gap Between Human Motion and Action Semantics via Kinematic Phrases.
Proceedings of the Computer Vision - ECCV 2024, 2024

Revisit Human-Scene Interaction via Space Occupancy.
Proceedings of the Computer Vision - ECCV 2024, 2024

Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Primitive-Based 3D Human-Object Interaction Modelling and Programming.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Dynamic Context Removal: A General Training Strategy for Robust Models on Video Action Predictive Tasks.
Int. J. Comput. Vis., December, 2023

HAKE: A Knowledge Engine Foundation for Human Activity Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Dancing with Images: Video Distillation via Static-Dynamic Disentanglement.
CoRR, 2023

Distill Gold from Massive Ores: Efficient Dataset Distillation via Critical Samples Selection.
CoRR, 2023

From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding.
CoRR, 2023

Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Beyond Object Recognition: A New Benchmark towards Object Concept Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Learning Single/Multi-Attribute of Object With Symmetry and Group.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Transferable Interactiveness Knowledge for Human-Object Interaction Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions.
CoRR, 2022

Constructing Balance from Imbalance for Long-Tailed Image Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UKPGAN: A General Self-Supervised Keypoint Detector.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Anticipate Future with Dynamic Context Removal.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Interactiveness Field in Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Human Trajectory Prediction with Momentary Observation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Localization with Sampling-Argmax.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PAL-Net: Predicate-Aware Learning Network for Visual Relationship Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

VVS: Action Recognition With Virtual View Synthesis.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

DecAug: Augmenting HOI Detection via Decomposition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
UKPGAN: Unsupervised KeyPoint GANeration.
CoRR, 2020

HOI Analysis: Integrating and Decomposing Human-Object Interaction.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Symmetry and Group in Attribute-Object Compositions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PaStaNet: Toward Human Activity Knowledge Engine.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Detailed 2D-3D Joint Representation for Human-Object Interaction.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
HAKE: Human Activity Knowledge Engine.
CoRR, 2019

InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Transferable Interactiveness Knowledge for Human-Object Interaction Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Transferable Interactiveness Prior for Human-Object Interaction Detection.
CoRR, 2018

Generating Instance Segmentation Annotation by Geometry-guided GAN.
CoRR, 2018

SRDA: Generating Instance Segmentation Annotation via Scanning, Reasoning and Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Beyond Holistic Object Recognition: Enriching Image Understanding With Part States.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018


  Loading...