Kehan Li

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2026
High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning With Gaussian Splatting.
IEEE Robotics Autom. Lett., May, 2026

Uncertainty-Aware Disentangled Dynamic Graph Attention Network for Out-of-Distribution Generalization.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

RynnBrain: Open Embodied Foundation Models.
CoRR, February, 2026

CCTD-MARL: Coupled Communication-Task Decoupling Framework for Multi-Agent Systems Under Partial Observability.
Big Data Cogn. Comput., 2026

Generating Risky Samples with Conformity Constraints via Diffusion Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation.
CoRR, December, 2025

EchoVLA: Robotic Vision-Language-Action Model with Synergistic Declarative Memory for Mobile Manipulation.
CoRR, November, 2025

RynnVLA-002: A Unified Vision-Language-Action and World Model.
CoRR, November, 2025

PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity.
CoRR, October, 2025

InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue.
CoRR, October, 2025

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation.
CoRR, September, 2025

MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment.
CoRR, September, 2025

LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence.
CoRR, September, 2025

RynnEC: Bringing MLLMs into Embodied World.
CoRR, August, 2025

T2I-ConBench: Text-to-Image Benchmark for Continual Post-training.
CoRR, May, 2025

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding.
CoRR, January, 2025

Machine learning prediction models for multidrug-resistant organism infections in ICU ventilator-associated pneumonia patients: Analysis using the MIMIC-IV database.
Comput. Biol. Medicine, 2025

Safety Constraint-Oriented Path Planning and Navigation Method for Substation Robots.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2025

Referent-Aligned Training for Unsupervised Interactive Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

SegLLM: Multi-round Reasoning Segmentation with Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ODP-Bench: Benchmarking Out-Of-Distribution Performance Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Dynamic Decision Correction Framework Integrating Path Optimization and Incomplete Information Handling.
Proceedings of the 2025 2nd International Conference on Generative Artificial Intelligence and Information Security, 2025

Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Spherical scan context: a global descriptor in the form of a third-order tensor for loop closure detection.
Multim. Syst., December, 2024

SegLLM: Multi-round Reasoning Segmentation.
CoRR, 2024

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss.
CoRR, 2024

A Distributional Reinforcement Learning-Based Strategy for Pod Scheduling in Satellite Clusters.
Proceedings of the 20th International Conference on Mobility, Sensing and Networking, 2024

