Yuheng Ji

Orcid: 0009-0005-4898-6918

According to our database, Yuheng Ji authored at least 27 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
PReD: An LLM-based Foundation Multimodal Model for Electromagnetic Perception, Recognition, and Decision.
CoRR, March, 2026

PRM-as-a-Judge: A Dense Evaluation Paradigm for Fine-Grained Robotic Auditing.
CoRR, March, 2026

RoboBrain 2.5: Depth in Sight, Time in Mind.
CoRR, January, 2026

Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation.
CoRR, January, 2026

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation.
CoRR, December, 2025

RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion.
CoRR, December, 2025

Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives.
CoRR, December, 2025

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics.
CoRR, December, 2025

Scaling Up AI-Generated Image Detection via Generator-Aware Prototypes.
CoRR, December, 2025

Towards Cross-View Point Correspondence in Vision-Language Models.
CoRR, December, 2025

RoboOS-NeXT: A Unified Memory-based Framework for Lifelong, Scalable, and Robust Multi-Robot Collaboration.
CoRR, October, 2025

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey.
CoRR, October, 2025

MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles.
CoRR, October, 2025

VisualTrans: A Benchmark for Real-World Visual Transformation Reasoning.
CoRR, August, 2025

RoboBrain 2.0 Technical Report.
CoRR, July, 2025

ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models.
CoRR, May, 2025

FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird's Eye View.
CoRR, April, 2025

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
CoRR, March, 2025

FastRSR: Efficient and Accurate Road Surface Reconstruction in Bird's Eye View.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

EgoPrompt: Prompt Learning for Egocentric Action Recognition.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Enhancing Adversarial Robustness of Vision-Language Models through Low-Rank Adaptation.
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025

What Really Matters for Robust Multi-Sensor HD Map Construction?
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Alleviating Performance Disparity in Adversarial Spatiotemporal Graph Learning Under Zero-Inflated Distribution.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
AdvLoRA: Adversarial Low-Rank Adaptation of Vision-Language Models.
CoRR, 2024

