Chao Yu

ORCID: 0000-0001-6975-0158

Affiliations:
  • Tsinghua University, China


According to our database, Chao Yu authored at least 62 papers between 2017 and 2025.

Bibliography

2025
Hysteresis-Aware Neural Network Modeling and Whole-Body Reinforcement Learning Control of Soft Robots.
IEEE Robotics Autom. Lett., November, 2025

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training.
CoRR, October, 2025

EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models.
CoRR, October, 2025

JuggleRL: Mastering Ball Juggling with a Quadrotor via Deep Reinforcement Learning.
CoRR, September, 2025

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation.
CoRR, September, 2025

Online Planning for Multi-UAV Pursuit-Evasion in Unknown Environments Using Deep Reinforcement Learning.
IEEE Robotics Autom. Lett., August, 2025

FlightBench: Benchmarking Learning-Based Methods for Ego-Vision-Based Quadrotors Navigation.
IEEE Robotics Autom. Lett., July, 2025

Neural Internal Model Control: Learning a Robust Control Policy Via Predictive Error Feedback.
IEEE Robotics Autom. Lett., July, 2025

What Matters in Learning a Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study.
IEEE Robotics Autom. Lett., July, 2025

Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance.
CoRR, July, 2025

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments.
CoRR, June, 2025

Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams.
CoRR, May, 2025

Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps.
CoRR, May, 2025

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning.
CoRR, May, 2025

AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models.
CoRR, March, 2025

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards.
CoRR, February, 2025

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization.
CoRR, February, 2025

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play.
CoRR, February, 2025

Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network.
CoRR, February, 2025

2024
SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging with Neural Networks Based on Ballistocardiograms.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., November, 2024

Active Neural Topological Mapping for Multi-Agent Exploration.
IEEE Robotics Autom. Lett., January, 2024

OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control.
IEEE Robotics Autom. Lett., 2024

Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning.
CoRR, 2024

Few-shot In-Context Preference Learning Using Large Language Models.
CoRR, 2024

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning.
CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024

FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors.
CoRR, 2024

CityLight: A Universal Model Towards Real-world City-scale Traffic Signal Control Coordination.
CoRR, 2024

Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models.
CoRR, 2024

LAGOON: Language-Guided Motion Control.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Multi-Agent Vulnerability Discovery for Autonomous Driving Policy by Finding AV-Responsible Scenarios.
Proceedings of the 20th IEEE International Conference on Automation Science and Engineering, 2024

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
TaskFlex Solver for Multi-Agent Pursuit via Automatic Curriculum Learning.
CoRR, 2023

MASP: Scalable GNN-based Planning for Multi-Agent Navigation.
CoRR, 2023

Language-Guided Generation of Physically Realistic Robot Motion and Control.
CoRR, 2023

Automatic Truss Design with Reinforcement Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
INCAME: Interruptible CNN Accelerator for Multirobot Exploration.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

A Benchmark of Planning-based Exploration Methods in Photo-Realistic 3D Simulator.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2022

The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

SAVE: Spatial-Attention Visual Exploration.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning Efficient Multi-agent Cooperative Visual Exploration.
Proceedings of the Computer Vision - ECCV 2022, 2022

VMAPD: Generate Diverse Solutions for Multi-Agent Games with Recurrent Trajectory Discriminators.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021
Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward.
CoRR, 2021

The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games.
CoRR, 2021

Low-Cost Multi-Agent Navigation via Reinforcement Learning With Multi-Fidelity Simulator.
IEEE Access, 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unlocking the Potential of MAPPO with Asynchronous Optimization.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

2020
CNN-based Monocular Decentralized SLAM on embedded FPGA.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

INCAME: INterruptible CNN Accelerator for Multi-robot Exploration.
Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

CNN-based Feature-point Extraction for Real-time Visual SLAM on Embedded FPGA.
Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

INCA: INterruptible CNN Accelerator for Multi-tasking in Embedded Robots.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
A DenseNet feature-based loop closure method for visual SLAM system.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

2018
DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017
Multi-robot coordination for high-speed pick-and-place tasks.
Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017

