Chao Yu

Orcid: 0000-0001-6975-0158

Affiliations:

Tsinghua University, China

According to our database¹, Chao Yu authored at least 63 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Hysteresis-Aware Neural Network Modeling and Whole-Body Reinforcement Learning Control of Soft Robots.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., November, 2025

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training.

[BibT_eX]

[DOI]

CoRR, October, 2025

EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

JuggleRL: Mastering Ball Juggling with a Quadrotor via Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation.

[BibT_eX]

[DOI]

CoRR, September, 2025

Online Planning for Multi-UAV Pursuit-Evasion in Unknown Environments Using Deep Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., August, 2025

FlightBench: Benchmarking Learning-Based Methods for Ego-Vision-Based Quadrotors Navigation.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., July, 2025

Neural Internal Model Control: Learning a Robust Control Policy Via Predictive Error Feedback.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., July, 2025

What Matters in Learning a Zero-Shot Sim-to-Real RL Policy for Quadrotor Control? A Comprehensive Study.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., July, 2025

Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance.

[BibT_eX]

[DOI]

CoRR, July, 2025

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments.

[BibT_eX]

[DOI]

CoRR, June, 2025

Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams.

[BibT_eX]

[DOI]

CoRR, May, 2025

Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps.

[BibT_eX]

[DOI]

CoRR, May, 2025

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards.

[BibT_eX]

[DOI]

CoRR, February, 2025

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play.

[BibT_eX]

[DOI]

CoRR, February, 2025

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

CityLight: A Neighborhood-inclusive Universal Model for Coordinated City-scale Traffic Signal Control.

[BibT_eX]

[DOI]

Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

2024

SleepNetZero: Zero-Burden Zero-Shot Reliable Sleep Staging with Neural Networks Based on Ballistocardiograms.

[BibT_eX]

[DOI]

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., November, 2024

Active Neural Topological Mapping for Multi-Agent Exploration.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., January, 2024

OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2024

Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Few-shot In-Context Preference Learning Using Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors.

[BibT_eX]

[DOI]

CoRR, 2024

CityLight: A Universal Model Towards Real-world City-scale Traffic Signal Control Coordination.

[BibT_eX]

[DOI]

CoRR, 2024

Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

LAGOON: Language-Guided Motion Control.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Multi-Agent Vulnerability Discovery for Autonomous Driving Policy by Finding AV-Responsible Scenarios.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Conference on Automation Science and Engineering, 2024

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

TaskFlex Solver for Multi-Agent Pursuit via Automatic Curriculum Learning.

[BibT_eX]

[DOI]

CoRR, 2023

MASP: Scalable GNN-based Planning for Multi-Agent Navigation.

[BibT_eX]

[DOI]

CoRR, 2023

Language-Guided Generation of Physically Realistic Robot Motion and Control.

[BibT_eX]

[DOI]

CoRR, 2023

Automatic Truss Design with Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

INCAME: Interruptible CNN Accelerator for Multirobot Exploration.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

A Benchmark of Planning-based Exploration Methods in Photo-Realistic 3D Simulator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2022

The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

SAVE: Spatial-Attention Visual Exploration.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning Efficient Multi-agent Cooperative Visual Exploration.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

VMAPD: Generate Diverse Solutions for Multi-Agent Games with Recurrent Trajectory Discriminators.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021

Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward.

[BibT_eX]

[DOI]

CoRR, 2021

The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games.

[BibT_eX]

[DOI]

CoRR, 2021

Low-Cost Multi-Agent Navigation via Reinforcement Learning With Multi-Fidelity Simulator.

[BibT_eX]

[DOI]

IEEE Access, 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Unlocking the Potential of MAPPO with Asynchronous Optimization.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

2020

CNN-based Monocular Decentralized SLAM on embedded FPGA.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

INCAME: INterruptible CNN Accelerator for Multi-robot Exploration.

[BibT_eX]

[DOI]

Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

CNN-based Feature-point Extraction for Real-time Visual SLAM on Embedded FPGA.

[BibT_eX]

[DOI]

Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

INCA: INterruptible CNN Accelerator for Multi-tasking in Embedded Robots.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019

A DenseNet feature-based loop closure method for visual SLAM system.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

2018

DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017

Multi-robot coordination for high-speedpick-and-place tasks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017

Chao Yu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...