Chengdong Ma

Orcid: 0000-0002-7963-3024

According to our database1, Chengdong Ma authored at least 20 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Goal Discovery with Causal Capacity for Efficient Reinforcement Learning.
CoRR, August, 2025

EconGym: A Scalable AI Testbed with Diverse Economic Tasks.
CoRR, June, 2025

Fast Visuomotor Policies via Partial Denoising.
CoRR, March, 2025

Mean Field Correlated Imitation Learning.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems.
Proceedings of the Artificial General Intelligence - 18th International Conference, 2025

Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games.
CoRR, 2024

Conflux-PSRO: Effectively Leveraging Collective Advantages in Policy Space Response Oracles.
CoRR, 2024

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment.
CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles.
CoRR, 2024

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects.
CoRR, 2024

Panacea: Pareto Alignment via Preference Adaptation for LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models.
CoRR, 2023

Confrontation and Obstacle-Avoidance of Unmanned Vehicles Based on Progressive Reinforcement Learning.
IEEE Access, 2023

2022
Fully Decentralized Model-based Policy Optimization for Networked Systems.
CoRR, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

2018
Design of a Low-Power Cold Chain Logistics Internet of Things System.
Proceedings of the Advances in Internet, 2018


  Loading...