Chengdong Ma

Orcid: 0000-0002-7963-3024

According to our database¹, Chengdong Ma authored at least 24 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

World Models Should Prioritize the Unification of Physical and Social Dynamics.

[BibT_eX]

[DOI]

CoRR, October, 2025

Social World Model-Augmented Mechanism Design Policy Learning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

Goal Discovery with Causal Capacity for Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, August, 2025

EconGym: A Scalable AI Testbed with Diverse Economic Tasks.

[BibT_eX]

[DOI]

CoRR, June, 2025

Mean Field Correlated Imitation Learning.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Falcon: Fast Visuomotor Policies via Partial Denoising.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems.

[BibT_eX]

[DOI]

Proceedings of the Artificial General Intelligence - 18th International Conference, 2025

Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games.

[BibT_eX]

[DOI]

CoRR, 2024

Conflux-PSRO: Effectively Leveraging Collective Advantages in Policy Space Response Oracles.

[BibT_eX]

[DOI]

CoRR, 2024

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles.

[BibT_eX]

[DOI]

CoRR, 2024

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects.

[BibT_eX]

[DOI]

CoRR, 2024

Panacea: Pareto Alignment via Preference Adaptation for LLMs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Confrontation and Obstacle-Avoidance of Unmanned Vehicles Based on Progressive Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Access, 2023

2022

Fully Decentralized Model-based Policy Optimization for Networked Systems.

[BibT_eX]

[DOI]

CoRR, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

2018

Design of a Low-Power Cold Chain Logistics Internet of Things System.

[BibT_eX]

[DOI]

Heshuai Shao

Ronglin Hu

Chengdong Ma

Proceedings of the Advances in Internet, 2018

Chengdong Ma

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...