Muning Wen

Orcid: 0009-0000-7868-1262

According to our database1, Muning Wen authored at least 31 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
RDHNet: addressing rotational and permutational symmetries in continuous multi-agent systems.
Frontiers Comput. Sci., November, 2025

MobileUse: A GUI Agent with Hierarchical Reflection for Autonomous Mobile Operation.
CoRR, July, 2025

A Survey of AI Agent Protocols.
CoRR, April, 2025

MARFT: Multi-Agent Reinforcement Fine-Tuning.
CoRR, April, 2025

Learning Humanoid Standing-up Control across Diverse Postures.
CoRR, February, 2025

PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Robust Function-Calling for On-Device Language Model via Function Masking.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Assistant Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Safe Multiagent Learning With Soft Constrained Policy Optimization in Real Robot Control.
IEEE Trans. Ind. Informatics, September, 2024

RoMAT: Role-based multi-agent transformer for generalizable heterogeneous cooperation.
Neural Networks, 2024

HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios.
CoRR, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models.
CoRR, 2024

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking.
CoRR, 2024

P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for Optimizing LLM Training.
CoRR, 2024

Reinforcing Language Agents via Policy Optimization with Action Decomposition.
CoRR, 2024

Entropy-Regularized Token-Level Policy Optimization for Large Language Models.
CoRR, 2024

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Reinforcing LLM Agents via Policy Optimization with Action Decomposition.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Large sequence models for sequential decision-making: a survey.
Frontiers Comput. Sci., December, 2023

Offline Pre-trained Multi-agent Decision Transformer.
Mach. Intell. Res., April, 2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
J. Mach. Learn. Res., 2023

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training.
CoRR, 2023

2022
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.
CoRR, 2021

Multi-Agent Constrained Policy Optimisation.
CoRR, 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
CoRR, 2021

Settling the Variance of Multi-Agent Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


  Loading...