Muning Wen

According to our database1, Muning Wen authored at least 12 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision.
CoRR, 2024

Entropy-Regularized Token-Level Policy Optimization for Large Language Models.
CoRR, 2024

2023
Large sequence models for sequential decision-making: a survey.
Frontiers Comput. Sci., December, 2023

Offline Pre-trained Multi-agent Decision Transformer.
Mach. Intell. Res., April, 2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
J. Mach. Learn. Res., 2023

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training.
CoRR, 2023

2022
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.
CoRR, 2021

Multi-Agent Constrained Policy Optimisation.
CoRR, 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
CoRR, 2021

Settling the Variance of Multi-Agent Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


  Loading...