Yun Qu

Orcid: 0009-0000-1803-8435

Affiliations:
  • Tsinghua University, Beijing, China


According to our database1, Yun Qu authored at least 15 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?
CoRR, July, 2025

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments.
CoRR, April, 2025

Beyond Any-Shot Adaptation: Predicting Optimization Outcome for Robustness Gains without Extra Pay.
CoRR, January, 2025

Robust Fast Adaptation from Adversarially Explicit Task Distribution Generation.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration.
CoRR, 2024

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
CoRR, 2024

Robust Fast Adaptation from Adversarially Explicit Task Distribution Generation.
CoRR, 2024

A novel memetic algorithm for distributed shape formation of swarm robots with both acceleration and velocity constraints.
Sci. China Inf. Sci., 2024

Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Doubly Mild Generalization for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLM-Empowered State Representation for Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Complementary Attention for Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023


  Loading...