Yun Qu
Orcid: 0009-0000-1803-8435Affiliations:
- Tsinghua University, Beijing, China
According to our database1,
Yun Qu authored at least 22 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex.
CoRR, May, 2026
CoRR, March, 2026
Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models.
CoRR, February, 2026
HINT: Hierarchical Interaction Modeling for Autoregressive Multi-Human Motion Generation.
CoRR, January, 2026
Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026
2025
CoRR, November, 2025
CoRR, October, 2025
A Unified Multi-Task Learning Framework for Generative Auto-Bidding with Validation-Aligned Optimization.
CoRR, October, 2025
CoRR, January, 2025
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025
Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments.
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Lessons and Winning Solutions in Industrial Object Detection and Pose Estimation from the 2025 Bin-Picking Perception Challenge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration.
CoRR, 2024
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
CoRR, 2024
A novel memetic algorithm for distributed shape formation of swarm robots with both acceleration and velocity constraints.
Sci. China Inf. Sci., 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
2023
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023