Mickel Liu
According to our database, Mickel Liu authored at least 11 papers between 2022 and 2025.
Bibliography
2025
Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries.
CoRR, July, 2025
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.
CoRR, June, 2025
Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models.
CoRR, June, 2025
CoRR, March, 2025
2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset.
CoRR, 2023
CoRR, 2023
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022