Tiehua Mei
Orcid: 0009-0005-9677-4653
According to our database1,
Tiehua Mei authored at least 6 papers
between 2025 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation.
CoRR, May, 2026
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment.
CoRR, May, 2026
Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning.
CoRR, March, 2026
2025
Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning.
CoRR, December, 2025
GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025