Chenlu Ye

According to our database, Chenlu Ye authored at least 16 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Transformers as Multi-task Learners: Decoupling Features in Hidden Markov Models.
CoRR, June, 2025

Understanding Overadaptation in Supervised Fine-Tuning: The Role of Ensemble Methods.
CoRR, June, 2025

Daunce: Data Attribution through Uncertainty Estimation.
CoRR, May, 2025

Self-rewarding correction for mathematical reasoning.
CoRR, February, 2025

Logarithmic Regret for Online KL-Regularized Reinforcement Learning.
CoRR, February, 2025

Catoni Contextual Bandits are Robust to Heavy-tailed Rewards.
CoRR, February, 2025

2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF.
CoRR, 2024

A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference.
CoRR, 2024

Online Iterative Reinforcement Learning from Human Feedback with General Preference Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Gibbs Sampling from Human Feedback: A Provable KL-constrained Framework for RLHF.
CoRR, 2023

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks.
CoRR, 2023

Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning.
CoRR, 2023

Corruption-Robust Offline Reinforcement Learning with General Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2023

