Lanjihong Ma

Orcid: 0000-0001-7978-6146

According to our database1, Lanjihong Ma authored at least 5 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Data-dependent Exploration for Online Reinforcement Learning from Human Feedback.
CoRR, May, 2026

2025
Learning Objective Adaptation by Correlation-Based Model Reuse.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

Achieving Nearly-Optimal Regret and Sample Complexity in Dueling Bandits with Applications in Online Recommendations.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

2024
Handling Varied Objectives by Online Decision Making.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

2020
An Unbiased Risk Estimator for Learning with Augmented Classes.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020


  Loading...