Jalaj Bhandari

Orcid: 0000-0002-7115-8986

According to our database1, Jalaj Bhandari authored at least 15 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Structure Enables Effective Self-Localization of Errors in LLMs.
CoRR, February, 2026

2025
Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO.
CoRR, November, 2025

A Note on Code Quality Score: LLMs for Maintainable Large Codebases.
CoRR, August, 2025

Aligned Multi Objective Optimization.
CoRR, February, 2025

Aligned Multi Objective Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Pearl: A Production-Ready Reinforcement Learning Agent.
J. Mach. Learn. Res., 2024

Global Optimality Guarantees for Policy Gradient Methods.
Oper. Res., 2024

2023
Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning.
Proceedings of the 17th ACM Conference on Recommender Systems, 2023

2021
On the Linear Convergence of Policy Gradient Methods for Finite MDPs.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Optimization Foundations of Reinforcement Learning.
PhD thesis, 2020

A Note on the Linear Convergence of Policy Gradient Methods.
CoRR, 2020

2018
A Finite Time Analysis of Temporal Difference Learning With Linear Function Approximation.
Proceedings of the Conference On Learning Theory, 2018

2017
Annular Augmentation Sampling.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
On the tightness of an LP relaxation for rational optimization and its applications.
Oper. Res. Lett., 2016

Elliptical Slice Sampling with Expectation Propagation.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016


  Loading...