Kishan Panaganti

ORCID: 0000-0001-9746-698X

According to our database, Kishan Panaganti authored at least 31 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis.

CoRR, May, 2026

Distributionally Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value Factorization.

CoRR, February, 2026

The Single-Multi Evolution Loop for Self-Improving Model Collaboration Systems.

CoRR, February, 2026

Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning.

CoRR, January, 2026

2025

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning.

CoRR, December, 2025

Guided Self-Evolving LLMs with Minimal Human Supervision.

CoRR, December, 2025

Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values.

CoRR, October, 2025

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation.

CoRR, September, 2025

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy.

Héctor Carrión

Yutong Bai

Víctor A. Hernández Castro

CoRR, June, 2025

Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees.

CoRR, May, 2025

KL-regularization Itself is Differentially Private in Bandits and RLHF.

CoRR, May, 2025

Distributionally Robust Direct Preference Optimization.

CoRR, February, 2025

Risk-Averse Total-Reward Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage.

Proceedings of the 7th Annual Learning for Dynamics & Control Conference, 2025

Online Robust Reinforcement Learning Through Monte-Carlo Planning.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Tractable Multi-Agent Reinforcement Learning through Behavioral Economics.

Eric Mazumdar

Kishan Panaganti

Laixi Shi

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Plan for the Worst with Advice: Advice-augmented Robust Markov Decision Processes.

Proceedings of the 64th IEEE Conference on Decision and Control, 2025

Off-Policy Evaluation Using Information Borrowing and Context-Based Switching.

Proceedings of the 64th IEEE Conference on Decision and Control, 2025

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024

Tractable Equilibrium Computation in Markov Games through Risk Aversion.

Eric Mazumdar

Kishan Panaganti

Laixi Shi

CoRR, 2024

Distributionally Robust Constrained Reinforcement Learning under Strong Duality.

RLJ, 2024

Model-Free Robust ϕ-Divergence Reinforcement Learning Using Both Offline and Online Data.

Kishan Panaganti

Adam Wierman

Eric Mazumdar

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Personalized Reward Learning with Interaction-Grounded Learning (IGL).

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Distributionally Robust Behavioral Cloning for Robust Imitation Learning.

Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning.

Zaiyan Xu

Kishan Panaganti

Dileep Kalathil

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Interaction-Grounded Learning for Recommender Systems.

Proceedings of the 5th Workshop on Online Recommender Systems and User Modeling co-located with the 16th ACM Conference on Recommender Systems, 2022

Robust Reinforcement Learning using Offline Data.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample Complexity of Robust Reinforcement Learning with a Generative Model.

Kishan Panaganti

Dileep M. Kalathil

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Bounded Regret for Finitely Parameterized Multi-Armed Bandits.

Kishan Panaganti

Dileep M. Kalathil

IEEE Control. Syst. Lett., 2021

Sample Complexity of Model-Based Robust Reinforcement Learning.

Kishan Panaganti

Dileep Kalathil

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020

Model-Free Robust Reinforcement Learning with Linear Function Approximation.

Kishan Panaganti

Dileep M. Kalathil

CoRR, 2020
