Kishan Panaganti

Orcid: 0000-0001-9746-698X

According to our database, Kishan Panaganti authored at least 30 papers between 2020 and 2026.

Bibliography

2026
Distributionally Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value Factorization.
CoRR, February, 2026

The Single-Multi Evolution Loop for Self-Improving Model Collaboration Systems.
CoRR, February, 2026

Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning.
CoRR, January, 2026

2025
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning.
CoRR, December, 2025

Guided Self-Evolving LLMs with Minimal Human Supervision.
CoRR, December, 2025

Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values.
CoRR, October, 2025

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation.
CoRR, September, 2025

Risk-Averse Total-Reward Reinforcement Learning.
CoRR, June, 2025

TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy.
CoRR, June, 2025

Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees.
CoRR, May, 2025

KL-regularization Itself is Differentially Private in Bandits and RLHF.
CoRR, May, 2025

Distributionally Robust Direct Preference Optimization.
CoRR, February, 2025

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage.
Proceedings of the 7th Annual Learning for Dynamics & Control Conference, 2025

Online Robust Reinforcement Learning Through Monte-Carlo Planning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Tractable Multi-Agent Reinforcement Learning through Behavioral Economics.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Plan for the Worst with Advice: Advice-augmented Robust Markov Decision Processes.
Proceedings of the 64th IEEE Conference on Decision and Control, 2025

Off-Policy Evaluation Using Information Borrowing and Context-Based Switching.
Proceedings of the 64th IEEE Conference on Decision and Control, 2025

Hybrid Transfer Reinforcement Learning: Provable Sample Efficiency from Shifted-Dynamics Data.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024
Tractable Equilibrium Computation in Markov Games through Risk Aversion.
CoRR, 2024

Distributionally Robust Constrained Reinforcement Learning under Strong Duality.
RLJ, 2024

Model-Free Robust ϕ-Divergence Reinforcement Learning Using Both Offline and Online Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Personalized Reward Learning with Interaction-Grounded Learning (IGL).
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Distributionally Robust Behavioral Cloning for Robust Imitation Learning.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Improved Sample Complexity Bounds for Distributionally Robust Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Interaction-Grounded Learning for Recommender Systems.
Proceedings of the 5th Workshop on Online Recommender Systems and User Modeling co-located with the 16th ACM Conference on Recommender Systems, 2022

Robust Reinforcement Learning using Offline Data.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample Complexity of Robust Reinforcement Learning with a Generative Model.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Bounded Regret for Finitely Parameterized Multi-Armed Bandits.
IEEE Control. Syst. Lett., 2021

Sample Complexity of Model-Based Robust Reinforcement Learning.
Proceedings of the 60th IEEE Conference on Decision and Control, 2021

2020
Model-Free Robust Reinforcement Learning with Linear Function Approximation.
CoRR, 2020

