Sayak Ray Chowdhury

CoRR, January, 2026

DP-NCB: Privacy Preserving Fair Bandits.

[BibT_eX]

[DOI]

Dhruv Sarkar

Nishant Pandey

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Revisiting Social Welfare in Bandits: UCB is (Nearly) All You Need.

[BibT_eX]

[DOI]

Dhruv Sarkar

Nishant Pandey

CoRR, October, 2025

Why DPO is a Misspecified Estimator and How to Fix It.

[BibT_eX]

[DOI]

Debangshu Banerjee

CoRR, October, 2025

Constrained Adversarial Perturbation.

[BibT_eX]

[DOI]

CoRR, October, 2025

KITE: Kernelized and Information Theoretic Exemplars for In-Context Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

Active Preference Optimization for Sample Efficient RLHF.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2025

Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Communication Efficient Secure and Private Multi-Party Deep Learning.

[BibT_eX]

[DOI]

IACR Cryptol. ePrint Arch., 2024

Provably Sample Efficient RLHF via Active Preference Optimization.

[BibT_eX]

[DOI]

CoRR, 2024

OAK: Enriching Document Representations using Auxiliary Knowledge for Extreme Classification.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Provably Robust DPO: Aligning Language Models with Noisy Feedback.

[BibT_eX]

[DOI]

Anush Kini

Nagarajan Natarajan

Proceedings of the Forty-first International Conference on Machine Learning, 2024

On Differentially Private Federated Linear Contextual Bandits.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Differentially Private Reward Estimation with Preference Feedback.

[BibT_eX]

[DOI]

Nagarajan Natarajan

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

GAR-meets-RAG Paradigm for Zero-Shot Information Retrieval.

[BibT_eX]

[DOI]

CoRR, 2023

Combinatorial categorized bandits with expert rankings.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Distributed Differential Privacy in Multi-Armed Bandits.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Bregman Deviations of Generic Exponential Families.

[BibT_eX]

[DOI]

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Model Selection in Reinforcement Learning with General Function Approximations.

[BibT_eX]

[DOI]

Avishek Ghosh

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Shuffle Private Linear Contextual Bandits.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Value Function Approximations via Kernel Embeddings for No-Regret Reinforcement Learning.

[BibT_eX]

[DOI]

Rafael Oliveira

Proceedings of the Asian Conference on Machine Learning, 2022

Differentially Private Regret Minimization in Episodic Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Model Selection with Near Optimal Rates for Reinforcement Learning with General Model Classes.

[BibT_eX]

[DOI]

Avishek Ghosh

Kannan Ramchandran

CoRR, 2021

Adaptive Control of Differentially Private Linear Quadratic Systems.

[BibT_eX]

[DOI]

Ness B. Shroff

Proceedings of the IEEE International Symposium on Information Theory, 2021

Reinforcement Learning in Parametric MDPs with Exponential Families.

[BibT_eX]

[DOI]

Odalric-Ambrym Maillard

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

No-regret Algorithms for Multi-task Bayesian Optimization.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020

No-Regret Reinforcement Learning with Value Function Approximation: a Kernel Embedding Approach.

[BibT_eX]

[DOI]

Rafael Oliveira

CoRR, 2020

Active Learning of Conditional Mean Embeddings via Bayesian Optimisation.

[BibT_eX]

[DOI]

Rafael Oliveira

Fabio Ramos

Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

2019

On Batch Bayesian Optimization.

[BibT_eX]

[DOI]

CoRR, 2019

Bayesian Optimization under Heavy-tailed Payoffs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Online Learning in Kernelized Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2017

On Kernelized Multi-armed Bandits.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

Misspecified Linear Bandits.

[BibT_eX]

[DOI]

Avishek Ghosh