# Akshay Krishnamurthy

According to our database

Collaborative distances:

^{1}, Akshay Krishnamurthy authored at least 68 papers between 2010 and 2020.Collaborative distances:

## Timeline

#### Legend:

Book In proceedings Article PhD thesis Other## Links

#### On csauthors.net:

## Bibliography

2020

Sample-Efficient Reinforcement Learning of Undercomplete POMDPs.

CoRR, 2020

Information Theoretic Regret Bounds for Online Nonlinear Control.

CoRR, 2020

Open Problem: Model Selection for Contextual Bandits.

CoRR, 2020

Provably adaptive reinforcement learning in metric spaces.

CoRR, 2020

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs.

CoRR, 2020

Efficient Contextual Bandits with Continuous Actions.

CoRR, 2020

Contrastive estimation reveals topic posterior information to linear models.

CoRR, 2020

Corrupted Multidimensional Binary Search: Learning in the Presence of Irrational Agents.

CoRR, 2020

Adaptive Estimator Selection for Off-Policy Evaluation.

CoRR, 2020

Reward-Free Exploration for Reinforcement Learning.

CoRR, 2020

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds.

Proceedings of the 8th International Conference on Learning Representations, 2020

Algebraic and Analytic Approaches for Parameter Learning in Mixture Models.

Proceedings of the Algorithmic Learning Theory, 2020

2019

Active Learning for Cost-Sensitive Classification.

J. Mach. Learn. Res., 2019

Optimism in Reinforcement Learning with Generalized Linear Function Approximation.

CoRR, 2019

Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning.

CoRR, 2019

Sample Complexity of Learning Mixtures of Sparse Linear Regressions.

CoRR, 2019

Robust Dynamic Assortment Optimization in the Presence of Outlier Customers.

CoRR, 2019

Doubly robust off-policy evaluation with shrinkage.

CoRR, 2019

Sample Complexity of Learning Mixture of Sparse Linear Regressions.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Model Selection for Contextual Bandits.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Scalable Hierarchical Clustering with Tree Grafting.

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Myopic Posterior Sampling for Adaptive Goal Oriented Design of Experiments.

Proceedings of the 36th International Conference on Machine Learning, 2019

Provably efficient RL with Rich Observations via Latent State Decoding.

Proceedings of the 36th International Conference on Machine Learning, 2019

Trace Reconstruction: Generalized and Parameterized.

Proceedings of the 27th Annual European Symposium on Algorithms, 2019

Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches.

Proceedings of the Conference on Learning Theory, 2019

Contextual bandits with continuous actions: Smoothing, zooming, and adapting.

Proceedings of the Conference on Learning Theory, 2019

Disagreement-Based Combinatorial Pure Exploration: Sample Complexity Bounds and an Efficient Algorithm.

Proceedings of the Conference on Learning Theory, 2019

2018

Extreme Compressive Sampling for Covariance Estimation.

IEEE Trans. Inf. Theory, 2018

Model-Based Reinforcement Learning in Contextual Decision Processes.

CoRR, 2018

Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming.

CoRR, 2018

On Polynomial Time PAC Reinforcement Learning with Rich Observations.

CoRR, 2018

Contextual bandits with surrogate losses: Margin bounds and efficient algorithms.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

On Oracle-Efficient PAC RL with Rich Observations.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Semiparametric Contextual Bandits.

Proceedings of the 35th International Conference on Machine Learning, 2018

Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning.

Proceedings of the 6th International Conference on Learning Representations, 2018

Parallelised Bayesian Optimisation via Thompson Sampling.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017

Disagreement-based combinatorial pure exploration: Efficient algorithms and an analysis with localization.

CoRR, 2017

An Online Hierarchical Algorithm for Extreme Clustering.

CoRR, 2017

Asynchronous Parallel Bayesian Optimisation via Thompson Sampling.

CoRR, 2017

Off-policy evaluation for slate recommendation.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

A Hierarchical Algorithm for Extreme Clustering.

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Contextual Decision Processes with low Bellman rank are PAC-Learnable.

Proceedings of the 34th International Conference on Machine Learning, 2017

Open Problem: First-Order Regret Bounds for Contextual Bandits.

Proceedings of the 30th Conference on Learning Theory, 2017

Go for a Walk and Arrive at the Answer: Reasoning Over Knowledge Bases with Reinforcement Learning.

Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

2016

Contextual-MDPs for PAC-Reinforcement Learning with Rich Observations.

CoRR, 2016

Exploratory Gradient Boosting for Reinforcement Learning in Complex Domains.

CoRR, 2016

Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

PAC Reinforcement Learning with Rich Observations.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Contextual semibandits via supervised learning oracles.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Minimax structured normal means inference.

Proceedings of the IEEE International Symposium on Information Theory, 2016

Efficient Algorithms for Adversarial Contextual Learning.

Proceedings of the 33nd International Conference on Machine Learning, 2016

2015

Efficient Contextual Semi-Bandit Learning.

CoRR, 2015

Minimaxity in Structured Normal Means Inference.

CoRR, 2015

Nonparametric von Mises Estimators for Entropies, Divergences and Mutual Informations.

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning to Search Better than Your Teacher.

Proceedings of the 32nd International Conference on Machine Learning, 2015

On Estimating L22 Divergence.

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014

On the Power of Adaptivity in Matrix Completion and Approximation.

CoRR, 2014

Influence Functions for Machine Learning: Nonparametric Estimators for Entropies, Divergences and Mutual Informations.

CoRR, 2014

Nonparametric Estimation of Renyi Divergence and Friends.

Proceedings of the 31th International Conference on Machine Learning, 2014

Subspace learning from extremely compressed measurements.

Proceedings of the 48th Asilomar Conference on Signals, Systems and Computers, 2014

2013

Near-optimal Anomaly Detection in Graphs using Lovasz Extended Scan Statistic.

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Low-Rank Matrix and Tensor Completion via Adaptive Sampling.

Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Detecting Activations over Graphs using Spanning Tree Wavelet Bases.

Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

Recovering graph-structured activations using adaptive compressive measurements.

Proceedings of the 2013 Asilomar Conference on Signals, 2013

2012

Robust multi-source network tomography using selective probes.

Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012

Efficient Active Algorithms for Hierarchical Clustering.

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Noise Thresholds for Spectral Clustering.

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

2010

Fine-grained privilege separation for web applications.

Proceedings of the 19th International Conference on World Wide Web, 2010