Sham M. Kakade
Affiliations: Harvard University, Cambridge, MA, USA
 University of Washington, Department of Statistics, Seattle, WA, USA (former)
 Microsoft Research New England, Cambridge, MA, USA (former)
 Toyota Technological Institute at Chicago, IL, USA (former)
 University of Pennsylvania, Department of Statistics, Philadelphia, PA, USA (former)
 University College London, Gatsby Computational Neuroscience Unit, UK
According to our database^{1},
Sham M. Kakade
authored at least 228 papers
between 1999 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:

on twitter.com
On csauthors.net:
Bibliography
2024
Trans. Mach. Learn. Res., 2024
CoRR, 2024
CoRR, 2024
DataCompLM: In search of the next generation of training sets for language models.
CoRR, 2024
CoRR, 2024
CoLoRFilter: Conditional Loss Reduction Filtering for Targeted Language Model Pretraining.
CoRR, 2024
CoRR, 2024
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass.
CoRR, 2024
Matching the Statistical Query Lower Bound for ksparse Parity Problems with Stochastic Gradient Descent.
CoRR, 2024
Follow My Instruction and Spill the Beans: Scalable Data Extraction from RetrievalAugmented Generation Systems.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
ModelBased MultiAgent RL in ZeroSum Markov Games with NearOptimal Sample Complexity.
J. Mach. Learn. Res., 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the IEEE Statistical Signal Processing Workshop, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023
2022
IEEE Trans. Signal Process., 2022
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
The Power and Limitation of PretrainingFinetuning for Linear Regression under Covariate Shift.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
2021
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift.
J. Mach. Learn. Res., 2021
On Nonconvex Optimization for Machine Learning: Gradients, Stochasticity, and Saddle Points.
J. ACM, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
An Exponential Lower Bound for LinearlyRealizable MDPs with Constant Suboptimality Gap.
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
An Exponential Lower Bound for Linearly Realizable MDP with Constant Suboptimality Gap.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the Conference on Learning Theory, 2021
2020
Found. Comput. Math., 2020
IEEE Data Eng. Bull., 2020
CoRR, 2020
Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning?
CoRR, 2020
CoRR, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Provable Representation Learning for Imitation Learning via Bilevel Optimization.
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the Conference on Learning Theory, 2020
Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes.
Proceedings of the Conference on Learning Theory, 2020
Proceedings of the Algorithmic Learning Theory, 2020
Proceedings of the Algorithmic Learning Theory, 2020
2019
CoRR, 2019
CoRR, 2019
CoRR, 2019
The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure.
CoRR, 2019
CoRR, 2019
A Short Note on Concentration Inequalities for Random Vectors with SubGaussian Norm.
CoRR, 2019
The Illusion of Change: Correcting for Biases in Change Inference for Sparse, SocietalScale Data.
Proceedings of the World Wide Web Conference, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Plan Online, Learn Offline: Efficient Learning and Exploration via ModelBased Control.
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the Conference on Learning Theory, 2019
2018
CoRR, 2018
CoRR, 2018
Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 9th Innovations in Theoretical Computer Science Conference, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Variance Reduction for Policy Gradient with ActionDependent Factorized Baselines.
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Conference On Learning Theory, 2018
2017
Parallelizing Stochastic Gradient Descent for Least Squares Regression: Minibatching, Averaging, and Model Misspecification.
J. Mach. Learn. Res., 2017
CoRR, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares).
Proceedings of the 37th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, 2017
Global Convergence of NonConvex Gradient Descent for Computing Matrix Squareroot.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017
2016
IEEE Trans. Signal Process., 2016
CoRR, 2016
CoRR, 2016
Matching Matrix Bernstein with Little Memory: NearOptimal Finite Sample Guarantees for Oja's Algorithm.
CoRR, 2016
Provable Efficient Online Matrix Completion via Nonconvex Stochastic Gradient Descent.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Efficient Algorithms for Largescale Generalized Eigenvector Computation and Canonical Correlation Analysis.
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Streaming PCA: Matching Matrix Bernstein and NearOptimal Finite Sample Guarantees for Oja's Algorithm.
Proceedings of the 29th Conference on Learning Theory, 2016
2015
When are overcomplete topic models identifiable? uniqueness of tensor tucker decompositions with structured sparsity.
J. Mach. Learn. Res., 2015
Robust ShiftandInvert Preconditioning: Faster and More Sample Efficient Algorithms for Eigenvector Computation.
CoRR, 2015
CoRR, 2015
Algorithmica, 2015
Proceedings of the FortySeventh Annual ACM on Symposium on Theory of Computing, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Unregularizing: approximate proximal point and faster stochastic algorithms for empirical risk minimization.
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of The 28th Conference on Learning Theory, 2015
Proceedings of the Algorithmic Learning Theory  26th International Conference, 2015
2014
J. Mach. Learn. Res., 2014
J. Mach. Learn. Res., 2014
Found. Comput. Math., 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 52nd Annual Allerton Conference on Communication, 2014
2013
SIAM J. Optim., 2013
J. Mach. Learn. Res., 2013
Oper. Res., 2013
Learning mixtures of spherical gaussians: moment methods and spectral decompositions.
Proceedings of the Innovations in Theoretical Computer Science, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the COLT 2013, 2013
2012
InformationTheoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting.
IEEE Trans. Inf. Theory, 2012
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
J. Mach. Learn. Res., 2012
Proceedings of the COLT 2012, 2012
Proceedings of the COLT 2012, 2012
Proceedings of the COLT 2012, 2012
J. Comput. Syst. Sci., 2012
CoRR, 2012
CoRR, 2012
Two SVDs Suffice: Spectral decompositions for probabilistic topic modeling and latent Dirichlet allocation
CoRR, 2012
CoRR, 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 36, 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 36, 2012
2011
IEEE Trans. Inf. Theory, 2011
SIGecom Exch., 2011
Preface.
Proceedings of the COLT 2011, 2011
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011
CoRR, 2011
CoRR, 2011
CoRR, 2011
CoRR, 2011
Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 1214 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 1214 December 2011, 2011
2010
Mach. Learn., 2010
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010
CoRR, 2010
CoRR, 2010
CoRR, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 69 December 2010, 2010
Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design.
Proceedings of the 27th International Conference on Machine Learning (ICML10), 2010
2009
SIAM J. Comput., 2009
Math. Oper. Res., 2009
CoRR, 2009
CoRR, 2009
Applications of strong convexitystrong smoothness duality to learning with matrices
CoRR, 2009
Proceedings of the Proceedings 10th ACM Conference on Electronic Commerce (EC2009), 2009
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 710 December 2009, 2009
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
2008
IEEE Trans. Inf. Theory, 2008
J. Comput. Syst. Sci., 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
On the Complexity of Linear Prediction: Risk Bounds, Margin Bounds, and Regularization.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Machine Learning, 2008
Proceedings of the 21st Annual Conference on Learning Theory, 2008
Proceedings of the 21st Annual Conference on Learning Theory, 2008
Proceedings of the 21st Annual Conference on Learning Theory, 2008
2007
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the IJCAI 2007, 2007
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007
Proceedings of the Learning Theory, 20th Annual Conference on Learning Theory, 2007
2006
Proceedings of the Proceedings 7th ACM Conference on Electronic Commerce (EC2006), 2006
Proceedings of the 2006 IEEE Information Theory Workshop, 2006
Proceedings of the Machine Learning, 2006
2005
Proceedings of the UAI '05, 2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
Proceedings of the IJCAI05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005
Proceedings of the Learning Theory, 18th Annual Conference on Learning Theory, 2005
2004
Proceedings of the Proceedings 5th ACM Conference on Electronic Commerce (EC2004), 2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
Proceedings of the Learning Theory, 17th Annual Conference on Learning Theory, 2004
2003
Proceedings of the Proceedings 4th ACM Conference on Electronic Commerce (EC2003), 2003
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003
Proceedings of the Machine Learning, 2003
2002
Neural Networks, 2002
Neural Networks, 2002
Competitive Analysis of the Explore/Exploit Tradeoff.
Proceedings of the Machine Learning, 2002
An Alternate Objective Function for Markovian Fields.
Proceedings of the Machine Learning, 2002
Approximately Optimal Approximate Reinforcement Learning.
Proceedings of the Machine Learning, 2002
2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Proceedings of the Computational Learning Theory, 2001
2000
Proceedings of the Advances in Neural Information Processing Systems 13, 2000
Proceedings of the Advances in Neural Information Processing Systems 13, 2000
1999
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999