Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Unbiased online active learning in data streams.

Wei Chu

Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Doubly Robust Policy Evaluation and Learning.

Miroslav Dudík

John Langford

Lihong Li

Proceedings of the 28th International Conference on Machine Learning, 2011

2010

An Unbiased, Data-Driven, Offline Evaluation Method of Contextual Bandit Algorithms

Lihong Li

Wei Chu

John Langford

CoRR, 2010

An Optimal High Probability Algorithm for the Contextual Bandit Problem

CoRR, 2010

Reducing reinforcement learning to KWIK online regression.

Lihong Li

Michael L. Littman

Ann. Math. Artif. Intell., 2010

A contextual-bandit approach to personalized news article recommendation.

Proceedings of the 19th International Conference on World Wide Web, 2010

Parallelized Stochastic Gradient Descent.

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning from Logged Implicit Exploration Data.

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Online learning for recency search ranking using real-time user feedback.

Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009

Reinforcement Learning in Finite MDPs: PAC Analysis.

Alexander L. Strehl

Lihong Li

Michael L. Littman

J. Mach. Learn. Res., 2009

Provably Efficient Learning with Typed Parametric Models.

J. Mach. Learn. Res., 2009

Learning and planning in environments with delayed feedback.

Auton. Agents Multi Agent Syst., 2009

A Bayesian Sampling Approach to Exploration in Reinforcement Learning.

Proceedings of the UAI 2009, 2009

Reinforcement learning for dialog management using least-squares Policy iteration and fast feature selection.

Lihong Li

Jason D. Williams

Suhrid Balakrishnan

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Workshop summary: Results of the 2009 reinforcement learning competition.

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning.

Carlos Diuk

Lihong Li

Bethany R. Leffler

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Online exploration in least-squares policy iteration.

Lihong Li

Michael L. Littman

Christopher R. Mansley

Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2008

CORL: A Continuous-state Offset-dynamics Reinforcement Learner.

Proceedings of the UAI 2008, 2008

Sparse Online Learning via Truncated Gradient.

John Langford

Lihong Li

Tong Zhang

Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Efficient Value-Function Approximation via Online Linear Regression.

Lihong Li

Michael L. Littman

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning.

Ronald Parr

Lihong Li

Gavin Taylor

Christopher Painter-Wakefield

Michael L. Littman

Proceedings of the Machine Learning, 2008

Knows what it knows: a framework for self-aware learning.

Lihong Li

Michael L. Littman

Thomas J. Walsh

Proceedings of the Machine Learning, 2008

A worst-case comparison between temporal difference and residual gradient with linear function approximation.

Lihong Li

Proceedings of the Machine Learning, 2008

2007

Focus of Attention in Reinforcement Learning.

Lihong Li

Vadim Bulitko

Russell Greiner

J. Univers. Comput. Sci., 2007

Maintaining Equilibria During Exploration in Sponsored Search Auctions.

Proceedings of the Internet and Network Economics, Third International Workshop, 2007

Analyzing feature generation for value-function approximation.

Ronald Parr

Christopher Painter-Wakefield

Lihong Li

Michael L. Littman

Proceedings of the Machine Learning, 2007

Planning and Learning in Environments with Delayed Feedback.

Proceedings of the Machine Learning: ECML 2007, 2007

2006

Incremental Model-based Learners With Formal Learning-Time Guarantees.

Alexander L. Strehl

Lihong Li

Michael L. Littman

Proceedings of the UAI '06, 2006

Towards a Unified Theory of State Abstraction for MDPs.

Lihong Li

Thomas J. Walsh

Michael L. Littman

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2006

PAC model-free reinforcement learning.

Proceedings of the Machine Learning, 2006

2005

Lazy Approximation for Solving Continuous Finite-Horizon MDPs.

Lihong Li

Michael L. Littman

Proceedings of the Proceedings, 2005

2004

Batch Reinforcement Learning with State Importance.

Lihong Li

Vadim Bulitko

Russell Greiner

Proceedings of the Machine Learning: ECML 2004, 2004

2003

Lookahead Pathologies for Single Agent Search.

Proceedings of the IJCAI-03, 2003

Towards Automated Creation of Image Interpretation Systems.

Proceedings of the AI 2003: Advances in Artificial Intelligence, 2003

Lihong Li
