Jonathan Baxter

According to our database1, Jonathan Baxter authored at least 41 papers between 1995 and 2020.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2020
Theoretical Models of Learning to Learn.
CoRR, 2020

2019
Some observations concerning Off Training Set (OTS) error.
CoRR, 2019

General Matrix-Matrix Multiplication Using SIMD features of the PIII.
CoRR, 2019

Hebbian Synaptic Modifications in Spiking Neurons that Learn.
CoRR, 2019

92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster.
CoRR, 2019

Learning Internal Representations (PhD Thesis).
CoRR, 2019

2009
A tag in the hand: supporting semantic, social, and spatial navigation in museums.
Proceedings of the 27th International Conference on Human Factors in Computing Systems, 2009

Using technologies to support reminiscence.
Proceedings of the 2009 British Computer Society Conference on Human-Computer Interaction, 2009

2008
ArtLinks: fostering social awareness and reflection in museums.
Proceedings of the 2008 Conference on Human Factors in Computing Systems, 2008

2004
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning.
J. Mach. Learn. Res., 2004

2002
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning.
J. Comput. Syst. Sci., 2002

Scalable Internal-State Policy-Gradient Methods for POMDPs.
Proceedings of the Machine Learning, 2002

2001
Experiments with Infinite-Horizon, Policy-Gradient Estimation.
J. Artif. Intell. Res., 2001

Infinite-Horizon Policy-Gradient Estimation.
J. Artif. Intell. Res., 2001

Emmerald: a fast matrix-matrix multiply using Intel's SSE instructions.
Concurr. Comput. Pract. Exp., 2001

A Multi-Agent Policy-Gradient Approach to Network Routing.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

2000
Improved Generalization Through Explicit Optimization of Margins.
Mach. Learn., 2000

Learning to Play Chess Using Temporal Differences.
Mach. Learn., 2000

A Model of Inductive Bias Learning.
J. Artif. Intell. Res., 2000

98¢/Mflops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster.
Proceedings of the Proceedings Supercomputing 2000, 2000

Direct gradient-based reinforcement learning.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2000

Reinforcement Learning in POMDP's via Direct Gradient Ascent.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

General Matrix-Matrix Multiplication Using SIMD Features of the PIII (Research Note).
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Stochastic optimization of controlled partially observable Markov decision processes.
Proceedings of the 39th IEEE Conference on Decision and Control, 2000

1999
Guest Editors' Introduction.
Mach. Learn., 1999

KnightCap: A chess program that learns by combining TD(lambda) with game-tree search
CoRR, 1999

TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search
CoRR, 1999

Boosting Algorithms as Gradient Descent.
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

1998
Experiments in Parameter Learning Using Temporal Differences.
J. Int. Comput. Games Assoc., 1998

Direct Optimization of Margins Improves Generalization in Combined Classifiers.
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998

KnightCap: A Chess Programm That Learns by Combining TD(lambda) with Game-Tree Search.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998

The Canonical Distortion Measure for Vector Quantization and Function Approximation.
Proceedings of the Learning to Learn., 1998

Theoretical Models of Learning to Learn.
Proceedings of the Learning to Learn., 1998

1997
A Bayesian/Information Theoretic Model of Learning to Learn via Multiple Task Sampling.
Mach. Learn., 1997

The Canonical Distortion Measure in Feature Space and 1-NN Classification.
Proceedings of the Advances in Neural Information Processing Systems 10, 1997

The Canonical Distortion Measure for Vector Quantization and Function Approximation.
Proceedings of the Fourteenth International Conference on Machine Learning (ICML 1997), 1997

A Result Relating Convex <i>n</i>-Widths to Covering Numbers with some Applications to Neural Networks.
Proceedings of the Computational Learning Theory, Third European Conference, 1997

1996
Learning to Compress Ergodic Sources.
Proceedings of the 6th Data Compression Conference (DCC '96), Snowbird, Utah, USA, March 31, 1996

A Bayesian/Information Theoretic Model of Bias Learning.
Proceedings of the Ninth Annual Conference on Computational Learning Theory, 1996

1995
Learning Model Bias.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995

Learning Internal Representations.
Proceedings of the Eigth Annual Conference on Computational Learning Theory, 1995


  Loading...