Lex Weaver

According to our database, Lex Weaver authored at least 12 papers between 1998 and 2001.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2001
Experiments with Infinite-Horizon, Policy-Gradient Estimation.
J. Artif. Intell. Res., 2001

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning.
UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

A Multi-Agent Policy-Gradient Approach to Network Routing.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

2000
Learning to Play Chess Using Temporal Differences.
Mach. Learn., 2000

Sorting Integers on the AP1000
CoRR, 2000

Design and Evaluation of Mechanisms for a Multicomputer Object Store
CoRR, 2000

1999
KnightCap: A chess program that learns by combining TD(lambda) with game-tree search
CoRR, 1999

TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search
CoRR, 1999

1998
Experiments in Parameter Learning Using Temporal Differences.
J. Int. Comput. Games Assoc., 1998

Evolution of Neural Networks to Play the Game of Dots-and-Boxes
CoRR, 1998

Pre-fetching tree-structured data in distributed memory
CoRR, 1998

KnightCap: A Chess Program That Learns by Combining TD(lambda) with Game-Tree Search.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998

