Michael O. Duff

According to our database1, Michael O. Duff authored at least 7 papers between 1993 and 2003.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2003
Diffusion Approximation for Bayesian Markov Chains.
Proceedings of the Machine Learning, 2003

Design for an Optimal Probe.
Proceedings of the Machine Learning, 2003

2001
Monte-Carlo Algorithms for the Improvement of Finite-State Stochastic Controllers: Application to Bayes-Adaptive Markov Decision Processes.
Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics, 2001

1996
Local Bandit Approximation for Optimal Learning Problems.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

1995
Q-Learning for Bandit Problems.
Proceedings of the Machine Learning, 1995

1994
Reinforcement Learning Methods for Continuous-Time Markov Decision Problems.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

1993
Monte Carlo Matrix Inversion and Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 6, 1993


  Loading...