Dongcui Diao

According to our database1, Dongcui Diao authored at least 5 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Class Interference of Deep Neural Networks.
CoRR, 2022

Sigmoidally Preconditioned Off-policy Learning: a new exploration method for reinforcement learning.
CoRR, 2022

2009
Multi-Step Dyna Planning for Policy Evaluation and Control.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

2006
Historical Temporal Difference Learning: Some Initial Results.
Proceedings of the Interdisciplinary and Multidisciplinary Research in Computer Science, 2006


  Loading...