Baoxiang Wang

According to our database1, Baoxiang Wang authored at least 15 papers between 2014 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Recurrent Existence Determination Through Policy Optimization.
CoRR, 2019

Private Q-Learning with Functional Noise in Continuous Spaces.
CoRR, 2019

Metatrace Actor-Critic: Online Step-Size Tuning by Meta-gradient Descent for Reinforcement Learning Control.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Recurrent Existence Determination Through Policy Optimization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Beyond Winning and Losing: Modeling Human Motivations and Behaviors with Vector-Valued Inverse Reinforcement Learning.
Proceedings of the Fifteenth AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2019

2018
Beyond Winning and Losing: Modeling Human Motivations and Behaviors Using Inverse Reinforcement Learning.
CoRR, 2018

Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control.
CoRR, 2018

Policy Optimization with Second-Order Advantage Information.
CoRR, 2018

Policy Optimization with Second-Order Advantage Information.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Policy Optimization with Second-Order Advantage Information.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Motion Sequence Decomposition-Based Hybrid Entropy Feature and Its Application to Fault Diagnosis of a High-Speed Automatic Mechanism.
Entropy, 2017

Research on the control method for high speed locomotion legged robot.
Proceedings of the 2nd International Conference on Advanced Robotics and Mechatronics, 2017

2016
Contextual Combinatorial Cascading Bandits.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
PAID: Prioritizing app issues for developers by tracking user reviews over versions.
Proceedings of the 26th IEEE International Symposium on Software Reliability Engineering, 2015

2014
Architecture design of the workflow execution system containing Humanware Services.
Proceedings of 11th IEEE International Conference on Networking, Sensing and Control, 2014


  Loading...