Ngo Anh Vien

According to our database1, Ngo Anh Vien authored at least 37 papers between 2007 and 2018.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes.
CoRR, 2018

Scalable and Interpretable One-class SVMs with Deep Learning and Random Fourier features.
CoRR, 2018

A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes.
IEEE Access, 2018

Bayesian Functional Optimization.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Inverse KKT: Learning cost functions of manipulation tasks from demonstrations.
I. J. Robotics Res., 2017

Deep reinforcement learning algorithms for steering an underactuated ship.
Proceedings of the 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2017

A Covariance Matrix Adaptation Evolution Strategy for Direct Policy Search in Reproducing Kernel Hilbert Space.
Proceedings of The 9th Asian Conference on Machine Learning, 2017

2016
Bayes-adaptive hierarchical MDPs.
Appl. Intell., 2016

Policy Search in Reproducing Kernel Hilbert Space.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Relational activity processes for modeling concurrent cooperation.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

2015
POMDP manipulation via trajectory optimization.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Touch based POMDP manipulation via sequential submodular optimization.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Hierarchical Monte-Carlo Planning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Efficient Interactive Multiclass Learning from Binary Feedback.
TiiS, 2014

Approximate planning for bayesian hierarchical reinforcement learning.
Appl. Intell., 2014

Model-Based Relational RL When Object Existence is Partially Observable.
Proceedings of the 31th International Conference on Machine Learning, 2014

Monte carlo bayesian hierarchical reinforcement learning.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013
Monte-Carlo tree search for Bayesian reinforcement learning.
Appl. Intell., 2013

Learning via human feedback in continuous state and action spaces.
Appl. Intell., 2013

Upper Confidence Weighted Learning for Efficient Exploration in Multiclass Prediction with Binary Feedback.
Proceedings of the IJCAI 2013, 2013

2012
Monte Carlo Tree Search for Bayesian Reinforcement Learning.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Reinforcement learning combined with human feedback in continuous state and action spaces.
Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012

Learning via Human Feedback in Continuous State and Action Spaces.
Proceedings of the Robots Learning Interactively from Human Teachers, 2012

2011
Hessian matrix distribution for Bayesian policy gradient reinforcement learning.
Inf. Sci., 2011

Nomogram Visualization for Ranking Support Vector Machine.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011

2010
Policy Gradient Based Semi-Markov Decision Problems: Approximation and Estimation Errors.
IEICE Transactions, 2010

Monte Carlo Value Iteration for Continuous-State POMDPs.
Proceedings of the Algorithmic Foundations of Robotics IX, 2010

2009
Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks.
IEICE Transactions, 2009

Probabilistic Ranking Support Vector Machine.
Proceedings of the Advances in Neural Networks, 2009

VRIFA: a nonlinear SVM visualization tool using nomogram and localized radial basis function (LRBF) kernels.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Policy Gradient Semi-markov Decision Process.
Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008), 2008

Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008

Efficient Distributed Sensor Dispatch in Mobile Sensor Network.
Proceedings of the 22nd International Conference on Advanced Information Networking and Applications, 2008

Obstacle Avoidance Path Planning for Mobile Robot Based on Multi Colony Ant Algorithm.
Proceedings of the First International Conference on Advances in Computer-Human Interaction, 2008

2007
Heuristic Search Based Exploration in Reinforcement Learning.
Proceedings of the Computational and Ambient Intelligence, 2007

Obstacle Avoidance Path Planning for Mobile Robot Based on Ant-Q Reinforcement Learning Algorithm.
Proceedings of the Advances in Neural Networks, 2007

Natural Gradient Policy for Average Cost SMDP Problem.
Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007), 2007


  Loading...