Ngo Anh Vien

Gerhard Neumann

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

A Hybrid Approach for Learning to Shift and Grasp with Elaborate Motion Primitives.

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion.

Proceedings of the Computer Vision - ECCV 2022, 2022

What Matters For Meta-Learning Vision Regression Tasks?

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Deep Black-Box Reinforcement Learning with Movement Primitives.

Proceedings of the Conference on Robot Learning, 2022

2021

Deep Learning-Aided Multicarrier Systems.

IEEE Trans. Wirel. Commun., 2021

Differentiable Robust LQR Layers.

Gerhard Neumann

CoRR, 2021

Constrained representation learning for recurrent policy optimisation under uncertainty.

Viet-Hung Dang

Adapt. Behav., 2021

Real-Time Energy Harvesting Aided Scheduling in UAV-Assisted D2D Networks Relying on Deep Reinforcement Learning.

IEEE Access, 2021

Non-local Graph Convolutional Network for joint Activity Recognition and Motion Prediction.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Residual Feedback Learning for Contact-Rich Manipulation Tasks with Uncertainty.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Differentiable Trust Region Layers for Deep Reinforcement Learning.

Fabian Otto

Philipp Becker

Hanna Carolin Maria Ziesche

Gerhard Neumann

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

Deep Energy Autoencoder for Noncoherent Multicarrier MU-SIMO Systems.

IEEE Trans. Wirel. Commun., 2020

Improving Path Planning Methods in 2D Grid Maps.

J. Comput., 2020

Bayes-Adaptive Deep Model-Based Policy Optimisation.

Tai Hoang

CoRR, 2020

Asynchronous framework with Reptile+ algorithm to meta learn partially observable Markov decision process.

Appl. Intell., 2020

Graph-Based Motion Planning Networks.

Tai Hoang

Dimitrios S. Nikolopoulos

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

Fast Analysis and Prediction in Large Scale Virtual Machines Resource Utilisation.

Proceedings of the 10th International Conference on Cloud Computing and Services Science, 2020

2019

Deep Learning-Based Detector for OFDM-IM.

IEEE Wirel. Commun. Lett., 2019

A covariance matrix adaptation evolution strategy in reproducing kernel Hilbert space.

Viet-Hung Dang

Genet. Program. Evolvable Mach., 2019

Importance sampling policy gradient algorithms in reproducing kernel Hilbert space.

Artif. Intell. Rev., 2019

Distributed Deep Deterministic Policy Gradient for Power Allocation Control in D2D-Based V2V Communications.

IEEE Access, 2019

Non-Cooperative Energy Efficient Power Allocation Game in D2D Communication: A Multi-Agent Deep Reinforcement Learning Approach.

IEEE Access, 2019

2018

Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes.

CoRR, 2018

A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes.

Tuyen Pham Le

IEEE Access, 2018

Scalable and Interpretable One-Class SVMs with Deep Learning and Random Fourier Features.

Minh-Nghia Nguyen

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018

Bayesian Functional Optimization.

Heiko Zimmermann

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Inverse KKT: Learning cost functions of manipulation tasks from demonstrations.

Peter Englert

Int. J. Robotics Res., 2017

Deep reinforcement learning algorithms for steering an underactuated ship.

Proceedings of the 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2017

A Functional Optimization Method for Continuous Domains.

Proceedings of the Industrial Networks and Intelligent Systems, 2017

A Covariance Matrix Adaptation Evolution Strategy for Direct Policy Search in Reproducing Kernel Hilbert Space.

Viet-Hung Dang

Proceedings of The 9th Asian Conference on Machine Learning, 2017

2016

Bayes-adaptive hierarchical MDPs.

SeungGwan Lee

Appl. Intell., 2016

Policy Search in Reproducing Kernel Hilbert Space.

Peter Englert

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Relational activity processes for modeling concurrent cooperation.

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

2015

POMDP manipulation via trajectory optimization.

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Touch based POMDP manipulation via sequential submodular optimization.

Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

Hierarchical Monte-Carlo Planning.

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Efficient Interactive Multiclass Learning from Binary Feedback.

ACM Trans. Interact. Intell. Syst., 2014

Approximate planning for bayesian hierarchical reinforcement learning.

Appl. Intell., 2014

Model-Based Relational RL When Object Existence is Partially Observable.

Proceedings of the 31st International Conference on Machine Learning, 2014

Monte carlo bayesian hierarchical reinforcement learning.

Hung Quoc Ngo

Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013

Monte-Carlo tree search for Bayesian reinforcement learning.

Appl. Intell., 2013

Learning via human feedback in continuous state and action spaces.

Appl. Intell., 2013

Upper Confidence Weighted Learning for Efficient Exploration in Multiclass Prediction with Binary Feedback.

Proceedings of the IJCAI 2013, 2013

Reasoning with Uncertainties Over Existence of Objects.

Proceedings of the 2013 AAAI Fall Symposia, Arlington, Virginia, USA, November 15-17, 2013, 2013

2012

Monte Carlo Tree Search for Bayesian Reinforcement Learning.

Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Reinforcement learning combined with human feedback in continuous state and action spaces.

Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012

Learning via Human Feedback in Continuous State and Action Spaces.

Proceedings of the Robots Learning Interactively from Human Teachers, 2012

2011

Hessian matrix distribution for Bayesian policy gradient reinforcement learning.

Hwanjo Yu

Inf. Sci., 2011

Nomogram Visualization for Ranking Support Vector Machine.

Nguyen Thi Thanh Thuy

Nguyen Thi Ngoc Vinh

Proceedings of the Advances in Neural Networks - ISNN 2011, 2011

2010

Policy Gradient Based Semi-Markov Decision Problems: Approximation and Estimation Errors.

SeungGwan Lee

IEICE Trans. Inf. Syst., 2010

Monte Carlo Value Iteration for Continuous-State POMDPs.

Proceedings of the Algorithmic Foundations of Robotics IX, 2010

2009

Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks.

IEICE Trans. Commun., 2009

Probabilistic Ranking Support Vector Machine.

Nguyen Thi Thanh Thuy

Nguyen Hoang Viet

Proceedings of the Advances in Neural Networks, 2009

VRIFA: a nonlinear SVM visualization tool using nomogram and localized radial basis function (LRBF) kernels.

Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008

Policy Gradient Semi-markov Decision Process.

Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2008), 2008

Policy Gradient SMDP for Resource Allocation and Routing in Integrated Services Networks.

Nguyen Hoang Viet

Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008

Efficient Distributed Sensor Dispatch in Mobile Sensor Network.

Proceedings of the 22nd International Conference on Advanced Information Networking and Applications, 2008

Obstacle Avoidance Path Planning for Mobile Robot Based on Multi Colony Ant Algorithm.

Proceedings of the First International Conference on Advances in Computer-Human Interaction, 2008

2007

Heuristic Search Based Exploration in Reinforcement Learning.

Proceedings of the Computational and Ambient Intelligence, 2007

Obstacle Avoidance Path Planning for Mobile Robot Based on Ant-Q Reinforcement Learning Algorithm.

Proceedings of the Advances in Neural Networks, 2007

Natural Gradient Policy for Average Cost SMDP Problem.
