Quan Liu

Orcid: 0000-0002-8710-1810

Affiliations:

Soochow University, School of Computer Science and Technology, Provincial Key Laboratory for Computer Information Processing Technology, Suzhou, China

According to our database¹, Quan Liu authored at least 82 papers between 2007 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

LinFa-Q: Accurate Q-learning with linear function approximation.

[BibT_eX]

[DOI]

Neurocomputing, 2025

A Lyapunov-Based Convex Optimal Control Approach via Input Convex Transformer.

[BibT_eX]

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Offline-to-Online: Case-Based Knowledge Distillation with Large Language Models for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Case-Based Reasoning Research and Development, 2025

Diverse Collaboration in Multi-Agent Reinforcement Learning via Self-Adaptive Method.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Personalized federated reinforcement learning: Balancing personalization and experience sharing via distance constraint.

[BibT_eX]

[DOI]

Expert Syst. Appl., March, 2024

RLUC: Strengthening robustness by attaching constraint considerations to policy network.

[BibT_eX]

[DOI]

Expert Syst. Appl., March, 2024

Taking complementary advantages: Improving exploration via double self-imitation learning in procedurally-generated environments.

[BibT_eX]

[DOI]

Expert Syst. Appl., March, 2024

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces.

[BibT_eX]

[DOI]

Expert Syst. Appl., March, 2024

Deep Luenberger observer-based consistency tracking for nonlinear heterogeneous multi-agent systems with uncertain drift dynamics.

[BibT_eX]

[DOI]

Renyang You

Quan Liu

Knowl. Based Syst., 2024

Balanced Subgoals Generation in Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Sifeng Tong

Quan Liu

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Multi-Agent Self-Motivated Learning via Role Representation.

[BibT_eX]

[DOI]

Yuchao Jin

Quan Liu

Proceedings of the International Joint Conference on Neural Networks, 2024

Offline Reinforcement Learning Based on Next State Supervision.

[BibT_eX]

[DOI]

Jie Yan

Quan Liu

Lihua Zhang

Proceedings of the IEEE International Conference on Acoustics, 2024

Offline Reinforcement Learning with Generative Adversarial Networks and Uncertainty Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Offline Reinforcement Learning with Policy Guidance and Uncertainty Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Deep Deterministic Strategy Gradient Method Using Plot Experience Playback.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE/ACIS International Conference on Computer and Information Science, 2024

2023

Generalized gradient emphasis learning for off-policy evaluation and control with function approximation.

[BibT_eX]

[DOI]

Neural Comput. Appl., November, 2023

Hierarchical reinforcement learning with adaptive scheduling for robot control.

[BibT_eX]

[DOI]

Zhigang Huang

Quan Liu

Fei Zhu

Eng. Appl. Artif. Intell., November, 2023

Addressing implicit bias in adversarial imitation learning with mutual information.

[BibT_eX]

[DOI]

Neural Networks, October, 2023

Temporal-difference emphasis learning with regularized correction for off-policy evaluation and control.

[BibT_eX]

[DOI]

Appl. Intell., September, 2023

A stable actor-critic algorithm for solving robotic tasks with multiple constraints.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., August, 2023

Learning fair representations for accuracy parity.

[BibT_eX]

[DOI]

Eng. Appl. Artif. Intell., March, 2023

Cosine Similarity Based Representation Learning for Adversarial Imitation Learning.

[BibT_eX]

[DOI]

Xiongzhen Zhang

Quan Liu

Lihua Zhang

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

Self-adaptive Inverse Soft-Q Learning for Imitation.

[BibT_eX]

[DOI]

Zhuo Wang

Quan Liu

Xiongzhen Zhang

Proceedings of the Neural Information Processing - 30th International Conference, 2023

Efficient Collaboration via Interaction Information in Multi-agent System.

[BibT_eX]

[DOI]

Meilong Shi

Quan Liu

Zhigang Huang

Proceedings of the Neural Information Processing - 30th International Conference, 2023

A Perturbation-Based Policy Distillation Framework with Generative Adversarial Nets.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Learning Unbiased Rewards with Mutual Information in Adversarial Imitation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Explore the weakness: Instructive exploration adversarial robust reinforcement learning.

[BibT_eX]

[DOI]

Chunyang Wu

Fei Zhu

Quan Liu

J. King Saud Univ. Comput. Inf. Sci., 2022

Best-in-class imitation: Non-negative positive-unlabeled imitation learning from imperfect demonstrations.

[BibT_eX]

[DOI]

Inf. Sci., 2022

Learning fair representations by separating the relevance of potential information.

[BibT_eX]

[DOI]

Inf. Process. Manag., 2022

Improving deep reinforcement learning by safety guarding model via hazardous experience planning.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2022

Vital node searcher: find out critical node measure with deep reinforcement learning.

[BibT_eX]

[DOI]

Guanting Du

Fei Zhu

Quan Liu

Connect. Sci., 2022

Master-Slave Policy Collaboration for Actor-Critic Methods.

[BibT_eX]

[DOI]

Xiaomu Li

Quan Liu

Proceedings of the International Joint Conference on Neural Networks, 2022

2021

Self-guided deep deterministic policy gradient with multi-actor.

[BibT_eX]

[DOI]

Hongming Chen

Quan Liu

Shan Zhong

Neural Comput. Appl., 2021

ARAIL: Learning to rank from incomplete demonstrations.

[BibT_eX]

[DOI]

Inf. Sci., 2021

Gradient temporal-difference learning for off-policy evaluation using emphatic weightings.

[BibT_eX]

[DOI]

Inf. Sci., 2021

Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification.

[BibT_eX]

[DOI]

IEEE CAA J. Autom. Sinica, 2021

Improving exploration efficiency of deep reinforcement learning through samples produced by generative model.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2021

2020

A resource retrieval method of multimedia recommendation system based on deep learning.

[BibT_eX]

[DOI]

Int. J. Auton. Adapt. Commun. Syst., 2020

Multi-agent cooperation Q-learning algorithm based on constrained Markov Game.

[BibT_eX]

[DOI]

Comput. Sci. Inf. Syst., 2020

2019

基于视觉注意力机制的异步优势行动者-评论家算法 (Asynchronous Advantage Actor-Critic Algorithm with Visual Attention Mechanism).

[BibT_eX]

[DOI]

计算机科学, 2019

一种基于生成对抗网络的强化学习算法 (Reinforcement Learning Algorithm Based on Generative Adversarial Networks).

[BibT_eX]

[DOI]

计算机科学, 2019

Efficient reinforcement learning in continuous state and action spaces with Dyna and policy approximation.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2019

SARSA based access control with approximation by TileCoding.

[BibT_eX]

[DOI]

Comput. Sci. Inf. Syst., 2019

Residual Sarsa algorithm with function approximation.

[BibT_eX]

[DOI]

Clust. Comput., 2019

Safe Q-Learning Method Based on Constrained Markov Decision Processes.

[BibT_eX]

[DOI]

IEEE Access, 2019

2018

Single Trajectory Learning: Exploration Versus Exploitation.

[BibT_eX]

[DOI]

Int. J. Pattern Recognit. Artif. Intell., 2018

Policy Space Noise in Deep Deterministic Policy Gradient.

[BibT_eX]

[DOI]

Yan Yan

Quan Liu

Proceedings of the Neural Information Processing - 25th International Conference, 2018

Deep Deterministic Policy Gradient with Clustered Prioritized Sampling.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 25th International Conference, 2018

Accurate Q-Learning.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 25th International Conference, 2018

2017

Efficient actor-critic algorithm with dual piecewise model learning.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Symposium Series on Computational Intelligence, 2017

2016

Reasoning and predicting POMDP planning complexity via covering numbers.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2016

Learn to human-level control in dynamic environment using incremental batch interrupting temporal abstraction.

[BibT_eX]

[DOI]

Comput. Sci. Inf. Syst., 2016

Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.

[BibT_eX]

[DOI]

Shan Zhong

Quan Liu

Qi-ming Fu

Comput. Intell. Neurosci., 2016

Policy graph pruning and optimization in Monte Carlo Value Iteration for continuous-state POMDPs.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence, 2016

A Kernel-Based Sarsa( \lambda ) Algorithm with Clustering-Based Sample Sparsification.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Deep Q-Learning with Prioritized Sampling.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Sparse Kernel-Based Least Squares Temporal Difference with Prioritized Sweeping.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Covering Number: Analyses for Approximate Continuous-state POMDP Planning (Extended Abstract).

[BibT_eX]

[DOI]

Zongzhang Zhang

Quan Liu

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015

Learning topic of dynamic scene using belief propagation and weighted visual words approach.

[BibT_eX]

[DOI]

Soft Comput., 2015

Human-level moving object recognition from traffic video.

[BibT_eX]

[DOI]

Comput. Sci. Inf. Syst., 2015

Protein-Protein Interaction Network Constructing Based on Text Mining and Reinforcement Learning with Application to Prostate Cancer.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

Spatiotemporal Saliency Detection Using Slow Feature Analysis and Spatial Information for Dynamic Scenes.

[BibT_eX]

[DOI]

Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015

Intelligent Model Learning Based on Variance for Bayesian Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

A Bayesian Sarsa Learning Algorithm with Bandit-Based Method.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Trajectory Sampling Value Iteration: Improved Dyna Search for MDPs.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

2013

A Gradient Descent Sarsa(λ) Algorithm Based on the Adaptive Reward-shaping Mechanism.

[BibT_eX]

[DOI]

Intell. Autom. Soft Comput., 2013

The second order temporal difference error for Sarsa(λ).

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2013

2012

A parallel scheduling algorithm for reinforcement learning in large state space.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2012

2011

Relevance feedback techniques and genetic algorithm for image retrieval based on multiple features.

[BibT_eX]

[DOI]

Int. J. Model. Identif. Control., 2011

Double elite co-evolutionary genetic algorithm.

[BibT_eX]

[DOI]

Int. J. Comput. Sci. Eng., 2011

2010

A Data Aggregation Algorithm Based on Splay Tree for Wireless Sensor Networks.

[BibT_eX]

[DOI]

J. Comput., 2010

2009

An Aggregation Tree Approach for Event Detection in Wireless Sensor Networks.

[BibT_eX]

[DOI]

J. Softw., 2009

An Efficient Strategy for Enhancing Robustness and Immunization in Wireless Sensor Networks.

[BibT_eX]

[DOI]

J. Networks, 2009

A Method to Automatically Discover and Classify Deep Web Data Source Using Multi-Classifier.

[BibT_eX]

[DOI]

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

Correlated-Clustering Frame: A Holistic Method of Deep Web Schema Matching Based on Data Mining.

[BibT_eX]

[DOI]

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

A Reinforcement Learning Algorithm Based on Minimum State Method and Average Reward.

[BibT_eX]

[DOI]

Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

2007

An Tableau Automated Theorem Proving Method Using Logical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Computation and Intelligence, 2007

A Deep Web Data Integration System For Book Searching Domain.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Intelligent Information Technology Application, 2007

The Design and Implementation of a Topic-Driven Crawler.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Intelligent Information Technology Application, 2007

Study on Competitive Intelligence System based on Web.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Intelligent Information Technology Application, 2007

A Method of Ontology Mapping Based on Subtree Kernel.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Intelligent Information Technology Application, 2007

A Method of Ontology Mapping Based on Instance.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007

Quan Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...