Quan Liu

Orcid: 0000-0002-8710-1810

Affiliations:
  • Soochow University, School of Computer Science and Technology, Provincial Key Laboratory for Computer Information Processing Technology, Suzhou, China


According to our database1, Quan Liu authored at least 82 papers between 2007 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
LinFa-Q: Accurate Q-learning with linear function approximation.
Neurocomputing, 2025

A Lyapunov-Based Convex Optimal Control Approach via Input Convex Transformer.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Offline-to-Online: Case-Based Knowledge Distillation with Large Language Models for Reinforcement Learning.
Proceedings of the Case-Based Reasoning Research and Development, 2025

Diverse Collaboration in Multi-Agent Reinforcement Learning via Self-Adaptive Method.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Personalized federated reinforcement learning: Balancing personalization and experience sharing via distance constraint.
Expert Syst. Appl., March, 2024

RLUC: Strengthening robustness by attaching constraint considerations to policy network.
Expert Syst. Appl., March, 2024

Taking complementary advantages: Improving exploration via double self-imitation learning in procedurally-generated environments.
Expert Syst. Appl., March, 2024

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces.
Expert Syst. Appl., March, 2024

Deep Luenberger observer-based consistency tracking for nonlinear heterogeneous multi-agent systems with uncertain drift dynamics.
Knowl. Based Syst., 2024

Balanced Subgoals Generation in Hierarchical Reinforcement Learning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Multi-Agent Self-Motivated Learning via Role Representation.
Proceedings of the International Joint Conference on Neural Networks, 2024

Offline Reinforcement Learning Based on Next State Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2024

Offline Reinforcement Learning with Generative Adversarial Networks and Uncertainty Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Offline Reinforcement Learning with Policy Guidance and Uncertainty Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Deep Deterministic Strategy Gradient Method Using Plot Experience Playback.
Proceedings of the 24th IEEE/ACIS International Conference on Computer and Information Science, 2024

2023
Generalized gradient emphasis learning for off-policy evaluation and control with function approximation.
Neural Comput. Appl., November, 2023

Hierarchical reinforcement learning with adaptive scheduling for robot control.
Eng. Appl. Artif. Intell., November, 2023

Addressing implicit bias in adversarial imitation learning with mutual information.
Neural Networks, October, 2023

Temporal-difference emphasis learning with regularized correction for off-policy evaluation and control.
Appl. Intell., September, 2023

A stable actor-critic algorithm for solving robotic tasks with multiple constraints.
Frontiers Comput. Sci., August, 2023

Learning fair representations for accuracy parity.
Eng. Appl. Artif. Intell., March, 2023

Cosine Similarity Based Representation Learning for Adversarial Imitation Learning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

Self-adaptive Inverse Soft-Q Learning for Imitation.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

Efficient Collaboration via Interaction Information in Multi-agent System.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

A Perturbation-Based Policy Distillation Framework with Generative Adversarial Nets.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning Unbiased Rewards with Mutual Information in Adversarial Imitation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Explore the weakness: Instructive exploration adversarial robust reinforcement learning.
J. King Saud Univ. Comput. Inf. Sci., 2022

Best-in-class imitation: Non-negative positive-unlabeled imitation learning from imperfect demonstrations.
Inf. Sci., 2022

Learning fair representations by separating the relevance of potential information.
Inf. Process. Manag., 2022

Improving deep reinforcement learning by safety guarding model via hazardous experience planning.
Frontiers Comput. Sci., 2022

Vital node searcher: find out critical node measure with deep reinforcement learning.
Connect. Sci., 2022

Master-Slave Policy Collaboration for Actor-Critic Methods.
Proceedings of the International Joint Conference on Neural Networks, 2022

2021
Self-guided deep deterministic policy gradient with multi-actor.
Neural Comput. Appl., 2021

ARAIL: Learning to rank from incomplete demonstrations.
Inf. Sci., 2021

Gradient temporal-difference learning for off-policy evaluation using emphatic weightings.
Inf. Sci., 2021

Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification.
IEEE CAA J. Autom. Sinica, 2021

Improving exploration efficiency of deep reinforcement learning through samples produced by generative model.
Expert Syst. Appl., 2021

2020
A resource retrieval method of multimedia recommendation system based on deep learning.
Int. J. Auton. Adapt. Commun. Syst., 2020

Multi-agent cooperation Q-learning algorithm based on constrained Markov Game.
Comput. Sci. Inf. Syst., 2020

2019
基于视觉注意力机制的异步优势行动者-评论家算法 (Asynchronous Advantage Actor-Critic Algorithm with Visual Attention Mechanism).
计算机科学, 2019

一种基于生成对抗网络的强化学习算法 (Reinforcement Learning Algorithm Based on Generative Adversarial Networks).
计算机科学, 2019

Efficient reinforcement learning in continuous state and action spaces with Dyna and policy approximation.
Frontiers Comput. Sci., 2019

SARSA based access control with approximation by TileCoding.
Comput. Sci. Inf. Syst., 2019

Residual Sarsa algorithm with function approximation.
Clust. Comput., 2019

Safe Q-Learning Method Based on Constrained Markov Decision Processes.
IEEE Access, 2019

2018
Single Trajectory Learning: Exploration Versus Exploitation.
Int. J. Pattern Recognit. Artif. Intell., 2018

Policy Space Noise in Deep Deterministic Policy Gradient.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Deep Deterministic Policy Gradient with Clustered Prioritized Sampling.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Accurate Q-Learning.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

2017
Efficient actor-critic algorithm with dual piecewise model learning.
Proceedings of the 2017 IEEE Symposium Series on Computational Intelligence, 2017

2016
Reasoning and predicting POMDP planning complexity via covering numbers.
Frontiers Comput. Sci., 2016

Learn to human-level control in dynamic environment using incremental batch interrupting temporal abstraction.
Comput. Sci. Inf. Syst., 2016

Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
Comput. Intell. Neurosci., 2016

Policy graph pruning and optimization in Monte Carlo Value Iteration for continuous-state POMDPs.
Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence, 2016

A Kernel-Based Sarsa( \lambda ) Algorithm with Clustering-Based Sample Sparsification.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Deep Q-Learning with Prioritized Sampling.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Sparse Kernel-Based Least Squares Temporal Difference with Prioritized Sweeping.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Covering Number: Analyses for Approximate Continuous-state POMDP Planning (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
Learning topic of dynamic scene using belief propagation and weighted visual words approach.
Soft Comput., 2015

Human-level moving object recognition from traffic video.
Comput. Sci. Inf. Syst., 2015

Spatiotemporal Saliency Detection Using Slow Feature Analysis and Spatial Information for Dynamic Scenes.
Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015

Intelligent Model Learning Based on Variance for Bayesian Reinforcement Learning.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

A Bayesian Sarsa Learning Algorithm with Bandit-Based Method.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Trajectory Sampling Value Iteration: Improved Dyna Search for MDPs.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

2014
Protein-protein interaction network constructing based on text mining and reinforcement learning with application to prostate cancer.
Proceedings of the 2014 IEEE International Conference on Bioinformatics and Biomedicine, 2014

2013
A Gradient Descent Sarsa(λ) Algorithm Based on the Adaptive Reward-shaping Mechanism.
Intell. Autom. Soft Comput., 2013

The second order temporal difference error for Sarsa(λ).
Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2013

2012
A parallel scheduling algorithm for reinforcement learning in large state space.
Frontiers Comput. Sci., 2012

2011
Relevance feedback techniques and genetic algorithm for image retrieval based on multiple features.
Int. J. Model. Identif. Control., 2011

Double elite co-evolutionary genetic algorithm.
Int. J. Comput. Sci. Eng., 2011

2010
A Data Aggregation Algorithm Based on Splay Tree for Wireless Sensor Networks.
J. Comput., 2010

2009
An Aggregation Tree Approach for Event Detection in Wireless Sensor Networks.
J. Softw., 2009

An Efficient Strategy for Enhancing Robustness and Immunization in Wireless Sensor Networks.
J. Networks, 2009

A Method to Automatically Discover and Classify Deep Web Data Source Using Multi-Classifier.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

Correlated-Clustering Frame: A Holistic Method of Deep Web Schema Matching Based on Data Mining.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

A Reinforcement Learning Algorithm Based on Minimum State Method and Average Reward.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

2007
An Tableau Automated Theorem Proving Method Using Logical Reinforcement Learning.
Proceedings of the Advances in Computation and Intelligence, 2007

A Deep Web Data Integration System For Book Searching Domain.
Proceedings of the Workshop on Intelligent Information Technology Application, 2007

The Design and Implementation of a Topic-Driven Crawler.
Proceedings of the Workshop on Intelligent Information Technology Application, 2007

Study on Competitive Intelligence System based on Web.
Proceedings of the Workshop on Intelligent Information Technology Application, 2007

A Method of Ontology Mapping Based on Subtree Kernel.
Proceedings of the Workshop on Intelligent Information Technology Application, 2007

A Method of Ontology Mapping Based on Instance.
Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007


  Loading...