Qichao Zhang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Bibliography

2026
Learning Rollout from Sampling: An R1-Style Tokenized Traffic Simulation Model.
IEEE Robotics Autom. Lett., May, 2026

PerlAD: Towards Enhanced Closed-Loop End-to-End Autonomous Driving With Pseudo-Simulation-Based Reinforcement Learning.
IEEE Robotics Autom. Lett., May, 2026

Are Full Rollouts Necessary for On-Policy Distillation?
CoRR, May, 2026

Beyond Imitation: Learning Safe End-to-End Autonomous Driving from Hard Negatives.
CoRR, May, 2026

PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance.
CoRR, April, 2026

AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning.
CoRR, April, 2026

π-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data.
CoRR, April, 2026

Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning.
CoRR, April, 2026

Dynamic Dual-Granularity Skill Bank for Agentic RL.
CoRR, March, 2026

Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model.
CoRR, March, 2026

DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving.
CoRR, March, 2026

Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving.
CoRR, March, 2026

Learning from Mistakes: Post-Training for Driving VLA with Takeover Data.
CoRR, March, 2026

Mimir: Hierarchical Goal-Driven Diffusion With Uncertainty Propagation for End-to-End Autonomous Driving.
IEEE Robotics Autom. Lett., February, 2026

TakeAD: Preference-Based Post-Optimization for End-to-End Autonomous Driving With Expert Takeover Data.
IEEE Robotics Autom. Lett., February, 2026

MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving.
CoRR, February, 2026

PaperAudit-Bench: Benchmarking Error Detection in Research Papers for Critical Automated Peer Review.
CoRR, January, 2026

LTA-Gait: In-Network Temporal Activation of Large Vision Models for Gait Recognition.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving.
CoRR, December, 2025

TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning.
CoRR, December, 2025

IP3: Integrated Path-Guided Prediction and Planning for Safe Autonomous Driving.
IEEE Trans. Veh. Technol., November, 2025

CriticSearch: Fine-Grained Credit Assignment for Search Agents via a Retrospective Critic.
CoRR, November, 2025

Perception-Consistency Multimodal Large Language Models Reasoning via Caption-Regularized Policy Optimization.
CoRR, September, 2025

Perceptual Uncertainty-Aware Motion Planning for Autonomous Driving Based on Adaptive Heuristic Reinforcement Learning.
IEEE Trans. Intell. Transp. Syst., August, 2025

World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model.
CoRR, July, 2025

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning.
CoRR, June, 2025

ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving.
CoRR, May, 2025

Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL.
CoRR, May, 2025

Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation.
CoRR, March, 2025

Research on the impact of generative artificial intelligence (GenAI) on enterprise innovation performance: a knowledge management perspective.
J. Knowl. Manag., 2025

FusionNav: Enhancing Zero-Shot Object-Goal Navigation via 3D Semantic Fusion and Farsight Value Reasoning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2025

Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Salience-Invariant Consistent Policy Learning for Generalization in Visual Reinforcement Learning.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Consistency Policy with Categorical Critic for Autonomous Driving.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DeltaDiff: Reality-Driven Diffusion with Anchor Residuals for Faithful SR.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

World4Drive: End-to-End Autonomous Driving via Intention-Aware Physical Latent World Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RLAE: Reinforcement Learning-Assisted Ensemble for LLMs.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Conditional Goal-Oriented Trajectory Prediction for Interacting Vehicles.
IEEE Trans. Neural Networks Learn. Syst., December, 2024

Dream to Drive With Predictive Individual World Model.
IEEE Trans. Intell. Veh., December, 2024

Deep-Reinforcement-Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks.
IEEE Trans. Cogn. Dev. Syst., October, 2024

Prototypical Context-Aware Dynamics for Generalization in Visual Control With Model-Based Reinforcement Learning.
IEEE Trans. Ind. Informatics, September, 2024

Dynamic-Horizon Model-Based Value Estimation With Latent Imagination.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

Planning-Inspired Hierarchical Trajectory Prediction via Lateral-Longitudinal Decomposition for Autonomous Driving.
IEEE Trans. Intell. Veh., January, 2024

Analysis of Multi-GNSS Multipath for Parameter-Unified Autocorrelation-Based Mitigation and the Impact of Constellation Shifts.
Remote. Sens., 2024

Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving.
CoRR, 2024

PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning.
CoRR, 2024

User Response Modeling in Reinforcement Learning for Ads Allocation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

High-quality Synthetic Data is Efficient for Model-based Offline Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2024

MonoOcc: Digging into Monocular Semantic Occupancy Prediction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

2023
An Improved Carrier-Smoothing Code Algorithm for BDS Satellites with SICB.
Remote. Sens., November, 2023

Multi-task safe reinforcement learning for navigating intersections in dense traffic.
J. Frankl. Inst., November, 2023

Planning-inspired Hierarchical Trajectory Prediction for Autonomous Driving.
CoRR, 2023

STEPS: Joint Self-supervised Nighttime Image Enhancement and Depth Estimation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

PnP: Integrated Prediction and Planning for Interactive Lane Change in Dense Traffic.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

2022
TrajGen: Generating Realistic and Diverse Trajectories With Reactive and Feasible Agent Behaviors for Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., 2022

BiFNet: Bidirectional Fusion Network for Road Segmentation.
IEEE Trans. Cybern., 2022

Highway Lane Change Decision-Making via Attention-Based Deep Reinforcement Learning.
IEEE CAA J. Autom. Sinica, 2022

Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning.
CoRR, 2022

Conditional Goal-oriented Trajectory Prediction for Interacting Vehicles with Vectorized Representation.
CoRR, 2022

Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic.
CoRR, 2022

Guest Editorial on "Computational intelligence in analysis and integration of complex systems".
Complex Intell. Syst., 2022

Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space.
Artif. Intell. Rev., 2022

Offline Reinforcement Learning for Autonomous Driving with Real World Driving Data.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

GAP: Goal-Aware Prediction with Hierarchical Interactive Representation for Vehicle Trajectory.
Proceedings of the Data Mining and Big Data - 7th International Conference, 2022

2021
MORSE-STF: A Privacy Preserving Computation System.
CoRR, 2021

A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2021

IA-CNN: A generalised interpretable convolutional neural network with attention mechanism.
Proceedings of the International Joint Conference on Neural Networks, 2021

Hierarchical Reinforcement Learning-Based Policy Switching Towards Multi-Scenarios Autonomous Driving.
Proceedings of the International Joint Conference on Neural Networks, 2021

Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning.
Proceedings of the ICRAI 2021: 7th International Conference on Robotics and Artificial Intelligence, Guangzhou, China, November 19, 2021

Vehicle Trajectory Prediction Based on Graph Attention Network.
Proceedings of the Cognitive Systems and Information Processing, 2021

2020
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment.
IEEE Trans. Neural Networks Learn. Syst., 2020

Hierarchical optimal control for input-affine nonlinear systems through the formulation of Stackelberg game.
Inf. Sci., 2020

Revocable ciphertext-policy attribute-based encryption in data outsourcing systems from lattices.
Int. J. Embed. Syst., 2020

Blockchain-Based Cache Poisoning Security Protection and Privacy-Aware Access Control in NDN Vehicular Edge Computing Networks.
J. Grid Comput., 2020

Dynamic Horizon Value Estimation for Model-based Reinforcement Learning.
CoRR, 2020

Traceable and Weighted Attribute-Based Encryption Scheme in the Cloud Environment.
IEEE Access, 2020

RailNet: An Information Aggregation Network for Rail Track Segmentation.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

2019
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics.
IEEE Trans. Cybern., 2019

Graph Attention Memory for Visual Navigation.
CoRR, 2019

Securing ICN-Based UAV Ad Hoc Networks with Blockchain.
IEEE Commun. Mag., 2019

Reinforcement Learning and Deep Learning Based Lateral Control for Autonomous Driving [Application Notes].
IEEE Comput. Intell. Mag., 2019

Mixing Position Estimation of AIS Mixed Signal Based on Double-Sliding Window Method.
IEEE Access, 2019

Reinforcement Learning based Lane Change Decision-Making with Imaginary Sampling.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Model-Free Reinforcement Learning based Lateral Control for Lane Keeping.
Proceedings of the International Joint Conference on Neural Networks, 2019

Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints.
Proceedings of the International Joint Conference on Neural Networks, 2019

Cluster Synchronization of Nonlinearly Coupled Directed Network via Pinning Control Strategy.
Proceedings of the 15th IEEE International Conference on Control and Automation, 2019

2018
Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming.
IEEE Trans. Neural Networks Learn. Syst., 2018

Policy Iteration for H<sub>∞</sub> Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming.
IEEE Trans. Cybern., 2018

Multi-task learning for dangerous object detection in autonomous driving.
Inf. Sci., 2018

Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving.
CoRR, 2018

An Autonomous Driving Experience Platform with Learning-Based Functions.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2018

Model-Free Reinforcement Learning for Fully Cooperative Multi-Agent Graphical Games.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Visual Navigation with Actor-Critic Deep Reinforcement Learning.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

DeepSign: Deep Learning based Traffic Sign Recognition.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Reputation-Based Byzantine Fault-Tolerance for Consortium Blockchain.
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018

Value Iteration Algorithm for Optimal Consensus Control of Multi-agent Systems.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

An Efficient Network for Lane Segmentation.
Proceedings of the Cognitive Systems and Signal Processing - 4th International Conference, 2018

2017
Event-Triggered H<sub>∞</sub> Control for Continuous-Time Nonlinear System via Concurrent Learning.
IEEE Trans. Syst. Man Cybern. Syst., 2017

Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs.
Neurocomputing, 2017

Model-Free Optimal Control Based Intelligent Cruise Control with Hardware-in-the-Loop Demonstration [Research Frontier].
IEEE Comput. Intell. Mag., 2017

Event-triggered integral reinforcement learning for nonlinear continuous-time systems.
Proceedings of the 2017 IEEE Symposium Series on Computational Intelligence, 2017

Policy gradient methods with Gaussian process modelling acceleration.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

2016
Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics.
IEEE Trans. Syst. Man Cybern. Syst., 2016

Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics.
IEEE Trans. Cybern., 2016

Event-based input-constrained nonlinear H∞ state feedback with adaptive critic and neural implementation.
Neurocomputing, 2016

Convolutional fitted Q iteration for vision-based control problems.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Event-Triggered Adaptive Dynamic Programming for Uncertain Nonlinear Systems.
Proceedings of the Cognitive Systems and Signal Processing, 2016

2015
Event-Triggered H ∞ Control for Continuous-Time Nonlinear System.
Proceedings of the Advances in Neural Networks - ISNN 2015, 2015


  Loading...