Shunyu Liu

Orcid: 0000-0003-0584-9129

Affiliations:
  • Nanyang Technological University, Singapore
  • Zhejiang University, Hangzhou, China (PhD 2024)
  • Sun Yat-sen University, Guangzhou, China (former)


According to our database1, Shunyu Liu authored at least 64 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
VeriGUI: Verifiable Long-Chain GUI Dataset.
CoRR, August, 2025

SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding.
CoRR, June, 2025

MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models.
CoRR, June, 2025

Intra-Trajectory Consistency for Reward Modeling.
CoRR, June, 2025

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning.
CoRR, June, 2025

BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning.
CoRR, June, 2025

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data.
CoRR, May, 2025

VORTA: Efficient Video Diffusion via Routing Sparse Attention.
CoRR, May, 2025

Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL.
CoRR, May, 2025

Curricular Subgoals for Inverse Reinforcement Learning.
IEEE Trans. Intell. Transp. Syst., March, 2025

Reinforced Model Merging.
CoRR, March, 2025

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization.
CoRR, March, 2025

A Survey of Direct Preference Optimization.
CoRR, March, 2025

Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems.
CoRR, March, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning.
CoRR, February, 2025

Reasoning with Reinforced Functional Token Tuning.
CoRR, February, 2025

Utilizing RBC system for taxation policy evaluation: An adaptive interaction framework based on deep reinforcement learning.
Expert Syst. Appl., 2025

Disentangled Condensation for Large-scale Graphs.
Proceedings of the ACM on Web Conference 2025, 2025

Powerformer: A Section-adaptive Transformer for Power Flow Adjustment.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

CADP: Towards Better Centralized Learning for Decentralized Execution in MARL.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

From GNNs to Trees: Multi-Granular Interpretability for Graph Neural Networks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Supervised Optimism Correction: Be Confident When LLMs Are Sure.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Cooperative Policy Agreement: Learning Diverse Policy for Offline MARL.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Holistic Semantic Representation for Navigational Trajectory Generation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation.
IEEE Trans. Knowl. Data Eng., November, 2024

Progressive decision-making framework for power system topology control.
Expert Syst. Appl., January, 2024

Multi-agent Continuous Control with Generative Flow Networks.
Neural Networks, 2024

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search.
CoRR, 2024

Odyssey: Empowering Agents with Open-World Skills.
CoRR, 2024

Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map.
CoRR, 2024

Advantage-Aware Policy Optimization for Offline Reinforcement Learning.
CoRR, 2024

Disentangled Condensation for Large-scale Graphs.
CoRR, 2024

COLA: Cross-city Mobility Transformer for Human Trajectory Simulation.
Proceedings of the ACM on Web Conference 2024, 2024

Simple Graph Condensation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Multi-Channel Graph Fusion Representation for Tabular Data Imputation.
Proceedings of the International Joint Conference on Neural Networks, 2024

Improving Adversarial Robustness via Feature Pattern Consistency Constraint.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Unified Mask Graph Modeling for Incomplete Tabular Learning.
Proceedings of the Neural Information Processing - 31st International Conference, 2024

Learning a Mini-Batch Graph Transformer via Two-Stage Interaction Augmentation.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework.
IEEE Trans. Syst. Man Cybern. Syst., December, 2023

Distribution Knowledge Embedding for Graph Pooling.
IEEE Trans. Knowl. Data Eng., August, 2023

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning.
CoRR, 2023

Adversarial Erasing with Pruned Elements: Towards Better Graph Lottery Ticket.
CoRR, 2023

Curricular Subgoals for Inverse Reinforcement Learning.
CoRR, 2023

Improving Expressivity of GNNs with Subgraph-specific Factor Embedded Normalization.
CoRR, 2023

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
CoRR, 2023

Lookaround Optimizer: k steps around, 1 step average.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Expressivity of GNNs with Subgraph-specific Factor Embedded Normalization.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Attribution Guided Layerwise Knowledge Amalgamation from Graph Neural Networks.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

Message-passing Selection: Towards Interpretable GNNs for Graph Classification.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Adversarial Erasing with Pruned Elements: Towards Better Graph Lottery Tickets.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges.
CoRR, 2022

Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework.
CoRR, 2022

Distribution-Aware Graph Representation Learning for Transient Stability Assessment of Power System.
Proceedings of the International Joint Conference on Neural Networks, 2022

2021
Distribution Knowledge Embedding for Graph Pooling.
CoRR, 2021

Imbalanced Sample Generation and Evaluation for Power System Transient Stability Using CTGAN.
ICO, 2021

2018
The Recognition of Driving Action Based on EEG Signals Using Wavelet-CSP Algorithm.
Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018


  Loading...