Shunyu Liu

Orcid: 0000-0003-0584-9129

Affiliations:

Nanyang Technological University, Singapore
Zhejiang University, Hangzhou, China (PhD 2024)
Sun Yat-sen University, Guangzhou, China (former)

According to our database¹, Shunyu Liu authored at least 68 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

A Survey on Agentic Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

CoVeR: Conformal Calibration for Versatile and Reliable Autoregressive Next-Token Prediction.

[BibT_eX]

[DOI]

CoRR, September, 2025

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, August, 2025

VeriGUI: Verifiable Long-Chain GUI Dataset.

[BibT_eX]

[DOI]

CoRR, August, 2025

SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding.

[BibT_eX]

[DOI]

CoRR, June, 2025

MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

Intra-Trajectory Consistency for Reward Modeling.

[BibT_eX]

[DOI]

CoRR, June, 2025

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, June, 2025

BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, June, 2025

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data.

[BibT_eX]

[DOI]

CoRR, May, 2025

VORTA: Efficient Video Diffusion via Routing Sparse Attention.

[BibT_eX]

[DOI]

CoRR, May, 2025

Bi-level Mean Field: Dynamic Grouping for Large-Scale MARL.

[BibT_eX]

[DOI]

CoRR, May, 2025

Curricular Subgoals for Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., March, 2025

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization.

[BibT_eX]

[DOI]

CoRR, March, 2025

A Survey of Direct Preference Optimization.

[BibT_eX]

[DOI]

CoRR, March, 2025

Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems.

[BibT_eX]

[DOI]

CoRR, March, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, February, 2025

Reasoning with Reinforced Functional Token Tuning.

[BibT_eX]

[DOI]

CoRR, February, 2025

Utilizing RBC system for taxation policy evaluation: An adaptive interaction framework based on deep reinforcement learning.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2025

Disentangled Condensation for Large-scale Graphs.

[BibT_eX]

[DOI]

Proceedings of the ACM on Web Conference 2025, 2025

Powerformer: A Section-adaptive Transformer for Power Flow Adjustment.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

Odyssey : Empowering Minecraft Agents with Open-World Skills.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

CADP: Towards Better Centralized Learning for Decentralized Execution in MARL.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

Reinforced Model Merging.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

From GNNs to Trees: Multi-Granular Interpretability for Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Supervised Optimism Correction: Be Confident When LLMs Are Sure.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Cooperative Policy Agreement: Learning Diverse Policy for Offline MARL.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Holistic Semantic Representation for Navigational Trajectory Generation.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Spatiotemporal-Augmented Graph Neural Networks for Human Mobility Simulation.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., November, 2024

Progressive decision-making framework for power system topology control.

[BibT_eX]

[DOI]

Expert Syst. Appl., January, 2024

Multi-agent Continuous Control with Generative Flow Networks.

[BibT_eX]

[DOI]

Neural Networks, 2024

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, 2024

Odyssey: Empowering Agents with Open-World Skills.

[BibT_eX]

[DOI]

CoRR, 2024

Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map.

[BibT_eX]

[DOI]

CoRR, 2024

Advantage-Aware Policy Optimization for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Disentangled Condensation for Large-scale Graphs.

[BibT_eX]

[DOI]

CoRR, 2024

COLA: Cross-city Mobility Transformer for Human Trajectory Simulation.

[BibT_eX]

[DOI]

Proceedings of the ACM on Web Conference 2024, 2024

Simple Graph Condensation.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Multi-Channel Graph Fusion Representation for Tabular Data Imputation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2024

Improving Adversarial Robustness via Feature Pattern Consistency Constraint.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Unified Mask Graph Modeling for Incomplete Tabular Learning.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 31st International Conference, 2024

Learning a Mini-Batch Graph Transformer via Two-Stage Interaction Augmentation.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023

Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework.

[BibT_eX]

[DOI]

IEEE Trans. Syst. Man Cybern. Syst., December, 2023

Distribution Knowledge Embedding for Graph Pooling.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., August, 2023

Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Adversarial Erasing with Pruned Elements: Towards Better Graph Lottery Ticket.

[BibT_eX]

[DOI]

CoRR, 2023

Curricular Subgoals for Inverse Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Improving Expressivity of GNNs with Subgraph-specific Factor Embedded Normalization.

[BibT_eX]

[DOI]

CoRR, 2023

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

[BibT_eX]

[DOI]

CoRR, 2023

Lookaround Optimizer: k steps around, 1 step average.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Expressivity of GNNs with Subgraph-specific Factor Embedded Normalization.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Attribution Guided Layerwise Knowledge Amalgamation from Graph Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 30th International Conference, 2023

Message-passing Selection: Towards Interpretable GNNs for Graph Classification.

[BibT_eX]

[DOI]

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Adversarial Erasing with Pruned Elements: Towards Better Graph Lottery Tickets.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges.

[BibT_eX]

[DOI]

CoRR, 2022

Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework.

[BibT_eX]

[DOI]

CoRR, 2022

Distribution-Aware Graph Representation Learning for Transient Stability Assessment of Power System.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2022

2021

Distribution Knowledge Embedding for Graph Pooling.

[BibT_eX]

[DOI]

CoRR, 2021

Imbalanced Sample Generation and Evaluation for Power System Transient Stability Using CTGAN.

[BibT_eX]

[DOI]

ICO, 2021

2018

The Recognition of Driving Action Based on EEG Signals Using Wavelet-CSP Algorithm.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018

Shunyu Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...