Siyuan Li

ORCID: 0000-0001-7965-598X

Affiliations:
  • Harbin Institute of Technology, Faculty of Computing, China
  • Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China (former)


According to our database, Siyuan Li authored at least 28 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Collab-Solver: Collaborative Solving Policy Learning for Mixed-Integer Linear Programming.
CoRR, August, 2025

DHEvo: Data-Algorithm Based Heuristic Evolution for Generalizable MILP Solving.
CoRR, July, 2025

Toward Automatic Market Making: An Imitative Reinforcement Learning Approach With Predictive Representation Learning.
IEEE Trans. Emerg. Top. Comput. Intell., June, 2025

Skywork Open Reasoner 1 Technical Report.
CoRR, May, 2025

Auxiliary Reward Generation With Transition Distance Representation Learning.
IEEE Trans. Autom. Sci. Eng., 2025

SkillTree: Explainable Skill-Based Deep Reinforcement Learning for Long-Horizon Control Tasks.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
A multi-agent learning framework for mixed-integer linear programming.
INFOR Inf. Syst. Oper. Res., November, 2024

IOB: integrating optimization transfer and behavior transfer for multi-policy reuse.
Auton. Agents Multi Agent Syst., June, 2024

An Imitative Reinforcement Learning Framework for Autonomous Dogfight.
CoRR, 2024

Auxiliary Reward Generation with Transition Distance Representation Learning.
CoRR, 2024

IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

MacMic: Executing Iceberg Orders via Hierarchical Reinforcement Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Robust Visual Imitation Learning with Inverse Dynamics Representations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Classifying ambiguous identities in hidden-role Stochastic games with multi-agent reinforcement learning.
Auton. Agents Multi Agent Syst., October, 2023

Learning to Solve Tasks with Exploring Prior Behaviours.
IROS, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.
Proceedings of the International Conference on Machine Learning, 2023

Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
IDRL: Identifying Identities in Multi-Agent Reinforcement Learning with Ambiguous Identities.
CoRR, 2022

CUP: Critic-Guided Policy Reuse.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Active Hierarchical Exploration with Stable Subgoal Representation Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Efficient Hierarchical Exploration with Stable Subgoal Representation Learning.
CoRR, 2021

Offline Reinforcement Learning with Reverse Model-based Imagination.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Subgoal Representations with Slow Dynamics.
Proceedings of the 9th International Conference on Learning Representations, 2021

2019
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Context-Aware Policy Reuse.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

