Lei Song

ORCID: 0000-0003-2552-0443

Affiliations:

Microsoft Research Asia, China (since 2020)
JD.com, China (former)
Chinese Academy of Sciences, Institute of Software, State Key Laboratory of Computer Science, Beijing, China (former)
University of Technology Sydney, Quantum Computation and Intelligent Systems, Australia (former)
Saarland University, Department of Computer Science, Saarbrücken, Germany (2012 - 2014)
Max Planck Institute for Informatics, Saarbrücken, Germany (2012 - 2014)
IT University of Copenhagen, Denmark (2009 - 2012)
Shanghai Jiao Tong University, China (2009 - 2012)

According to our database¹, Lei Song authored at least 53 papers between 2010 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

Rethinking Reward Models for Multi-Domain Test-Time Scaling.

CoRR, October, 2025

PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images.

CoRR, September, 2025

In-Context Compositional Q-Learning for Offline Reinforcement Learning.

CoRR, September, 2025

The Illusion of Readiness: Stress Testing Large Frontier Models on Multimodal Medical Benchmarks.

CoRR, September, 2025

Sample-efficient LLM Optimization with Reset Replay.

CoRR, August, 2025

Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data.

CoRR, March, 2025

PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation.

CoRR, January, 2025

Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning.

Trans. Mach. Learn. Res., 2025

Graph Neural Network Enhanced Retrieval for Question Answering of Large Language Models.

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Unveiling Markov heads in Pretrained Language Models for Offline Reinforcement Learning.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

From Complex to Atomic: Enhancing Augmented Generation via Knowledge-Aware Dual Rewriting and Reasoning.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Efficient Discovery of Pareto Front for Multi-Objective Reinforcement Learning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Mildly Constrained Evaluation Policy for Offline Reinforcement Learning.

Trans. Mach. Learn. Res., 2024

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front.

CoRR, 2024

Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention.

CoRR, 2024

Graph Neural Network Enhanced Retrieval for Question Answering of LLMs.

CoRR, 2024

Protecting Your LLMs with Information Bottleneck.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Diversification of Adaptive Policy for Effective Offline Reinforcement Learning.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

TimeX++: Learning Time-Series Explanations with Information Bottleneck.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Whittle Index with Multiple Actions and State Constraint for Inventory Management.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning.

Proceedings of the IEEE Conference on Games, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Model Checking for Probabilistic Multiagent Systems.

J. Comput. Sci. Technol., September, 2023

Pre-Trained Large Language Models for Industrial Control.

CoRR, 2023

A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management.

CoRR, 2023

H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem.

CoRR, 2023

Robust Situational Reinforcement Learning in Face of Context Disturbances.

Proceedings of the International Conference on Machine Learning, 2023

H-TSP: Hierarchically Solving the Large-Scale Traveling Salesman Problem.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Pointerformer: Deep Reinforced Multi-Pointer Transformer for the Traveling Salesman Problem.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management.

CoRR, 2022

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.

Proceedings of the IEEE International Conference on Data Mining, 2022

2018

Probabilistic bisimulation for realistic schedulers.

Acta Informatica, 2018

Model Checking Probabilistic Epistemic Logic for Probabilistic Multiagent Systems.

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2016

Reward-Bounded Reachability Probability for Uncertain Weighted MDPs.

Vahid Hashemi

Holger Hermanns

Lei Song

Proceedings of the Verification, Model Checking, and Abstract Interpretation, 2016

Verify LTL with Fairness Assumptions Efficiently.

Proceedings of the 23rd International Symposium on Temporal Representation and Reasoning, 2016

Compositional Bisimulation Minimization for Interval Markov Decision Processes.

Proceedings of the Language and Automata Theory and Applications, 2016

2015

Distribution-based Bisimulation and Bisimulation Metric in Probabilistic Automata.

Yuan Feng

Lei Song

Lijun Zhang

CoRR, 2015

Planning for Stochastic Games with Co-Safe Objectives.

Lei Song

Yuan Feng

Lijun Zhang

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

A Simple Probabilistic Extension of Modal Mu-calculus.

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Probabilistic Bisimulation for Realistic Schedulers.

Proceedings of the FM 2015: Formal Methods, 2015

Decentralized Bisimulation for Multiagent Systems.

Lei Song

Yuan Feng

Lijun Zhang

Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

2014

Bisimulations and Logical Characterizations on Continuous-Time Markov Decision Processes.

Lei Song

Lijun Zhang

Jens Chr. Godskesen

Proceedings of the Verification, Model Checking, and Abstract Interpretation, 2014

Probably safe or live.

Joost-Pieter Katoen

Lei Song

Lijun Zhang

Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), 2014

2013

Revisiting Weak Simulation for Substochastic Markov Chains.

David N. Jansen

Lei Song

Lijun Zhang

Proceedings of the Quantitative Evaluation of Systems - 10th International Conference, 2013

Incremental Bisimulation Abstraction Refinement.

Lei Song