Lei Song

Orcid: 0000-0003-2552-0443

Affiliations:
  • Microsoft Research Asia, China (since 2020)
  • JD.com, China (former)
  • Chinese Academy of Sciences, Institute of Software, State Key Laboratory of Computer Science, Beijing, China (former)
  • University of Technology Sydney, Quantum Computation and Intelligent Systems, Australia (former)
  • Saarland University, Department of Computer Science, Saarbrücken, Germany (2012 - 2014)
  • Max Planck Institute for Informatics, Saarbrücken, Germany (2012 - 2014)
  • IT University of Copenhagen, Denmark (2009 - 2012)
  • Shanghai Jiao Tong University, China (2009 - 2012)


According to our database1, Lei Song authored at least 47 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Sample-efficient LLM Optimization with Reset Replay.
CoRR, August, 2025

Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data.
CoRR, March, 2025

PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation.
CoRR, January, 2025

Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning.
Trans. Mach. Learn. Res., 2025

Graph Neural Network Enhanced Retrieval for Question Answering of Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Efficient Discovery of Pareto Front for Multi-Objective Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning.
Trans. Mach. Learn. Res., 2024

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front.
CoRR, 2024

Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention.
CoRR, 2024

Graph Neural Network Enhanced Retrieval for Question Answering of LLMs.
CoRR, 2024

Protecting Your LLMs with Information Bottleneck.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Diversification of Adaptive Policy for Effective Offline Reinforcement Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

TimeX++: Learning Time-Series Explanations with Information Bottleneck.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Whittle Index with Multiple Actions and State Constraint for Inventory Management.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning.
Proceedings of the IEEE Conference on Games, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Model Checking for Probabilistic Multiagent Systems.
J. Comput. Sci. Technol., September, 2023

Pre-Trained Large Language Models for Industrial Control.
CoRR, 2023

A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management.
CoRR, 2023

H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem.
CoRR, 2023

Robust Situational Reinforcement Learning in Face of Context Disturbances.
Proceedings of the International Conference on Machine Learning, 2023

H-TSP: Hierarchically Solving the Large-Scale Traveling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Pointerformer: Deep Reinforced Multi-Pointer Transformer for the Traveling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management.
CoRR, 2022

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
Proceedings of the IEEE International Conference on Data Mining, 2022

2018
Probabilistic bisimulation for realistic schedulers.
Acta Informatica, 2018

Model Checking Probabilistic Epistemic Logic for Probabilistic Multiagent Systems.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2016
Reward-Bounded Reachability Probability for Uncertain Weighted MDPs.
Proceedings of the Verification, Model Checking, and Abstract Interpretation, 2016

Verify LTL with Fairness Assumptions Efficiently.
Proceedings of the 23rd International Symposium on Temporal Representation and Reasoning, 2016

Compositional Bisimulation Minimization for Interval Markov Decision Processes.
Proceedings of the Language and Automata Theory and Applications, 2016

2015
Distribution-based Bisimulation and Bisimulation Metric in Probabilistic Automata.
CoRR, 2015

Planning for Stochastic Games with Co-Safe Objectives.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

A Simple Probabilistic Extension of Modal Mu-calculus.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Probabilistic Bisimulation for Realistic Schedulers.
Proceedings of the FM 2015: Formal Methods, 2015

Decentralized Bisimulation for Multiagent Systems.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

2014
Incremental Bisimulation Abstraction Refinement.
ACM Trans. Embed. Comput. Syst., 2014

Bisimulations and Logical Characterizations on Continuous-Time Markov Decision Processes.
Proceedings of the Verification, Model Checking, and Abstract Interpretation, 2014

Probably safe or live.
Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS), 2014

2013
Bisimulations Meet PCTL Equivalences for Probabilistic Automata
Log. Methods Comput. Sci., 2013

Revisiting Weak Simulation for Substochastic Markov Chains.
Proceedings of the Quantitative Evaluation of Systems - 10th International Conference, 2013

2012
The Branching Time Spectrum for Continuous-time MDPs
CoRR, 2012

Late Weak Bisimulation for Markov Automata
CoRR, 2012

Broadcast Abstraction in a Stochastic Calculus for Mobile Networks.
Proceedings of the Theoretical Computer Science, 2012

2011
A Stochastic Broadcast Pi-Calculus
Proceedings of the Proceedings Ninth Workshop on Quantitative Aspects of Programming Languages, 2011

2010
Probabilistic Mobility Models for Mobile and Wireless Networks.
Proceedings of the Theoretical Computer Science, 2010


  Loading...