Lu Wang

Orcid: 0000-0002-7305-1496

Affiliations:
  • Microsoft, Beijing, China
  • East China Normal University, School of Computer Science and Technology, Shanghai, China


According to our database1, Lu Wang authored at least 68 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
AdaptFlow: Adaptive Workflow Optimization via Meta-Learning.
CoRR, August, 2025

WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework.
CoRR, August, 2025

LettinGo: Explore User Profile Generation for Recommendation System.
CoRR, June, 2025

Rebalancing Discriminative Responses for Knowledge Tracing.
ACM Trans. Inf. Syst., May, 2025

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Text2Grad: Reinforcement Learning from Natural Language Feedback.
CoRR, May, 2025

RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning.
CoRR, May, 2025

UFO2: The Desktop AgentOS.
CoRR, April, 2025

Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
CoRR, February, 2025

VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model.
CoRR, February, 2025

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance.
CoRR, February, 2025

Large Action Models: From Inception to Implementation.
Trans. Mach. Learn. Res., 2025

Te-PID: An Adaptive Erasure Coding Temperature Management System for Optimized Cloud Storage.
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025

Self-Evolved Reward Learning for LLMS.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RuAG: Learned-rule-augmented Generation for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Large Action Models: From Inception to Implementation.
CoRR, 2024

Token-level Proximal Policy Optimization for Query Generation.
CoRR, 2024

Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents.
CoRR, 2024

Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation.
CoRR, 2024

Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning.
CoRR, 2024

Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides.
CoRR, 2024

Contrastive Learning with Negative Sampling Correction.
CoRR, 2024

COIN: Chance-Constrained Imitation Learning for Uncertainty-aware Adaptive Resource Oversubscription Policy.
CoRR, 2024

Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning.
CoRR, 2024

Interpretable Imitation Learning with Dynamic Causal Relations.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation.
Proceedings of the Uncertainty in Artificial Intelligence, 2024

SELF-GUARD: Empower the LLM to Safeguard Itself.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

COIN: Chance-Constrained Imitation Learning for Safe and Adaptive Resource Oversubscription under Uncertainty.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
TaskWeaver: A Code-First Agent Framework.
CoRR, 2023

Dynamic DAG Discovery for Interpretable Imitation Learning.
CoRR, 2023

Diffusion-based Time Series Data Imputation for Microsoft 365.
CoRR, 2023

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models.
CoRR, 2023

Reinforcement Logic Rule Learning for Temporal Point Processes.
CoRR, 2023

Introspective Tips: Large Language Model for In-Context Decision Making.
CoRR, 2023

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.
CoRR, 2023

Conservative State Value Estimation for Offline Reinforcement Learning.
CoRR, 2023

Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning.
Proceedings of the ACM Web Conference 2023, 2023

Multi-Agent Reinforcement Learning with Shared Policy for Cloud Quota Management Problem.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Diffusion-Based Time Series Data Imputation for Cloud Failure Prediction at Microsoft 365.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Conservative State Value Estimation for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Contextual Self-attentive Temporal Point Process for Physical Decommissioning Prediction of Cloud Assets.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

FedSkill: Privacy Preserved Interpretable Skill Learning via Imitation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Hierarchical Multiagent Reinforcement Learning for Allocating Guaranteed Display Ads.
IEEE Trans. Neural Networks Learn. Syst., 2022

Spot Virtual Machine Eviction Prediction in Microsoft Cloud.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

An empirical investigation of missing data handling in cloud node failure prediction.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

T-SMOTE: Temporal-oriented Synthetic Minority Oversampling Technique for Imbalanced Time Series Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Explaining Point Processes by Learning Interpretable Temporal Logic Rules.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
TCL: Transformer-based Dynamic Graph Modelling via Contrastive Learning.
CoRR, 2021

2020
Adversarial Cooperative Imitation Learning for Dynamic Treatment Regimes✱.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Infinite-horizon Off-Policy Policy Evaluation with Multiple Behavior Policies.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Learning Robust Representations with Graph Denoising Policy Network.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Continuous Patient-Centric Sequence Generation via Sequentially Coupled Adversarial Learning.
Proceedings of the Database Systems for Advanced Applications, 2019

2018
Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Personalized Prescription for Comorbidity.
Proceedings of the Database Systems for Advanced Applications, 2018

2014
Maximizing Multi-scale Spatial Statistical Discrepancy.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014


  Loading...