Lu Wang
Orcid: 0000-0002-7305-1496Affiliations:
- Microsoft, Beijing, China
- East China Normal University, School of Computer Science and Technology, Shanghai, China
According to our database1,
Lu Wang
authored at least 68 papers
between 2014 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
WarriorMath: Enhancing the Mathematical Ability of Large Language Models with a Defect-aware Framework.
CoRR, August, 2025
CoRR, June, 2025
ACM Trans. Inf. Syst., May, 2025
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning.
CoRR, May, 2025
Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
CoRR, February, 2025
VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model.
CoRR, February, 2025
CoRR, February, 2025
Trans. Mach. Learn. Res., 2025
Te-PID: An Adaptive Erasure Coding Temperature Management System for Optimized Cloud Storage.
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents.
CoRR, 2024
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation.
CoRR, 2024
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning.
CoRR, 2024
COIN: Chance-Constrained Imitation Learning for Uncertainty-aware Adaptive Resource Oversubscription Policy.
CoRR, 2024
Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning.
CoRR, 2024
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024
SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation.
Proceedings of the Uncertainty in Artificial Intelligence, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
COIN: Chance-Constrained Imitation Learning for Safe and Adaptive Resource Oversubscription under Uncertainty.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
CoRR, 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.
CoRR, 2023
Learning Cooperative Oversubscription for Cloud by Chance-Constrained Multi-Agent Reinforcement Learning.
Proceedings of the ACM Web Conference 2023, 2023
Multi-Agent Reinforcement Learning with Shared Policy for Cloud Quota Management Problem.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023
Diffusion-Based Time Series Data Imputation for Cloud Failure Prediction at Microsoft 365.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023
TraceDiag: Adaptive, Interpretable, and Efficient Root Cause Analysis on Large-Scale Microservice Systems.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Contextual Self-attentive Temporal Point Process for Physical Decommissioning Prediction of Cloud Assets.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
Hierarchical Multiagent Reinforcement Learning for Allocating Guaranteed Display Ads.
IEEE Trans. Neural Networks Learn. Syst., 2022
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022
An empirical investigation of missing data handling in cloud node failure prediction.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022
NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
T-SMOTE: Temporal-oriented Synthetic Minority Oversampling Technique for Imbalanced Time Series Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
2021
2020
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
2019
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019
Continuous Patient-Centric Sequence Generation via Sequentially Coupled Adversarial Learning.
Proceedings of the Database Systems for Advanced Applications, 2019
2018
Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018
Proceedings of the Database Systems for Advanced Applications, 2018
2014
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014