Kai Wang

Orcid: 0000-0002-2446-987X

Affiliations:

Georgia Institute of Technology, Atlanta, GA, USA
Massachusetts Institute of Technology, Cambridge, MA, USA (2023)
Harvard University, Cambridge, MA, USA (PhD 2023)
University of Southern California, Center for Artificial Intelligence in Society, Los Angeles, CA, USA (former)

According to our database¹, Kai Wang authored at least 40 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

Bilevel Optimization of Synthetic Trajectories for Multi-Turn LLM Fine-Tuning.

[BibT_eX]

[DOI]

CoRR, May, 2026

KOALA: Knowledge of Optimization and Learning Algorithms for Healthcare.

[BibT_eX]

[DOI]

Kai Wang

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Networked Restless Multi-Arm Bandits with Reinforcement Learning.

[BibT_eX]

[DOI]

Hanmo Zhang

Zenghui Sun

Kai Wang

CoRR, December, 2025

A Fully First-Order Layer for Differentiable Optimization.

[BibT_eX]

[DOI]

CoRR, December, 2025

Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets.

[BibT_eX]

[DOI]

Himadri S. Pandey

Kai Wang

Gian-Gabriel P. Garcia

CoRR, October, 2025

Diffusion-DFL: Decision-focused Diffusion Models for Stochastic Optimization.

[BibT_eX]

[DOI]

CoRR, October, 2025

Finding a Multiple Follower Stackelberg Equilibrium: A Fully First-Order Method.

[BibT_eX]

[DOI]

April Niu

Kai Wang

Juba Ziani

CoRR, September, 2025

Revealing Potential Biases in LLM-Based Recommender Systems in the Cold Start Setting.

[BibT_eX]

[DOI]

CoRR, August, 2025

Non-Stationary Restless Multi-Armed Bandits with Provable Guarantee.

[BibT_eX]

[DOI]

Yu-Heng Hung

Ping-Chun Hsieh

Kai Wang

CoRR, August, 2025

One-Step Flow Policy Mirror Descent.

[BibT_eX]

[DOI]

CoRR, July, 2025

Soft Diffusion Actor-Critic: Efficient Online Reinforcement Learning for Diffusion Policy.

[BibT_eX]

[DOI]

CoRR, February, 2025

What's in a Query: Polarity-Aware Distribution-Based Fair Ranking.

[BibT_eX]

[DOI]

Proceedings of the ACM on Web Conference 2025, 2025

What is the Right Notion of Distance between Predict-then-Optimize Tasks?

[BibT_eX]

[DOI]

Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

Efficient Online Reinforcement Learning for Diffusion Policy.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Primal-Dual Spectral Representation for Off-policy Evaluation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024

epiDAMIK 2024: The 7th International Workshop on Epidemiology meets Data Mining and Knowledge Discovery.

[BibT_eX]

[DOI]

Marie-Laure Charpignon

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

2023

Characterizing and Improving the Robustness of Predict-Then-Optimize Frameworks.

[BibT_eX]

[DOI]

Proceedings of the Decision and Game Theory for Security: 14th International Conference, 2023

Restless Multi-Armed Bandits for Maternal and Child Health: Results from Decision-Focused Learning.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Modeling Robustness in Decision-Focused Learning as a Stackelberg Game.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Optimistic Whittle Index Policy: Online Learning for Restless Bandits.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Care Domain.

[BibT_eX]

[DOI]

CoRR, 2022

Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling.

[BibT_eX]

[DOI]

CoRR, 2021

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dual-Mandate Patrols: Multi-Armed Bandits for Green Security.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Robust Spatial-Temporal Incident Prediction.

[BibT_eX]

[DOI]

Ayan Mukhopadhyay

Kai Wang

Andrew Perrault

Mykel J. Kochenderfer

Milind Tambe

Yevgeniy Vorobeychik

Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Automatically Learning Compact Quality-aware Surrogates for Optimization Problems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Active Screening on Recurrent Diseases Contact Networks with Uncertainty: A Reinforcement Learning Approach.

[BibT_eX]

[DOI]

Proceedings of the Multi-Agent-Based Simulation XXI - 21st International Workshop, 2020

Scalable Game-Focused Learning of Adversary Models: Data-to-Decisions in Network Security Games.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019

Improving GP-UCB Algorithm by Harnessing Decomposed Feedback.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Learning to Signal in the Goldilocks Zone: Improving Adversary Compliance in Security Games.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Mobile Game Theory with Street Gangs.

[BibT_eX]

[DOI]

P. Jeffrey Brantingham

Milind Tambe

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

DeepFP for Finding Nash Equilibrium in Continuous Action Spaces.

[BibT_eX]

[DOI]

Proceedings of the Decision and Game Theory for Security - 10th International Conference, 2019

Deep Fictitious Play for Games with Continuous Action Spaces.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

The Price of Usability: Designing Operationalizable Strategies for Security Games.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Equilibrium Refinement in Security Games with Arbitrary Scheduling Constraints.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Strategic Coordination of Human Patrollers and Mobile Sensors With Signaling for Security Games.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Kai Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...