Kai Wang

Orcid: 0000-0002-2446-987X

Affiliations:
  • Georgia Institute of Technology, Atlanta, GA, USA
  • Massachusetts Institute of Technology, Cambridge, MA, USA (2023)
  • Harvard University, Cambridge, MA, USA (PhD 2023)
  • University of Southern California, Center for Artificial Intelligence in Society, Los Angeles, CA, USA (former)


According to our database1, Kai Wang authored at least 39 papers between 2018 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
KOALA: Knowledge of Optimization and Learning Algorithms for Healthcare.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Networked Restless Multi-Arm Bandits with Reinforcement Learning.
CoRR, December, 2025

A Fully First-Order Layer for Differentiable Optimization.
CoRR, December, 2025

Neural Index Policies for Restless Multi-Action Bandits with Heterogeneous Budgets.
CoRR, October, 2025

Diffusion-DFL: Decision-focused Diffusion Models for Stochastic Optimization.
CoRR, October, 2025

Finding a Multiple Follower Stackelberg Equilibrium: A Fully First-Order Method.
CoRR, September, 2025

Revealing Potential Biases in LLM-Based Recommender Systems in the Cold Start Setting.
CoRR, August, 2025

Non-Stationary Restless Multi-Armed Bandits with Provable Guarantee.
CoRR, August, 2025

One-Step Flow Policy Mirror Descent.
CoRR, July, 2025

Soft Diffusion Actor-Critic: Efficient Online Reinforcement Learning for Diffusion Policy.
CoRR, February, 2025

What's in a Query: Polarity-Aware Distribution-Based Fair Ranking.
Proceedings of the ACM on Web Conference 2025, 2025

What is the Right Notion of Distance between Predict-then-Optimize Tasks?
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

Efficient Online Reinforcement Learning for Diffusion Policy.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Primal-Dual Spectral Representation for Off-policy Evaluation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

2024
epiDAMIK 2024: The 7th International Workshop on Epidemiology meets Data Mining and Knowledge Discovery.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

2023
Characterizing and Improving the Robustness of Predict-Then-Optimize Frameworks.
Proceedings of the Decision and Game Theory for Security: 14th International Conference, 2023

Restless Multi-Armed Bandits for Maternal and Child Health: Results from Decision-Focused Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Modeling Robustness in Decision-Focused Learning as a Stackelberg Game.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Optimistic Whittle Index Policy: Online Learning for Restless Bandits.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Care Domain.
CoRR, 2022

Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Coordinating Followers to Reach Better Equilibria: End-to-End Gradient Descent for Stackelberg Games.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling.
CoRR, 2021

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning.
CoRR, 2021

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dual-Mandate Patrols: Multi-Armed Bandits for Green Security.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Robust Spatial-Temporal Incident Prediction.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Automatically Learning Compact Quality-aware Surrogates for Optimization Problems.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Active Screening on Recurrent Diseases Contact Networks with Uncertainty: A Reinforcement Learning Approach.
Proceedings of the Multi-Agent-Based Simulation XXI - 21st International Workshop, 2020

Scalable Game-Focused Learning of Adversary Models: Data-to-Decisions in Network Security Games.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Improving GP-UCB Algorithm by Harnessing Decomposed Feedback.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Learning to Signal in the Goldilocks Zone: Improving Adversary Compliance in Security Games.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Mobile Game Theory with Street Gangs.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

DeepFP for Finding Nash Equilibrium in Continuous Action Spaces.
Proceedings of the Decision and Game Theory for Security - 10th International Conference, 2019

Deep Fictitious Play for Games with Continuous Action Spaces.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
The Price of Usability: Designing Operationalizable Strategies for Security Games.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Equilibrium Refinement in Security Games with Arbitrary Scheduling Constraints.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Strategic Coordination of Human Patrollers and Mobile Sensors With Signaling for Security Games.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018


  Loading...