Yuan Zhou

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Asymptotic optimality of base-stock policies for lost-sales inventory systems with stochastic lead times.

[BibT_eX]

[DOI]

Oper. Res. Lett., 2024

Optimal Policies for Dynamic Pricing and Inventory Control with Nonparametric Censored Demands.

[BibT_eX]

[DOI]

Boxiao Chen

Manag. Sci., 2024

A Minibatch-SGD-Based Learning Meta-Policy for Inventory Systems with Myopic Optimal Policy.

[BibT_eX]

[DOI]

CoRR, 2024

Closing the Gaps: Optimality of Sample Average Approximation for Data-Driven Newsvendor Problems.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Robust Situational Reinforcement Learning in Face of Context Disturbances.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Learning Sparse Group Models Through Boolean Relaxation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Personalized Pricing with Group Fairness Constraint.

[BibT_eX]

[DOI]

Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

2022

Dynamic Pricing and Inventory Control with Fixed Ordering Cost and Incomplete Demand Information.

[BibT_eX]

[DOI]

Manag. Sci., 2022

Assortment Optimization Under the Multivariate MNL Model.

[BibT_eX]

[DOI]

CoRR, 2022

Bayesian-Nash-Incentive-Compatible Mechanism for Blockchain Transaction Fee Allocation.

[BibT_eX]

[DOI]

Zishuo Zhao

CoRR, 2022

Fairness-aware Network Revenue Management with Demand Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Dynamic Car Dispatching and Pricing: Revenue and Fairness for Ridesharing Platforms.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Proximal Exploration for Model-guided Protein Sequence Design.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Off-Policy Reinforcement Learning with Delayed Rewards.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Learning Long-Term Reward Redistribution via Randomized Return Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Imitation Learning from Observations under Transition Model Disparity.

[BibT_eX]

[DOI]

Tanmay Gangwani

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Optimal Policy for Dynamic Assortment Planning Under Multinomial Logit Models.

[BibT_eX]

[DOI]

Math. Oper. Res., 2021

Coordinate-wise Control Variates for Deep Policy Gradients.

[BibT_eX]

[DOI]

Yuanyi Zhong

CoRR, 2021

Linear bandits with limited adaptivity and learning distributional optimal design.

[BibT_eX]

[DOI]

Yufei Ruan

Jiaqi Yang

Proceedings of the STOC '21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, 2021

Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Tight Regret Bounds for Infinite-armed Linear Contextual Bandits.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Near-Optimal MNL Bandits Under Risk Criteria.

[BibT_eX]

[DOI]

Guangyu Xi

Chao Tao

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Dynamic Assortment Optimization with Changing Contextual Information.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2020

Efficient Competitive Self-Play Policy Optimization.

[BibT_eX]

[DOI]

Yuanyi Zhong

CoRR, 2020

Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition.

[BibT_eX]

[DOI]

CoRR, 2020

Collaborative Top Distribution Identifications with Limited Interaction.

[BibT_eX]

[DOI]

Nikolai Karpov

CoRR, 2020

Almost Optimal Model-Free Reinforcement Learningvia Reference-Advantage Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Guidance Rewards with Trajectory-space Smoothing.

[BibT_eX]

[DOI]

Tanmay Gangwani

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Structural Genetic Information via Graph Neural Embedding.

[BibT_eX]

[DOI]

Proceedings of the Bioinformatics Research and Applications - 16th International Symposium, 2020

Multinomial Logit Bandit with Low Switching Cost.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Collaborative Top Distribution Identifications with Limited Interaction (Extended Abstract).

[BibT_eX]

[DOI]

Nikolai Karpov

Proceedings of the 61st IEEE Annual Symposium on Foundations of Computer Science, 2020

Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity.

[BibT_eX]

[DOI]

Tanmay Gangwani

Proceedings of the 4th Conference on Robot Learning, 2020

Root-n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank.

[BibT_eX]

[DOI]

Proceedings of the Conference on Learning Theory, 2020

A PTAS for the Bayesian Thresholding Bandit Problem.

[BibT_eX]

[DOI]

Yue Qin

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Adaptive Double-Exploration Tradeoff for Outlier Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Optimal Design of Process Flexibility for General Production Systems.

[BibT_eX]

[DOI]

Oper. Res., 2019

√n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank.

[BibT_eX]

[DOI]

CoRR, 2019

Tight Regret Bounds for Infinite-armed Linear Contextual Bandits.

[BibT_eX]

[DOI]

Yingkai Li

CoRR, 2019

Thresholding Bandit with Optimal Aggregate Regret.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exploration via Hindsight Goal Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Collaborative Learning with Limited Interaction: Tight Bounds for Distributed Exploration in Multi-armed Bandits.

[BibT_eX]

[DOI]

Chao Tao

Proceedings of the 60th IEEE Annual Symposium on Foundations of Computer Science, 2019

Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits.

[BibT_eX]

[DOI]

Yingkai Li

Proceedings of the Conference on Learning Theory, 2019

2018

Dynamic Assortment Selection under the Nested Logit Models.

[BibT_eX]

[DOI]

CoRR, 2018

Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Tight Bounds for Collaborative PAC Learning via Multiplicative Weights.

[BibT_eX]

[DOI]

Jiecao Chen

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Best Arm Identification in Linear Bandits with Linear Dimension Dependency.

[BibT_eX]

[DOI]

Chao Tao

Saúl A. Blanco

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

Parameterized Algorithms for Constraint Satisfaction Problems Above Average with Global Cardinality Constraints.

[BibT_eX]

[DOI]

Xue Chen

Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, 2017

Adaptive Multiple-Arm Identification.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Machine Learning, 2017

2015

Optimal Sparse Designs for Process Flexibility via Probabilistic Expanders.

[BibT_eX]

[DOI]

Jiawei Zhang

Oper. Res., 2015

Satisfiability of Ordering CSPs Above Average.

[BibT_eX]

[DOI]

Konstantin Makarychev

CoRR, 2015

Satisfiability of Ordering CSPs above Average is Fixed-Parameter Tractable.

[BibT_eX]

[DOI]

Konstantin Makarychev

Proceedings of the IEEE 56th Annual Symposium on Foundations of Computer Science, 2015

2014

Constant Factor Lasserre Integrality Gaps for Graph Partitioning Problems.

[BibT_eX]

[DOI]

Ali Kemal Sinop

SIAM J. Optim., 2014

Hardness of Robust Graph Isomorphism, Lasserre Gaps, and Asymmetry of Random Graphs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, 2014

Hypercontractive inequalities via SOS, and the Frankl-Rödl graph.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, 2014

Approximation schemes via Sherali-Adams hierarchy for dense constraint satisfaction problems and assignment problems.

[BibT_eX]

[DOI]

Yuichi Yoshida

Proceedings of the Innovations in Theoretical Computer Science, 2014

Locally testable codes and cayley graphs.

[BibT_eX]

[DOI]

Parikshit Gopalan

Salil P. Vadhan

Proceedings of the Innovations in Theoretical Computer Science, 2014

Optimal PAC Multiple Arm Identification with Applications to Crowdsourcing.

[BibT_eX]

[DOI]

Jian Li

Proceedings of the 31th International Conference on Machine Learning, 2014

Optimal Strong Parallel Repetition for Projection Games on Low Threshold Rank Graphs.

[BibT_eX]

[DOI]

Madhur Tulsiani

John Wright

Proceedings of the Automata, Languages, and Programming - 41st International Colloquium, 2014

Deterministic Coupon Collection and Better Strong Dispersers.

[BibT_eX]

[DOI]

Raghu Meka

Omer Reingold

Proceedings of the Approximation, 2014

2013

Approximability and proof complexity.

[BibT_eX]

[DOI]

Fernando G. S. L. Brandão

Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, 2013

2012

Hypercontractive inequalities via SOS, with an application to Vertex-Cover

[BibT_eX]

[DOI]

CoRR, 2012

Hypercontractivity, sum-of-squares proofs, and their applications.

[BibT_eX]

[DOI]

Boaz Barak

Proceedings of the 44th Symposium on Theory of Computing Conference, 2012

Approximation algorithms and hardness of the <i>k</i>-route cut problem.

[BibT_eX]

[DOI]

Julia Chuzhoy

Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, 2012

Polynomial integrality gaps for strong SDP relaxations of Densest <i>k</i>-subgraph.

[BibT_eX]

[DOI]

Aditya Bhaskara

Moses Charikar

Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, 2012

Linear programming, width-1 CSPs, and robust satisfaction.

[BibT_eX]

[DOI]

Proceedings of the Innovations in Theoretical Computer Science 2012, 2012

Approximating Bounded Occurrence Ordering CSPs.

[BibT_eX]

[DOI]

Proceedings of the Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 2012

2011

Approximation Algorithms and Hardness of the k-Route Cut Problem

[BibT_eX]

[DOI]

Julia Chuzhoy

CoRR, 2011

Polynomial integrality gaps for strong SDP relaxations of Densest k-subgraph

[BibT_eX]

[DOI]

Aditya Bhaskara

Moses Charikar

CoRR, 2011

Tight Bounds on the Approximability of Almost-satisfiable Horn SAT and Exact Hitting Set.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second Annual ACM-SIAM Symposium on Discrete Algorithms, 2011

Optimal lower bounds for locality sensitive hashing (except when q is tiny).

[BibT_eX]

[DOI]

Yi Wu

Proceedings of the Innovations in Computer Science, 2011

Finding Almost-Perfect Graph Bisections.

[BibT_eX]

[DOI]

Proceedings of the Innovations in Computer Science, 2011

The Fourier Entropy-Influence Conjecture for Certain Classes of Boolean Functions.

[BibT_eX]

[DOI]

John Wright

Proceedings of the Automata, Languages and Programming - 38th International Colloquium, 2011

Hardness of Max-2Lin and Max-3Lin over Integers, Reals, and Large Cyclic Groups.

[BibT_eX]

[DOI]

Yi Wu

Proceedings of the 26th Annual IEEE Conference on Computational Complexity, 2011

Black-Box Reductions in Mechanism Design.

[BibT_eX]

[DOI]

Zhiyi Huang

Lei Wang

Proceedings of the Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, 2011

2010

Surviving Rates of Graphs with Bounded Treewidth for the Firefighter Problem.

[BibT_eX]

[DOI]

SIAM J. Discret. Math., 2010

2009

Tighter Bounds for Facility Games.

[BibT_eX]

[DOI]

Pinyan Lu

Yajun Wang