Zhiwei (Tony) Qin

Orcid: 0000-0001-5383-4816

Affiliations:
  • DiDi Research America, Mountain View, Lyft Rideshare Labs, CA, USA
  • Walmart Labs, San Bruno, CA, USA
  • Columbia University, New York, USA (PhD 2013)


According to our database1, Zhiwei (Tony) Qin authored at least 45 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems.
IEEE Trans. Intell. Transp. Syst., February, 2024

2023
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Combinatorial Optimization Meets Reinforcement Learning: Effective Taxi Order Dispatching at Large-Scale.
IEEE Trans. Knowl. Data Eng., October, 2023

KDD-2023 Workshop on Decision Intelligence and Analytics for Online Marketplaces.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

A Unified Representation Framework for Rideshare Marketplace Equilibrium and Efficiency.
Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, 2023

2022
KDD 2022 Workshop on Decision Intelligence and Analytics for Online Marketplaces: Jobs, Ridesharing, Retail, and Beyond.
SIGKDD Explor., 2022

Spatio-temporal Incentives Optimization for Ride-hailing Services with Offline Deep Reinforcement Learning.
CoRR, 2022

Decision Intelligence and Analytics for Online Marketplaces: Jobs, Ridesharing, Retail and Beyond.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Multiple Tiered Treatments Optimization with Causal Inference on Response Distribution.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation.
Mach. Learn., 2021

Real-world Ride-hailing Vehicle Repositioning using Deep Reinforcement Learning.
CoRR, 2021

Offline Model-based Adaptable Policy Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Value Function is All You Need: A Unified Learning Framework for Ride Hailing Platforms.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Reinforcement Learning for Ridesharing: A Survey.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Multi-Objective Distributional Reinforcement Learning for Large-Scale Order Dispatching.
Proceedings of the IEEE International Conference on Data Mining, 2021

Optimizing Bike-Share Repositioning: Networked Inventory Management with Spatiotemporal Modeling.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Ride-Hailing Order Dispatching at DiDi via Reinforcement Learning.
INFORMS J. Appl. Anal., 2020

Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

2019
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning.
CoRR, 2019

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning.
Proceedings of the World Wide Web Conference, 2019

Domain Generation Algorithms detection through deep neural network and ensemble.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

A Deep Value-network Based Approach for Multi-Driver Order Dispatching.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Deep Reinforcement Learning with Applications in Transportation.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Deep Reinforcement Learning for Ride-sharing Dispatching and Repositioning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Deep Reinforcement Learning for Multi-driver Vehicle Dispatching and Repositioning Problem.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

InBEDE: Integrating Contextual Bandit with TD Learning for Joint Pricing and Dispatch of Ride-Hailing Platforms.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Origin-destination Flow Prediction with Vehicle Trajectory Data and Semi-supervised Recurrent Neural Network.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
A Bayesian framework for large-scale geo-demand estimation in on-line retailing.
Ann. Oper. Res., 2018

Large-Scale Targeted Marketing by Supervised PageRank with Seeds.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2018

Deep Reinforcement Learning with Knowledge Transfer for Online Rides Order Dispatching.
Proceedings of the IEEE International Conference on Data Mining, 2018

Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
A Unified Neural Network Approach for Estimating Travel Time and Distance for a Taxi Trip.
CoRR, 2017

2015
An alternating direction method for total variation denoising.
Optim. Methods Softw., 2015

Low-Rank Tensor Recovery for Geo-Demand Estimation in Online Retailing.
Proceedings of the INNS Conference on Big Data 2015, 2015

2014
Robust Low-Rank Tensor Recovery: Models and Algorithms.
SIAM J. Matrix Anal. Appl., 2014

HIPAD - A Hybrid Interior-Point Alternating Direction Algorithm for Knowledge-Based SVM and Feature Selection.
Proceedings of the Learning and Intelligent Optimization, 2014

2013
Optimization Algorithms for Structured Machine Learning and Image Processing Problems.
PhD thesis, 2013

Efficient block-coordinate descent algorithms for the Group Lasso.
Math. Program. Comput., 2013

2012
Structured Sparsity via Alternating Direction Methods.
J. Mach. Learn. Res., 2012

2011
Structured Sparsity via Alternating Directions Methods
CoRR, 2011


  Loading...