Qiwei Di

According to our database, Qiwei Di authored at least 11 papers between 2023 and 2026.

Bibliography

2026
Dimension-Independent Convergence of Underdamped Langevin Monte Carlo in KL Divergence.
CoRR, March, 2026

Near-Optimal Regret for KL-Regularized Multi-Armed Bandits.
CoRR, March, 2026

2025
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference.
CoRR, December, 2025

Best-of-Majority: Minimax-Optimal Strategy for Pass@k Inference Scaling.
CoRR, October, 2025

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Relative-Translation Invariant Wasserstein Distance.
CoRR, 2024

Borda Regret Minimization for Generalized Linear Dueling Bandits.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.
Proceedings of the International Conference on Machine Learning, 2023

