Qiwen Cui

Orcid: 0000-0002-0193-6623

According to our database¹, Qiwen Cui authored at least 21 papers between 2020 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data.

CoRR, 2024

Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques.

CoRR, 2024

(N, K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model.

CoRR, 2024

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation.

Yan Dai, Qiwen Cui, Simon S. Du

CoRR, 2024

Learning Optimal Tax Design in Nonatomic Congestion Games.

Qiwen Cui, Maryam Fazel, Simon S. Du

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation (Extended Abstract).

Yan Dai, Qiwen Cui, Simon S. Du

Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

2023

An Efficient Fisher Matrix Approximation Method for Large-Scale Neural Network Optimization.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation.

Qiwen Cui, Kaiqing Zhang, Simon S. Du

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

2022

When is Offline Two-Player Zero-Sum Markov Game Solvable?

Qiwen Cui, Simon S. Du

CoRR, 2022

Near-Optimal Randomized Exploration for Tabular Markov Decision Processes.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Gap-dependent Bounds for Offline Reinforcement Learning.

Xinqi Wang, Qiwen Cui, Simon S. Du

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning in Congestion Games with Bandit Feedback.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

When are Offline Two-Player Zero-Sum Markov Games Solvable?

Qiwen Cui, Simon S. Du

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus.

Qiwen Cui, Simon S. Du

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning.

CoRR, 2021

Minimax sample complexity for turn-based stochastic game.

Qiwen Cui, Lin F. Yang

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Randomized Exploration in Reinforcement Learning with General Value Function Approximation.

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning?

Qiwen Cui, Lin F. Yang

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
