Canzhe Zhao

Orcid: 0000-0003-1080-9412

According to our database¹, Canzhe Zhao authored at least 17 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Decentralized Asynchronous Multi-player Bandits.

[BibT_eX]

[DOI]

CoRR, September, 2025

Heavy-tailed Linear Bandits: Adversarial Robustness, Best-of-both-worlds, and Beyond.

[BibT_eX]

[DOI]

Canzhe Zhao

Shinji Ito

Shuai Li

CoRR, August, 2025

Towards Provably Efficient Learning of Imperfect Information Extensive-Form Games with Linear Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Logarithmic Regret for Linear Markov Decision Processes with Adversarial Corruptions.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Toward joint utilization of absolute and relative bandit feedback for conversational recommendation.

[BibT_eX]

[DOI]

User Model. User Adapt. Interact., November, 2024

2023

Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.

[BibT_eX]

[DOI]

User Model. User Adapt. Interact., November, 2023

Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization.

[BibT_eX]

[DOI]

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm.

[BibT_eX]

[DOI]

Fang Kong

Canzhe Zhao

Shuai Li

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

2022

Knowledge-aware Conversational Preference Elicitation with Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model.

[BibT_eX]

[DOI]

Cheng Chen

Canzhe Zhao

Shuai Li

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Conservative Contextual Combinatorial Cascading Bandit.

[BibT_eX]

[DOI]

CoRR, 2021

Comparison-based Conversational Recommender System with Relative Bandit Feedback.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Clustering of Conversational Bandits for User Preference Learning and Elicitation.

[BibT_eX]

[DOI]

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Canzhe Zhao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...