Canzhe Zhao

Orcid: 0000-0003-1080-9412

According to our database1, Canzhe Zhao authored at least 12 papers between 2021 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Clustering of conversational bandits with posterior sampling for user preference learning and elicitation.
User Model. User Adapt. Interact., November, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.
CoRR, 2023

Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

2022
Knowledge-aware Conversational Preference Elicitation with Bandit Feedback.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Conservative Contextual Combinatorial Cascading Bandit.
CoRR, 2021

Comparison-based Conversational Recommender System with Relative Bandit Feedback.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Clustering of Conversational Bandits for User Preference Learning and Elicitation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021


  Loading...