Ziniu Li

According to our database1, Ziniu Li authored at least 17 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Why Transformers Need Adam: A Hessian Perspective.
CoRR, 2024

2023
Policy Optimization in RLHF: The Impact of Out-of-preference Data.
CoRR, 2023

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models.
CoRR, 2023

Deploying Offline Reinforcement Learning with Human Feedback.
CoRR, 2023

Theoretical Analysis of Offline Imitation With Supplementary Dataset.
CoRR, 2023

Provably Efficient Adversarial Imitation Learning with Unknown Transitions.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Imitation Learning from Imperfection: Theoretical Justifications and Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Error Bounds of Imitating Policies and Environments for Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis.
CoRR, 2022

A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle.
CoRR, 2022

Rethinking ValueDice: Does It Really Improve Performance?
CoRR, 2022

HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions.
CoRR, 2021

2020
Solving the Inverse Design Problem of Electrical Fuse With Machine Learning.
IEEE Access, 2020

Error Bounds of Imitating Policies and Environments.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Efficient Exploration by Novelty-Pursuit.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

2019
On Value Discrepancy of Imitation Learning.
CoRR, 2019


  Loading...