Yihan Du

ORCID: 0000-0002-3912-7039

According to our database, Yihan Du authored at least 24 papers between 2018 and 2024.

Bibliography

2024
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization.
CoRR, 2024

Cascading Reinforcement Learning.
CoRR, 2024

2023
Upscaling of longwave downward radiation from instantaneous to any temporal scale: Algorithms, validation, and comparison.
Int. J. Appl. Earth Obs. Geoinformation, March, 2023

A Uniform Model for Correcting Shortwave Downward Radiation Over Rugged Terrain at Various Scales.
IEEE Trans. Geosci. Remote. Sens., 2023

Errata on "Improved Algorithm to Derive All-Sky Longwave Downward Radiation From Space: Application to Fengyun-4A Measurements".
IEEE Trans. Geosci. Remote. Sens., 2023

Improved Algorithm to Derive All-Sky Longwave Downward Radiation From Space: Application to Fengyun-4A Measurements.
IEEE Trans. Geosci. Remote. Sens., 2023

Provably Safe Reinforcement Learning with Step-wise Violation Constraints.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-task Representation Learning for Pure Exploration in Linear Bandits.
Proceedings of the International Conference on Machine Learning, 2023

Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Collaborative Pure Exploration in Kernel Bandit.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Risk-Sensitive Reinforcement Learning: Iterated CVaR and the Worst Path.
CoRR, 2022

Branching Reinforcement Learning.
CoRR, 2022

Branching Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Combinatorial Pure Exploration with Bottleneck Reward Function and its Extension to General Reward Functions.
CoRR, 2021

Continuous Mean-Covariance Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Combinatorial Pure Exploration with Bottleneck Reward Function.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A One-Size-Fits-All Solution to Conservative Bandit Problems.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation.
Neurocomputing, 2020

Combinatorial Pure Exploration with Partial or Full-Bandit Linear Feedback.
CoRR, 2020

Combinatorial Pure Exploration for Dueling Bandit.
Proceedings of the 37th International Conference on Machine Learning, 2020

Dueling Bandits: From Two-dueling to Multi-dueling.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Direct Object Recognition Without Line-Of-Sight Using Optical Coherence.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Object-Adaptive LSTM Network for Visual Tracking.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

