Yihan Du

Seo Taek Kong

R. Srikant

CoRR, October, 2025

A Method to Derive Long-Term Global Hourly Near-Surface Air Temperature by Combining Remote Sensing and Reanalysis Datasets.

IEEE Geosci. Remote. Sens. Lett., 2025

Reinforcement Learning with Segment Feedback.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

To Ignore or Not: Understanding the Influence of Hillshade.

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

EVIT-YOLOv8: Construction and research on African Swine Fever facial expression recognition.

Comput. Electron. Agric., 2024

Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cascading Reinforcement Learning.

R. Srikant

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Upscaling of longwave downward radiation from instantaneous to any temporal scale: Algorithms, validation, and comparison.

Int. J. Appl. Earth Obs. Geoinformation, March, 2023

A Uniform Model for Correcting Shortwave Downward Radiation Over Rugged Terrain at Various Scales.

IEEE Trans. Geosci. Remote. Sens., 2023

Errata on "Improved Algorithm to Derive All-Sky Longwave Downward Radiation From Space: Application to Fengyun-4A Measurements".

IEEE Trans. Geosci. Remote. Sens., 2023

Improved Algorithm to Derive All-Sky Longwave Downward Radiation From Space: Application to Fengyun-4A Measurements.

IEEE Trans. Geosci. Remote. Sens., 2023

Provably Safe Reinforcement Learning with Step-wise Violation Constraints.

Nuoya Xiong

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-task Representation Learning for Pure Exploration in Linear Bandits.

Wen Sun

Proceedings of the International Conference on Machine Learning, 2023

Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Collaborative Pure Exploration in Kernel Bandit.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Risk-Sensitive Reinforcement Learning: Iterated CVaR and the Worst Path.

CoRR, 2022

Branching Reinforcement Learning.

CoRR, 2022

Branching Reinforcement Learning.

Proceedings of the International Conference on Machine Learning, 2022

2021

Combinatorial Pure Exploration with Bottleneck Reward Function and its Extension to General Reward Functions.

CoRR, 2021

Continuous Mean-Covariance Bandits.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Combinatorial Pure Exploration with Bottleneck Reward Function.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A One-Size-Fits-All Solution to Conservative Bandit Problems.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Combinatorial Pure Exploration with Full-Bandit or Partial Linear Feedback.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation.

Neurocomputing, 2020

Combinatorial Pure Exploration with Partial or Full-Bandit Linear Feedback.

CoRR, 2020

Combinatorial Pure Exploration for Dueling Bandit.

Proceedings of the 37th International Conference on Machine Learning, 2020

Dueling Bandits: From Two-dueling to Multi-dueling.
