Zeyu Jia

According to our database, Zeyu Jia authored at least 24 papers between 2019 and 2025.

Bibliography

2025
Gaussian Sequence Model: Sample Complexities of Testing, Estimation and LFHT.
CoRR, July, 2025

Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits.
CoRR, May, 2025

Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning.
CoRR, May, 2025

Do We Need to Verify Step by Step? Rethinking Process Supervision from a Theoretical Perspective.
CoRR, February, 2025

Ensuring resilience in active distribution networks: A security-constrained robust approach with energy storage and demand response.
J. Comput. Methods Sci. Eng., 2025

On the Minimax Regret of Sequential Probability Assignment via Square-Root Entropy.
Proceedings of the Thirty Eighth Annual Conference on Learning Theory, 2025

2024
How Does Variance Shape the Regret in Contextual Bandits?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data.
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

2023
When is Agnostic Reinforcement Learning Statistically Tractable?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Entropic characterization of optimal rates for learning Gaussian mixtures.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Linear Reinforcement Learning with Ball Structure Action Space.
Proceedings of the International Conference on Algorithmic Learning Theory, 2023

2022
Bandwidth Optimization of MEMS Accelerometers in Fluid Medium Environment.
Sensors, 2022

Intrinsic Dimension Estimation Using Wasserstein Distance.
J. Mach. Learn. Res., 2022

Rate of convergence of the smoothed empirical Wasserstein distance.
CoRR, 2022

2021
Search Direction Correction with Normalized Gradient Makes First-Order Methods Faster.
SIAM J. Sci. Comput., 2021

The application of machine learning algorithms in predicting the length of stay following femoral neck fracture.
Int. J. Medical Informatics, 2021

Intrinsic Dimension Estimation.
CoRR, 2021

Identification of Key Gene Modules and Hub Genes of Hypertension Based on WGCNA Algorithm.
Proceedings of the BIBE 2021: The Fifth International Conference on Biological Information and Biomedical Engineering, 2021

2020
Towards solving 2-TBSG efficiently.
Optim. Methods Softw., 2020

Model-Based Reinforcement Learning with Value-Targeted Regression.
Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Model-Based Reinforcement Learning with Value-Targeted Regression.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Toward Solving 2-TBSG Efficiently.
CoRR, 2019

Feature-Based Q-Learning for Two-Player Stochastic Games.
CoRR, 2019

