Jiacai Liu

Orcid: 0000-0003-1936-2506

According to our database, Jiacai Liu authored at least 11 papers between 2013 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
On the Convergence of Policy Mirror Descent with Temporal Difference Evaluation.
CoRR, September, 2025

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents.
CoRR, September, 2025

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy.
CoRR, July, 2025

Skywork Open Reasoner 1 Technical Report.
CoRR, May, 2025

Equal Division Contribution Values of Trapezoidal Fuzzy Numbers and Their Application to Profit Allocation in Cold Chain Logistics for Agricultural Products.
Symmetry, 2025

ϕ-Update: A Class of Policy Update Methods with Policy Convergence Guarantee.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization.
CoRR, 2024

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs.
CoRR, 2024

Elementary Analysis of Policy Gradient Methods.
CoRR, 2024

2023
On the Linear Convergence of Policy Gradient under Hadamard Parameterization.
CoRR, 2023

2013
Detection of hydrothermally alteration rocks in the east Gandise, Tibet (China) using aster imagery.
Proceedings of the 2013 IEEE International Geoscience and Remote Sensing Symposium, 2013

