Tao Ren

Orcid: 0009-0001-4807-5664

Affiliations:
  • Peking University, Guanghua School of Management, Beijing, China


According to our database1, Tao Ren authored at least 11 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Adaptive Robust Estimator for Multi-Agent Reinforcement Learning.
CoRR, March, 2026

Optimal low-rank stochastic gradient estimation for LLM training.
CoRR, March, 2026

Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence.
CoRR, March, 2026

Nonparametric Bayesian Optimization for General Rewards.
CoRR, February, 2026

2025
RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training.
CoRR, October, 2025

Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer.
CoRR, February, 2025

Exploring and Exploiting Model Uncertainty in Bayesian Optimization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

FLOPS: Forward Learning with OPtimal Sampling.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Unified Framework for Multi-Stage Decision Optimization with Deep Reinforcement Learning and Foundation Models.
Proceedings of the 21st IEEE International Conference on Automation Science and Engineering, 2025

2024
Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode.
CoRR, 2024

RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search.
Proceedings of the 5th ACM International Conference on AI in Finance, 2024


  Loading...