Guozheng Ma

Orcid: 0000-0003-1884-6103

According to our database1, Guozheng Ma authored at least 20 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
STRIDE: Learnable Stepwise Language Feedback for LLM Reasoning.
CoRR, May, 2026

What Makes Value Learning Efficient in Residual Reinforcement Learning?
CoRR, February, 2026

Language-based Trial and Error Falls Behind in the Era of Experience.
CoRR, January, 2026

Towards Reliable Medical LLMs: Benchmarking and Enhancing Confidence Estimation of Large Language Models in Medical Consultation.
CoRR, January, 2026

Consistency-Regularized Multi-Stage Joint-Perception Graph Fuzzy Clustering Algorithm.
Neurocomputing, 2026

2025
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning.
Int. J. Comput. Vis., October, 2025

Rethinking the Role of Dynamic Sparse Training for Scalable Deep Reinforcement Learning.
CoRR, October, 2025

UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios.
CoRR, September, 2025

MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models.
CoRR, June, 2025

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
CoRR, April, 2025

Are Large Language Models Really Robust to Word-Level Perturbations?
Trans. Mach. Learn. Res., 2025

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Faster and Better 3D Splatting via Group Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping.
CoRR, 2024

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Normalization Enhances Generalization in Visual Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2013
An optimization scheme for highway emergency dispatching management.
Proceedings of the Ninth International Conference on Natural Computation, 2013


  Loading...