Yinmin Zhang

Orcid: 0000-0002-4825-470X

According to our database, Yinmin Zhang authored at least 21 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering.
CoRR, February, 2026

R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging.
CoRR, February, 2026

STEP3-VL-10B Technical Report.
CoRR, January, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning.
CoRR, January, 2026

PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction.
CoRR, November, 2025

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning.
CoRR, July, 2025

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Multi-matrix Factorization Attention.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning.
Trans. Mach. Learn. Res., 2024

Adaptive pessimism via target Q-value for offline reinforcement learning.
Neural Networks, 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Masked Pretraining for Multi-Agent Decision Making.
CoRR, 2023

Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection.
CoRR, 2022

2021
Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection.
CoRR, 2021

Delving Into Localization Errors for Monocular 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

