Yinmin Zhang

Orcid: 0000-0002-4825-470X

According to our database¹, Yinmin Zhang authored at least 21 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering.

[BibT_eX]

[DOI]

CoRR, February, 2026

R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging.

[BibT_eX]

[DOI]

CoRR, February, 2026

STEP3-VL-10B Technical Report.

[BibT_eX]

[DOI]

Multimodal Intelligence Team

CoRR, January, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning.

[BibT_eX]

[DOI]

CoRR, January, 2026

PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction.

[BibT_eX]

[DOI]

CoRR, November, 2025

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning.

[BibT_eX]

[DOI]

CoRR, July, 2025

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Multi-matrix Factorization Attention.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Adaptive pessimism via target Q-value for offline reinforcement learning.

[BibT_eX]

[DOI]

Neural Networks, 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Masked Pretraining for Multi-Agent Decision Making.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2021

Delving Into Localization Errors for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Yinmin Zhang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...