Yingzhe Peng
Orcid: 0009-0008-6077-6704
According to our database1,
Yingzhe Peng
authored at least 13 papers
between 2023 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
CoRR, August, 2025
L-CLIPScore: a Lightweight Embedding-based Captioning Metric for Evaluating and Training.
CoRR, July, 2025
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL.
CoRR, March, 2025
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks.
Proceedings of the 30th International Conference on Intelligent User Interfaces, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
First Place Solution to the Multiple-choice Video QA Track of The Second Perception Test Challenge.
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
2023
CoRR, 2023