Zhenfei Yin

Orcid: 0000-0002-8666-1103

According to our database¹, Zhenfei Yin authored at least 94 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

Dynamic Mixture of Latent Memories for Self-Evolving Agents.

CoRR, May, 2026

ReCrit: Transition-Aware Reinforcement Learning for Scientific Critic Reasoning.

CoRR, May, 2026

CreFlow: Corrective Reflow for Sparse-Reward Embodied Video Diffusion RL.

CoRR, May, 2026

Strategic Exploitation in LLM Agent Markets: A Simulation Framework for E-Commerce Trust.

CoRR, May, 2026

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction.

CoRR, May, 2026

ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation.

CoRR, April, 2026

Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents.

CoRR, April, 2026

CoEnv: Driving Embodied Multi-Agent Collaboration via Compositional Environment.

CoRR, April, 2026

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization.

CoRR, March, 2026

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning.

CoRR, March, 2026

SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents.

CoRR, February, 2026

Charting Empirical Laws for LLM Fine-Tuning in Scientific Multi-Discipline Learning.

CoRR, February, 2026

TodoEvolve: Learning to Architect Agent Planning Systems.

CoRR, February, 2026

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents.

CoRR, February, 2026

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning.

CoRR, February, 2026

Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching through Stock-Market Simulation.

CoRR, February, 2026

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models.

CoRR, February, 2026

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents.

CoRR, February, 2026

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents.

CoRR, January, 2026

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models.

CoRR, January, 2026

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance.

CoRR, January, 2026

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge.

CoRR, January, 2026

Think3D: Thinking with Space for Spatial Reasoning.

CoRR, January, 2026

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey.

Francisco Piedrahita Velez

Trans. Mach. Learn. Res., 2026

2025

GenderBias-VL: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing.

Int. J. Comput. Vis., December, 2025

RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic.

CoRR, December, 2025

From Word to World: Can Large Language Models be Implicit Text-based World Models?

CoRR, December, 2025

Memory in the Age of AI Agents.

CoRR, December, 2025

LiveSearchBench: An Automatically Constructed Benchmark for Retrieval and Reasoning over Dynamic Knowledge.

CoRR, November, 2025

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning.

CoRR, October, 2025

SecureWebArena: A Holistic Security Evaluation Benchmark for LVLM-based Web Agents.

CoRR, October, 2025

CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards.

CoRR, October, 2025

A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory.

CoRR, October, 2025

Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning.

CoRR, September, 2025

LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space.

CoRR, September, 2025

Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts.

CoRR, September, 2025

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines.

CoRR, September, 2025

Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning.

CoRR, September, 2025

Interleaving Reasoning for Better Text-to-Image Generation.

CoRR, September, 2025

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey.

CoRR, September, 2025

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy.

Int. J. Comput. Vis., August, 2025

aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists.

CoRR, August, 2025

VeriGUI: Verifiable Long-Chain GUI Dataset.

CoRR, August, 2025

When Autonomy Goes Rogue: Preparing for Risks of Multi-Agent Collusion in Social Systems.

CoRR, July, 2025

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset.

CoRR, July, 2025

Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI.

CoRR, June, 2025

A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis.

CoRR, May, 2025

LabUtopia: High-Fidelity Simulation and Hierarchical Benchmark for Scientific Embodied Agents.

CoRR, May, 2025

X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs.

CoRR, May, 2025

MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems.

CoRR, May, 2025

CompBench: Benchmarking Complex Instruction-guided Image Editing.

CoRR, May, 2025

AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research.

CoRR, May, 2025

Towards Physically Plausible Video Generation via VLM Planning.

CoRR, March, 2025

ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks.

CoRR, March, 2025

Robust face anti-spoofing with Dual Probabilistic Modeling.

Pattern Recognit., 2025

Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Chain-of-Imagination for Reliable Instruction Following in Decision Making.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

RH20T-P: A Primitive-Level Robotic Manipulation Dataset towards Composable Generalization Agents in Real-world Scenarios.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

WorldSimBench: Towards Video Generation Models as World Simulators.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Are We There Yet? Revealing the Risks of Utilizing Large Language Models in Scholarly Peer Review.

CoRR, 2024

OASIS: Open Agent Social Interaction Simulations with One Million Agents.

CoRR, 2024

WorldSimBench: Towards Video Generation Models as World Simulators.

CoRR, 2024

Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation.

CoRR, 2024

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model.

CoRR, 2024

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents.

CoRR, 2024

Assessment of Multimodal Large Language Models in Alignment with Human Values.

CoRR, 2024

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control.

CoRR, 2024

Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models.

CoRR, 2024

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities.

CoRR, 2024

3D Point Cloud Pre-Training with Knowledge Distilled from 2D Images.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Depicting Beyond Scores: Advancing Image Quality Assessment Through Multi-modal Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models.

CoRR, 2023

Octavius: Mitigating Task Interference in MLLMs via MoE.

CoRR, 2023

Latent Distribution Adjusting for Face Anti-Spoofing.

CoRR, 2023

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

3D Point Cloud Pre-training with Knowledge Distillation from 2D Images.

CoRR, 2022

Benchmarking Omni-Vision Representation Through the Lens of Visual Realms.

Proceedings of the Computer Vision - ECCV 2022, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

One to Transfer All: A Universal Transfer Framework for Vision Foundation Model with Few Data.

CoRR, 2021

INTERN: A New Learning Paradigm Towards General Vision.

CoRR, 2021

Few-Shot Domain Expansion for Face Anti-Spoofing.

CoRR, 2021

CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results.

CoRR, 2021

2020

CelebA-Spoof: Large-Scale Face Anti-spoofing Dataset with Rich Annotations.

Proceedings of the Computer Vision - ECCV 2020, 2020
