Quanfeng Lu

According to our database1, Quanfeng Lu authored at least 18 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
TVWorld: Foundations for Remote-Control TV Agents.
CoRR, January, 2026

MM-Eureka: Toward Stable Multimodal Reasoning via Rule-based Reinforcement Learning with Policy Drift Control.
Trans. Mach. Learn. Res., 2026

Exact Solution and Large-Scale Scaling Analysis of the Imaginary Creutz-Stark Ladder.
Entropy, 2026

2025
SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control.
CoRR, August, 2025

UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation.
CoRR, June, 2025

LLM4Ranking: An Easy-to-use Framework of Utilizing Large Language Models for Document Reranking.
CoRR, April, 2025

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning.
CoRR, March, 2025

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation.
CoRR, 2024

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models.
CoRR, 2024

PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models.
CoRR, 2024

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices.
CoRR, 2024

ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning.
CoRR, 2024

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024


  Loading...