Fuwen Luo

According to our database, Fuwen Luo authored at least 18 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding.
CoRR, May, 2025

Visual Abstract Thinking Empowers Multimodal Reasoning.
CoRR, May, 2025

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game.
CoRR, March, 2025

DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms.
CoRR, March, 2025

Perspective Transition of Large Language Models for Solving Subjective Tasks.
CoRR, January, 2025

Perspective Transition of Large Language Models for Solving Subjective Tasks.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding.
CoRR, 2024

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models.
CoRR, 2024

Towards Unified Alignment Between Agents, Humans, and Environment.
CoRR, 2024

Position: Towards Unified Alignment Between Agents, Humans, and Environment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Model Composition for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf.
CoRR, 2023

Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models.
CoRR, 2023

2022
FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models.
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022

