Fanrui Zhang

Orcid: 0000-0002-1078-430X

According to our database1, Fanrui Zhang authored at least 23 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams.
CoRR, August, 2025

CX-Mind: A Pioneering Multimodal Large Language Model for Interleaved Reasoning in Chest X-ray via Curriculum-Guided Reinforcement Learning.
CoRR, August, 2025

Sekai: A Video Dataset towards World Exploration.
CoRR, June, 2025

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation.
CoRR, June, 2025

Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning.
CoRR, May, 2025

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models.
CoRR, April, 2025

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges.
CoRR, March, 2025

ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy.
CoRR, March, 2025

Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Multi-granularity and Multi-modal Prompt Learning for Person Re-Identification.
Proceedings of the Computational Visual Media - 13th International Conference, 2025

2024
Event-Driven Heterogeneous Network for Video Deraining.
Int. J. Comput. Vis., December, 2024

ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization.
CoRR, 2024

Hierarchical Information Enhancement Network for Cascade Prediction in Social Networks.
CoRR, 2024

Multi-perspective Memory Enhanced Network for Identifying Key Nodes in Social Networks.
CoRR, 2024

ESCNet: Entity-enhanced and Stance Checking Network for Multi-modal Fact-Checking.
Proceedings of the ACM on Web Conference 2024, 2024

RAG-Guided Large Language Models for Visual Spatial Description with Adaptive Hallucination Corrector.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Natural Language-centered Inference Network for Multi-modal Fake News Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2023
Knowledge-Enhanced Hierarchical Information Correlation Learning for Multi-Modal Rumor Detection.
CoRR, 2023

Hierarchical Semantic Enhancement Network for Multimodal Fake News Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

ECENet: Explainable and Context-Enhanced Network for Muti-modal Fact verification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Team Zhang at Factify 2: Unimodal Feature-enhanced and Cross-modal Correlation learning for Multi-Modal Fact Verification.
Proceedings of De-Factify 2: 2nd Workshop on Multimodal Fact Checking and Hate Speech Detection, 2023

NTIRE 2023 Challenge on Stereo Image Super-Resolution: Methods and Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Tungsten Oxide Flow Sensor and its Performance Regulation.
IEEE Trans. Instrum. Meas., 2022


  Loading...