Fanhu Zeng

Orcid: 0009-0008-5167-0094

According to our database, Fanhu Zeng authored at least 29 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Unveiling Fine-Grained Visual Traces: Evaluating Multimodal Interleaved Reasoning Chains in Multimodal STEM Tasks.
CoRR, April, 2026

CL-VISTA: Benchmarking Continual Learning in Video Large Language Models.
CoRR, April, 2026

Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients.
CoRR, March, 2026

Imagination Helps Visual Reasoning, But Not Yet in Latent Space.
CoRR, February, 2026

HyTRec: A Hybrid Temporal-Aware Attention Architecture for Long Behavior Sequential Recommendation.
CoRR, February, 2026

TR-DQ: Time-Rotation Diffusion Quantization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
CoRR, December, 2025

MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark.
CoRR, August, 2025

A Comprehensive Survey on Continual Learning in Generative Models.
CoRR, June, 2025

Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration.
CoRR, June, 2025

Token Reduction Should Go Beyond Efficiency in Generative Models - From Vision, Language to Multimodality.
CoRR, May, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection.
CoRR, April, 2025

Towards Efficient and General-Purpose Few-Shot Misclassification Detection for Vision-Language Models.
CoRR, March, 2025

HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model.
CoRR, March, 2025

Federated Continual Instruction Tuning.
CoRR, March, 2025

TR-DQ: Time-Rotation Diffusion Quantization.
CoRR, March, 2025

Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation.
CoRR, February, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Federated Continual Instruction Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guided Prompt.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MambaIC: State Space Models for High-Performance Learned Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMs' Capability via Chart Editing.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
DESIRE: Dynamic Knowledge Consolidation for Rehearsal-Free Continual Learning.
CoRR, 2024

ModalPrompt: Dual-Modality Guided Prompt for Continual Learning of Large Multimodal Models.
CoRR, 2024

Enhancing Outlier Knowledge for Few-Shot Out-of-Distribution Detection with Extensible Local Prompts.
CoRR, 2024

CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models.
CoRR, 2024

2023
PPT: Token Pruning and Pooling for Efficient Vision Transformers.
CoRR, 2023

