Yiwu Zhong

According to our database, Yiwu Zhong authored at least 26 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning.

CoRR, April, 2026

DOSE: Data Selection for Multi-Modal LLMs via Off-the-Shelf Models.

CoRR, April, 2026

TextShield-R1: Reinforced Reasoning for Tampered Text Detection.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Rethinking Chain-of-Thought Reasoning for Videos.

CoRR, December, 2025

Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation.

CoRR, August, 2025

AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Fine-Grained Spatiotemporal Grounding on Egocentric Videos.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

PAVE: Patching and Adapting Video Large Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Revisiting Tampered Scene Text Detection in the Era of Generative AI.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Omni-IML: Towards Unified Image Manipulation Localization.

CoRR, 2024

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models.

CoRR, 2024

Generalized Tampered Scene Text Detection in the era of Generative AI.

CoRR, 2024

Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models.

CoRR, 2024

Beyond Embeddings: The Promise of Visual Table in Visual Reasoning.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Enhancing Temporal Modeling of Video LLMs via Time Gating.

Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Towards Learning a Generalist Model for Embodied Navigation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation.

CoRR, 2023

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models.

CoRR, 2023

Learning Concise and Descriptive Attributes for Visual Recognition.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

RegionCLIP: Region-based Language-Image Pretraining.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Grounded Language-Image Pre-training.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Learning to Generate Scene Graph from Natural Language Supervision.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Simple Baseline for Weakly-Supervised Scene Graph Generation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Comprehensive Image Captioning via Scene Graph Decomposition.

Proceedings of Computer Vision - ECCV 2020, 2020
