Chenhui Gou

According to our database1, Chenhui Gou authored at least 12 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Emerging Properties in Unified Multimodal Pretraining.
CoRR, May, 2025

Seed1.5-VL Technical Report.
CoRR, May, 2025

Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model.
CoRR, March, 2025

DrVideo: Document Retrieval Based Long Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Evaluating and Advancing Multimodal Large Language Models in Ability Lens.
CoRR, 2024

EZIGen: Enhancing zero-shot subject-driven image generation with precise subject encoding and decoupled guidance.
CoRR, 2024

How Well Can Vision Language Models See Image Details?
CoRR, 2024

InfiniBench: A Comprehensive Benchmark for Large Multimodal Models in Very Long Video Understanding.
CoRR, 2024

Strong and Controllable Blind Image Decomposition.
CoRR, 2024

JRDB-PanoTrack: An Open-World Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2022
RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


  Loading...