Cong Wei

Orcid: 0009-0006-0835-784X

Affiliations:
  • University of Waterloo, Canada


According to our database1, Cong Wei authored at least 15 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MoCha: Towards Movie-Grade Talking Character Synthesis.
CoRR, March, 2025

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers.
CoRR, March, 2025

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation.
Trans. Mach. Learn. Res., 2024

AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks.
Trans. Mach. Learn. Res., 2024

Mantis: Interleaved Multi-Image Instruction Tuning.
Trans. Mach. Learn. Res., 2024

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision.
CoRR, 2024

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks.
CoRR, 2024

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation.
CoRR, 2024

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers.
Proceedings of the Computer Vision - ECCV 2024, 2024

MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
DreamEdit: Subject-driven Image Editing.
Trans. Mach. Learn. Res., 2023

Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


  Loading...