Size Wu

According to our database, Size Wu authored at least 17 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Knowledge Visualization: A Benchmark and Method for Knowledge-Intensive Text-to-Image Generation.
CoRR, April, 2026

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing.
CoRR, February, 2026

Skywork UniPic 3.0: Unified Multi-Image Composition via Sequence Modeling.
CoRR, January, 2026

2025
RecTok: Reconstruction Distillation along Rectified Flow.
CoRR, December, 2025

Generative Photographic Control for Scene-Consistent Video Cinematic Editing.
CoRR, November, 2025

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation.
CoRR, October, 2025

Controllable Human-centric Keyframe Interpolation with Generative Prior.
CoRR, June, 2025

DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation.
CoRR, May, 2025

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

F-LMM: Grounding Frozen Large Multimodal Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

OMG-Seg: Is One Model Good Enough for all Segmentation?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CLIM: Contrastive Language-Image Mosaic for Region Representation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection.
CoRR, 2023

Aligning Bag of Regions for Open-Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

