Yu Liu

Orcid: 0000-0001-8071-3745

Affiliations:
  • Alibaba Group, Machine Intelligence Technology Lab


According to our database1, Yu Liu authored at least 51 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
AnyDoor: Zero-Shot Image Customization With Region-to-Region Reference.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models.
CoRR, June, 2025

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning.
CoRR, June, 2025

Instability in Diffusion ODEs: An Explanation for Inaccurate Image Reconstruction.
CoRR, June, 2025

AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model.
CoRR, June, 2025

ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing.
CoRR, March, 2025

VACE: All-in-One Video Creation and Editing.
CoRR, March, 2025

DiffDoctor: Diagnosing Image Diffusion Models Before Treating.
CoRR, January, 2025

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization.
CoRR, January, 2025

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling.
CoRR, January, 2025

ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Improved Video VAE for Latent Video Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MangaNinja: Line Art Colorization with Precise Reference Following.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers.
CoRR, 2024

IDEA-Bench: How Far are Generative Models from Professional Designing?
CoRR, 2024

The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control.
CoRR, 2024

In-Context LoRA for Diffusion Transformers.
CoRR, 2024

Group Diffusion Transformers are Unsupervised Multitask Learners.
CoRR, 2024

ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer.
CoRR, 2024

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations.
CoRR, 2024

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior.
CoRR, 2024

Zero-shot Image Editing with Reference Imitation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lipschitz Singularities in Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

DreamClean: Restoring Clean Image Using Deep Diffusion Prior.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Exploring Guided Sampling of Conditional GANs.
Proceedings of the Computer Vision - ECCV 2024, 2024

LivePhoto: Real Image Animation with Text-Guided Motion Control.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Check, Locate, Rectify: A Training-Free Layout Calibration System for Text- to- Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AnyDoor: Zero-shot Object-level Image Customization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dream Video: Composing Your Dream Videos with Customized Subject and Motion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
VideoLCM: Video Latent Consistency Model.
CoRR, 2023

CCM: Adding Conditional Controls to Text-to-Image Consistency Models.
CoRR, 2023

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion.
CoRR, 2023

Eliminating Lipschitz Singularities in Diffusion Models.
CoRR, 2023

Cones 2: Customizable Image Synthesis with Multiple Subjects.
CoRR, 2023

Customizable Image Synthesis with Multiple Subjects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cones: Concept Neurons in Diffusion Models for Customized Generation.
Proceedings of the International Conference on Machine Learning, 2023

Composer: Creative and Controllable Image Synthesis with Composable Conditions.
Proceedings of the International Conference on Machine Learning, 2023

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dimensionality-Varying Diffusion Process.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Animating Images to Transfer CLIP for Video-Text Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints.
Proceedings of the Computer Vision - ECCV 2022, 2022

DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models.
Proceedings of the 6th International Conference on Computer Science and Artificial Intelligence, 2022

A Trend-Driven Fashion Design System for Rapid Response Marketing in E-commerce.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Once and for All: Self-supervised Multi-modal Co-training on One-billion Videos at Alibaba.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Communication Efficient SGD via Gradient Sampling With Bayes Prior.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Self-Supervised Video Representation Learning by Context and Motion Decoupling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021


  Loading...