Xin Tao

Orcid: 0000-0001-9126-4746

Affiliations:

Kuaishou Technology, Beijing, China

According to our database¹, Xin Tao authored at least 65 papers between 2014 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Terra: Explorable Native 3D World Model with Point Latents.

[BibT_eX]

[DOI]

CoRR, October, 2025

Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention.

[BibT_eX]

[DOI]

CoRR, October, 2025

Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance.

[BibT_eX]

[DOI]

CoRR, October, 2025

Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs.

[BibT_eX]

[DOI]

CoRR, September, 2025

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

[BibT_eX]

[DOI]

CoRR, September, 2025

Score Augmentation for Diffusion Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

DVIS++: Improved Decoupled Framework for Universal Video Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

Imbalance in Balance: Online Concept Balancing in Generation Models.

[BibT_eX]

[DOI]

CoRR, July, 2025

VMoBA: Mixture-of-Block Attention for Video Diffusion Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

Training-Free Efficient Video Generation via Dynamic Token Carving.

[BibT_eX]

[DOI]

CoRR, May, 2025

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption.

[BibT_eX]

[DOI]

CoRR, May, 2025

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation.

[BibT_eX]

[DOI]

CoRR, April, 2025

Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings.

[BibT_eX]

[DOI]

CoRR, March, 2025

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers.

[BibT_eX]

[DOI]

CoRR, March, 2025

MTV-Inpaint: Multi-Task Long Video Inpainting.

[BibT_eX]

[DOI]

CoRR, March, 2025

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification.

[BibT_eX]

[DOI]

CoRR, March, 2025

Stable Segment Anything Model.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Precise Scaling Laws for Video Diffusion Transformers.

[BibT_eX]

[DOI]

Victor Shea-Jay Huang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Owl-1: Omni World Model for Consistent Long Video Generation.

[BibT_eX]

[DOI]

CoRR, 2024

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs.

[BibT_eX]

[DOI]

CoRR, 2024

VideoTetris: Towards Compositional Text-to-Video Generation.

[BibT_eX]

[DOI]

CoRR, 2024

SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance.

[BibT_eX]

[DOI]

CoRR, 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge.

[BibT_eX]

[DOI]

S. Farhad Hosseini-Benvidi

Fengbin Guan

Ahmad Mahmoudi-Aznaveh

CoRR, 2024

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

Motion Inversion for Video Customization.

[BibT_eX]

[DOI]

CoRR, 2024

VideoTetris: Towards Compositional Text-to-Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Perception-Oriented Video Frame Interpolation via Asymmetric Blending.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge.

[BibT_eX]

[DOI]

S. Farhad Hosseini-Benvidi

Ahmad Mahmoudi-Aznaveh

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Stable Segment Anything Model.

[BibT_eX]

[DOI]

CoRR, 2023

1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Feature Decoupling-Recycling Network for Fast Interactive Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Scene-Generalizable Interactive Segmentation of Radiance Fields.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Compression-Aware Video Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Text-Guided Human Image Manipulation via Image-Text Shared Space.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions.

[BibT_eX]

[DOI]

CoRR, 2022

DeViT: Deformed Vision Transformers in Video Inpainting.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

NTIRE 2022 Image Inpainting Challenge: Report.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Image Multi-Inpainting via Progressive Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Particularity Beyond Commonality: Unpaired Identity Transfer with Multiple References.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

VCNet: A Robust Approach to Blind Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

MuCAN: Multi-correspondence Aggregation Network for Video Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Landmark Assisted CycleGAN for Cartoon Face Generation.

[BibT_eX]

[DOI]

CoRR, 2019

AIM 2019 Challenge on Image Extreme Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Pablo Navarrete Michelini

Wenbin Chen

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

AIM 2019 Challenge on Video Extreme Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Attribute-Driven Spontaneous Motion in Unpaired Image Translation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Wide-Context Semantic Image Extrapolation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Scale-recurrent Network for Deep Image Deblurring.

[BibT_eX]

[DOI]

CoRR, 2018

Image Inpainting via Generative Multi-column Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scale-Recurrent Network for Deep Image Deblurring.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Facelet-Bank for Fast Portrait Manipulation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Convolutional Neural Pyramid for Image Processing.

[BibT_eX]

[DOI]

CoRR, 2017

Zero-Order Reverse Filtering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Detail-Revealing Deep Video Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

2016

Regional foremost matching for internet scene images.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2016

Deep Automatic Portrait Matting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

2015

Break Ames room illusion: depth from general single images.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2015

Video Super-Resolution via Deep Draft-Ensemble Learning.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Handling motion blur in multi-frame super-resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Inverse Kernels for Fast Spatial Deconvolution.

[BibT_eX]

[DOI]

Li Xu

Xin Tao

Jiaya Jia

Proceedings of the Computer Vision - ECCV 2014, 2014

Xin Tao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...