Xin Tao

Orcid: 0000-0001-9126-4746

Affiliations:
  • Kuaishou Technology, Beijing, China


According to our database, Xin Tao authored at least 60 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Score Augmentation for Diffusion Models.
CoRR, August, 2025

DVIS++: Improved Decoupled Framework for Universal Video Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

Imbalance in Balance: Online Concept Balancing in Generation Models.
CoRR, July, 2025

VMoBA: Mixture-of-Block Attention for Video Diffusion Models.
CoRR, June, 2025

Training-Free Efficient Video Generation via Dynamic Token Carving.
CoRR, May, 2025

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption.
CoRR, May, 2025

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation.
CoRR, April, 2025

Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings.
CoRR, March, 2025

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers.
CoRR, March, 2025

MTV-Inpaint: Multi-Task Long Video Inpainting.
CoRR, March, 2025

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification.
CoRR, March, 2025

Stable Segment Anything Model.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Precise Scaling Laws for Video Diffusion Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Owl-1: Omni World Model for Consistent Long Video Generation.
CoRR, 2024

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs.
CoRR, 2024

VideoTetris: Towards Compositional Text-to-Video Generation.
CoRR, 2024

SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance.
CoRR, 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge.
CoRR, 2024

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark.
CoRR, 2024

Motion Inversion for Video Customization.
CoRR, 2024

VideoTetris: Towards Compositional Text-to-Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Perception-Oriented Video Frame Interpolation via Asymmetric Blending.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Stable Segment Anything Model.
CoRR, 2023

1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation.
CoRR, 2023

1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation.
CoRR, 2023

Feature Decoupling-Recycling Network for Fast Interactive Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Scene-Generalizable Interactive Segmentation of Radiance Fields.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Compression-Aware Video Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Text-Guided Human Image Manipulation via Image-Text Shared Space.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions.
CoRR, 2022

DeViT: Deformed Vision Transformers in Video Inpainting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Image Multi-Inpainting via Progressive Generative Adversarial Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Particularity Beyond Commonality: Unpaired Identity Transfer with Multiple References.
Proceedings of the Computer Vision - ECCV 2020, 2020

VCNet: A Robust Approach to Blind Image Inpainting.
Proceedings of the Computer Vision - ECCV 2020, 2020

MuCAN: Multi-correspondence Aggregation Network for Video Super-Resolution.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Landmark Assisted CycleGAN for Cartoon Face Generation.
CoRR, 2019


AIM 2019 Challenge on Video Extreme Super-Resolution: Methods and Results.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Attribute-Driven Spontaneous Motion in Unpaired Image Translation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Wide-Context Semantic Image Extrapolation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Scale-recurrent Network for Deep Image Deblurring.
CoRR, 2018

Image Inpainting via Generative Multi-column Convolutional Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scale-Recurrent Network for Deep Image Deblurring.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Facelet-Bank for Fast Portrait Manipulation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Convolutional Neural Pyramid for Image Processing.
CoRR, 2017

Zero-Order Reverse Filtering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Detail-Revealing Deep Video Super-Resolution.
Proceedings of the IEEE International Conference on Computer Vision, 2017

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Regional foremost matching for internet scene images.
ACM Trans. Graph., 2016

Deep Automatic Portrait Matting.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Break Ames room illusion: depth from general single images.
ACM Trans. Graph., 2015

Video Super-Resolution via Deep Draft-Ensemble Learning.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Handling motion blur in multi-frame super-resolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Inverse Kernels for Fast Spatial Deconvolution.
Proceedings of the Computer Vision - ECCV 2014, 2014

