Xiaodong Cun

Orcid: 0000-0003-3607-2236

According to our database1, Xiaodong Cun authored at least 48 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.
CoRR, 2024

Depth-aware Test-Time Training for Zero-shot Video Object Segmentation.
CoRR, 2024

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation.
CoRR, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models.
CoRR, 2024

Towards A Better Metric for Text-to-Video Generation.
CoRR, 2024

Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models.
CoRR, 2023

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators.
CoRR, 2023

MagicStick: Controllable Video Editing via Control Handle Transformations.
CoRR, 2023

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model.
CoRR, 2023

Sketch Video Synthesis.
CoRR, 2023

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation.
CoRR, 2023

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models.
CoRR, 2023

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models.
CoRR, 2023

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation.
CoRR, 2023

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.
CoRR, 2023

Explicit Visual Prompting for Universal Foreground Segmentations.
CoRR, 2023

TaleCrafter: Interactive Story Visualization with Multiple Characters.
CoRR, 2023

Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos.
CoRR, 2023

T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations.
CoRR, 2023

Interactive Story Visualization with Multiple Characters.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Inserting Anybody in Diffusion Models via Celeb Basis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ToonTalker: Cross-Domain Face Reenactment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Shadocnet: Learning Spatial-Aware Tokens in Transformer for Document Shadow Removal.
Proceedings of the IEEE International Conference on Acoustics, 2023

Generating Human Motion from Textual Descriptions with Discrete Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D GAN Inversion with Facial Symmetry Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DPE: Disentanglement of Pose and Expression for General Video Portrait Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Explicit Visual Prompting for Low-Level Structure Segmentations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Learning Enriched Illuminants for Cross and Single Sensor Color Constancy.
CoRR, 2022

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN.
Proceedings of the Computer Vision - ECCV 2022, 2022

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Uformer: A General U-Shaped Transformer for Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization.
CoRR, 2021

Uformer: A General U-Shaped Transformer for Image Restoration.
CoRR, 2021

Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improving the Harmony of the Composite Image by Spatial-Separated Attention Module.
IEEE Trans. Image Process., 2020

Defocus Blur Detection via Depth Distillation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Towards Ghost-Free Shadow Removal via Dual Hierarchical Aggregation Network and Shadow Matting GAN.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Depth-Assisted Full Resolution Network for Single Image-Based View Synthesis.
IEEE Computer Graphics and Applications, 2019

2018
Applying stochastic second-order entropy images to multi-modal image registration.
Signal Process. Image Commun., 2018

Image Splicing Localization via Semi-global Network and Fully Connected Conditional Random Fields.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018


  Loading...