Xuansong Xie

Orcid: 0000-0002-3671-799X

According to our database¹, Xuansong Xie authored at least 105 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

DivAvatar: Diverse 3D Avatar Generation with a Single Prompt.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis.

[BibT_eX]

[DOI]

Alexander G. Hauptmann

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Domain Adaptation Transformer for Unsupervised Driving-Scene Segmentation in Adverse Conditions.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., December, 2024

RobustMVS: Single Domain Generalized Deep Multi-View Stereo.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., October, 2024

MorphNeRF: Text-Guided 3D-Aware Editing via Morphing Generative Neural Radiance Fields.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing.

[BibT_eX]

[DOI]

CoRR, 2024

Strictly-ID-Preserved and Controllable Accessory Advertising Image Generation.

[BibT_eX]

[DOI]

CoRR, 2024

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2024

DivAvatar: Diverse 3D Avatar Generation with a Single Prompt.

[BibT_eX]

[DOI]

CoRR, 2024

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope.

[BibT_eX]

[DOI]

CoRR, 2024

MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

AnyText: Multilingual Visual Text Generation and Editing.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

DreamView: Injecting View-Specific Text Guidance Into Text-to-3D Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

ShoeModel: Learning to Wear on the User-Specified Shoes via Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

FMViT: A Multiple-Frequency Mixing Vision Transformer.

[BibT_eX]

[DOI]

Wei Tan

Yifeng Geng

Xuansong Xie

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

3DToonify: Creating Your High-Fidelity 3D Stylized Avatar Easily from 2D Portrait Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multi-Modal Instruction Tuned LLMs with Fine-Grained Visual Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ChromaFusionNet (CFNet): Natural Fusion of Fine-Grained Color Editing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Improving Diffusion-Based Image Restoration with Error Contraction and Error Correction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Improving Nighttime Driving-Scene Segmentation via Dual Image-Adaptive Learnable Filters.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., October, 2023

Tracking with Human-Intent Reasoning.

[BibT_eX]

[DOI]

CoRR, 2023

DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors.

[BibT_eX]

[DOI]

CoRR, 2023

DreaMoving: A Human Video Generation Framework based on Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization.

[BibT_eX]

[DOI]

CoRR, 2023

FaceChain: A Playground for Identity-Preserving Portrait Generation.

[BibT_eX]

[DOI]

CoRR, 2023

PGformer: Proxy-Bridged Game Transformer for Multi-Person Extremely Interactive Motion Prediction.

[BibT_eX]

[DOI]

CoRR, 2023

Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness.

[BibT_eX]

[DOI]

CoRR, 2023

Synthesizing Realistic Image Restoration Training Pairs: A Diffusion Approach.

[BibT_eX]

[DOI]

CoRR, 2023

Semi-supervised Deep Multi-view Stereo.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

CostFormer: Cost Transformer for Cost Aggregation in Multi-view Stereo.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

HDFormer: High-order Directed Transformer for 3D Human Pose Estimation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

DamoFD: Digging into Backbone Design on Face Detection.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boosting Novel Category Discovery Over Domains with Soft Contrastive Learning and All in One Classifier.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PointDC: Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Longshortnet: Exploring Temporal and Semantic Features Fusion In Streaming Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Procontext: Exploring Progressive Context Transformer for Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Optimal Proposal Learning for Deployable End-to-End Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NTIRE 2023 Video Colorization Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Training and Inference of Face Recognition Models via Random Temperature Scaling.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

DCT-net: domain-calibrated translation for portrait stylization.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2022

GMLight: Lighting Estimation via Geometric Distribution Approximation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

DDColor: Towards Photo-Realistic and Semantic-Aware Image Colorization via Dual Decoders.

[BibT_eX]

[DOI]

CoRR, 2022

Improving Training and Inference of Face Recognition Models via Random Temperature Scaling.

[BibT_eX]

[DOI]

CoRR, 2022

Hypergraph Transformer for Skeleton-based Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters.

[BibT_eX]

[DOI]

CoRR, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

et al.

CoRR, 2022

Towards Counterfactual Image Manipulation via CLIP.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Structure-Aware Flow Generation for Human Body Reshaping.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unpaired Cartoon Image Synthesis via Gated Cycle Mapping.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Active Boundary Loss for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

DecorIn: An Automatic Method for Plane-Based Decorating.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2021

Noise-Resistant Deep Metric Learning with Probabilistic Instance Filtering.

[BibT_eX]

[DOI]

CoRR, 2021

Attention-guided Temporal Coherent Video Object Matting.

[BibT_eX]

[DOI]

CoRR, 2021

Noise-resistant Deep Metric Learning with Ranking-based Instance Selection.

[BibT_eX]

[DOI]

CoRR, 2021

GMLight: Lighting Estimation via Geometric Distribution Approximation.

[BibT_eX]

[DOI]

CoRR, 2021

Active Boundary Loss for Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2021

Attention-guided Temporally Coherent Video Object Matting.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Diverse Image Inpainting with Bidirectional and Autoregressive Transformers.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Enhancing Viewing Experience of Generated Visual Storylines for Promotional Videos.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Sparse Needlets for Lighting Estimation with Spherical Transport Loss.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

WaveFill: A Wavelet-based Generation Network for Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unbalanced Feature Transport for Exemplar-Based Image Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GAN Prior Embedded Network for Blind Face Restoration in the Wild.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Noise-Resistant Deep Metric Learning With Ranking-Based Instance Selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

EMLight: Lighting Estimation via Spherical Distribution Approximation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

An AI-empowered Visual Storyline Generator.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Region-Adaptive Texture Enhancement For Detailed Person Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Boosting Semantic Human Matting With Coarse Annotations.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Adversarial Image Composition with Auxiliary Illumination.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Generating Engaging Promotional Videos for E-commerce Platforms (Student Abstract).

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Generating Persuasive Visual Storylines for Promotional Videos.

[BibT_eX]

[DOI]

CoRR, 2019

Domain Specific and Idiom Adaptive Video Summarization.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Personalized Video Summarization with Idiom Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Discriminative Coronary Artery Tracking via 3D CNN in Cardiac CT Angiography.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Learned Full-Sampling Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Automated Segmentation Of Pulmonary Lobes Using Coordination-Guided Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Volume R-CNN: Unified Framework for CT Object Detection and Instance Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Attention-Aware Multi-Stroke Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Concept Detection based on Multi-label Classification and Image Captioning Approach - DAMO at ImageCLEF 2019.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2019, 2019

Generating Persuasive Visual Storylines for Promotional Videos.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

A Multi-Task Learning Framework for Extracting Bacteria Biotope Information.

[BibT_eX]

[DOI]

Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019

Xuansong Xie

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...