Xuansong Xie

ORCID: 0000-0002-3671-799X

According to our database, Xuansong Xie authored at least 87 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models.
CoRR, 2024

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception.
CoRR, 2024

DivAvatar: Diverse 3D Avatar Generation with a Single Prompt.
CoRR, 2024

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope.
CoRR, 2024

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data.
CoRR, 2024

ChromaFusionNet (CFNet): Natural Fusion of Fine-Grained Color Editing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Improving Diffusion-Based Image Restoration with Error Contraction and Error Correction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Improving Nighttime Driving-Scene Segmentation via Dual Image-Adaptive Learnable Filters.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Tracking with Human-Intent Reasoning.
CoRR, 2023

DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors.
CoRR, 2023

DreaMoving: A Human Video Generation Framework based on Diffusion Models.
CoRR, 2023

Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning.
CoRR, 2023

Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models.
CoRR, 2023

FMViT: A multiple-frequency mixing Vision Transformer.
CoRR, 2023

AnyText: Multilingual Visual Text Generation And Editing.
CoRR, 2023

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models.
CoRR, 2023

Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation.
CoRR, 2023

Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization.
CoRR, 2023

FaceChain: A Playground for Identity-Preserving Portrait Generation.
CoRR, 2023

PGformer: Proxy-Bridged Game Transformer for Multi-Person Extremely Interactive Motion Prediction.
CoRR, 2023

Overcoming Topology Agnosticism: Enhancing Skeleton-Based Action Recognition through Redefined Skeletal Topology Awareness.
CoRR, 2023

Synthesizing Realistic Image Restoration Training Pairs: A Diffusion Approach.
CoRR, 2023

Semi-supervised Deep Multi-view Stereo.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

CostFormer: Cost Transformer for Cost Aggregation in Multi-view Stereo.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

HDFormer: High-order Directed Transformer for 3D Human Pose Estimation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

DamoFD: Digging into Backbone Design on Face Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boosting Novel Category Discovery Over Domains with Soft Contrastive Learning and All in One Classifier.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PointDC: Unsupervised Semantic Segmentation of 3D Point Clouds via Cross-modal Distillation and Super-Voxel Clustering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LongShortNet: Exploring Temporal and Semantic Features Fusion in Streaming Perception.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

ProContext: Exploring Progressive Context Transformer for Tracking.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Optimal Proposal Learning for Deployable End-to-End Pedestrian Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Training and Inference of Face Recognition Models via Random Temperature Scaling.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
DCT-Net: Domain-Calibrated Translation for Portrait Stylization.
ACM Trans. Graph., 2022

GMLight: Lighting Estimation via Geometric Distribution Approximation.
IEEE Trans. Image Process., 2022

DDColor: Towards Photo-Realistic and Semantic-Aware Image Colorization via Dual Decoders.
CoRR, 2022

Improving Training and Inference of Face Recognition Models via Random Temperature Scaling.
CoRR, 2022

Hypergraph Transformer for Skeleton-based Action Recognition.
CoRR, 2022

Improving Nighttime Driving-Scene Segmentation via Dual Image-adaptive Learnable Filters.
CoRR, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
CoRR, 2022

Towards Counterfactual Image Manipulation via CLIP.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Structure-Aware Flow Generation for Human Body Reshaping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unpaired Cartoon Image Synthesis via Gated Cycle Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Active Boundary Loss for Semantic Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
DecorIn: An Automatic Method for Plane-Based Decorating.
IEEE Trans. Vis. Comput. Graph., 2021

Noise-Resistant Deep Metric Learning with Probabilistic Instance Filtering.
CoRR, 2021

Attention-guided Temporal Coherent Video Object Matting.
CoRR, 2021

GMLight: Lighting Estimation via Geometric Distribution Approximation.
CoRR, 2021

Active Boundary Loss for Semantic Segmentation.
CoRR, 2021

Attention-guided Temporally Coherent Video Object Matting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Diverse Image Inpainting with Bidirectional and Autoregressive Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Enhancing Viewing Experience of Generated Visual Storylines for Promotional Videos.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Sparse Needlets for Lighting Estimation with Spherical Transport Loss.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

WaveFill: A Wavelet-based Generation Network for Image Inpainting.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unbalanced Feature Transport for Exemplar-Based Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GAN Prior Embedded Network for Blind Face Restoration in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Noise-Resistant Deep Metric Learning With Ranking-Based Instance Selection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

EMLight: Lighting Estimation via Spherical Distribution Approximation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
An AI-empowered Visual Storyline Generator.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Region-Adaptive Texture Enhancement For Detailed Person Image Synthesis.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Boosting Semantic Human Matting With Coarse Annotations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Adversarial Image Composition with Auxiliary Illumination.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Generating Engaging Promotional Videos for E-commerce Platforms (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Domain Specific and Idiom Adaptive Video Summarization.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019

Personalized Video Summarization with Idiom Adaptation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Discriminative Coronary Artery Tracking via 3D CNN in Cardiac CT Angiography.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Learned Full-Sampling Reconstruction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Automated Segmentation Of Pulmonary Lobes Using Coordination-Guided Deep Neural Networks.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Volume R-CNN: Unified Framework for CT Object Detection and Instance Segmentation.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Attention-Aware Multi-Stroke Style Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Concept Detection based on Multi-label Classification and Image Captioning Approach - DAMO at ImageCLEF 2019.
Proceedings of the Working Notes of CLEF 2019, 2019

Generating Persuasive Visual Storylines for Promotional Videos.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

A Multi-Task Learning Framework for Extracting Bacteria Biotope Information.
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019

