Xuefeng Xiao

Orcid: 0009-0009-8258-1243

Affiliations:
  • ByteDance Inc., Beijing, China
  • South China University of Technology, Guangzhou, China (former)


According to our database1, Xuefeng Xiao authored at least 59 papers between 2017 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis.
CoRR, July, 2025

VmambaIR: Visual State Space Model for Image Restoration.
IEEE Trans. Circuits Syst. Video Technol., June, 2025

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models.
CoRR, June, 2025

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation.
CoRR, June, 2025

Seedance 1.0: Exploring the Boundaries of Video Generation Models.
CoRR, June, 2025

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training.
CoRR, June, 2025

SeedEdit 3.0: Fast and High-Quality Generative Image Editing.
CoRR, June, 2025

Seedream 3.0 Technical Report.
CoRR, April, 2025

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model.
CoRR, April, 2025

OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training.
CoRR, March, 2025

Training-free Diffusion Acceleration with Bottleneck Sampling.
CoRR, March, 2025

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model.
CoRR, March, 2025

Training-free and Adaptive Sparse Attention for Efficient Long Video Generation.
CoRR, February, 2025

Diffusion Adversarial Post-Training for One-Step Video Generation.
CoRR, January, 2025

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FlexSP: Accelerating Large Language Model Training via Flexible Sequence Parallelism.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization.
CoRR, 2024

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM.
CoRR, 2024

Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training.
CoRR, 2024

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning.
CoRR, 2024

UniFL: Improve Stable Diffusion via Unified Feedback Learning.
CoRR, 2024

ByteEdit: Boost, Comply and Accelerate Generative Image Editing.
CoRR, 2024

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models.
CoRR, 2024

DiffusionGPT: LLM-Driven Text-to-Image Generation System.
CoRR, 2024

UniFL: Improve Latent Diffusion Model via Unified Feedback Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

TreeReward: Improve Diffusion Model via Tree-Structured Feedback Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Outlier-aware Slicing for Post-Training Quantization in Vision Transformer.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AffineQuant: Affine Transformation Quantization for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ByteEdit: Boost, Comply and Accelerate Generative Image Editing.
Proceedings of the Computer Vision - ECCV 2024, 2024

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection.
CoRR, 2023

DLIP: Distilling Language-Image Pre-training.
CoRR, 2023

Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models.
CoRR, 2023

UGC: Unified GAN Compression for Efficient Image-to-Image Translation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multi-Objective Evolutionary for Object Detection Mobile Architectures Search.
CoRR, 2022

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios.
CoRR, 2022

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation.
CoRR, 2022

TRT-ViT: TensorRT-oriented Vision Transformer.
CoRR, 2022

SepViT: Separable Vision Transformer.
CoRR, 2022

Progressive Automatic Design of Search Space for One-Shot Neural Architecture Search.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

ScalableViT: Rethinking the Context-Oriented Generalization of Vision Transformer.
Proceedings of the Computer Vision, 2022

Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report.
CoRR, 2021

Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Online Multi-Granularity Distillation for GAN Compression.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Box-Level Tube Tracking and Refinement for Vehicles Anomaly Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2019
An Empirical Study of Propagation-based Methods for Video Object Segmentation.
CoRR, 2019

2018
Accelerating and Compressing LSTM Based Model for Online Handwritten Chinese Character Recognition.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

2017
Building fast and compact convolutional neural networks for offline handwritten Chinese character recognition.
Pattern Recognit., 2017

Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

A Comprehensive Analysis of Misclassified Handwritten Chinese Character Samples by Incorporating Human Recognition.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017


  Loading...