Xianfang Zeng

Orcid: 0000-0003-1251-2129

According to our database, Xianfang Zeng authored at least 34 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale.
CoRR, August, 2025

SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning.
CoRR, August, 2025

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation.
CoRR, June, 2025

Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers.
CoRR, June, 2025

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization.
CoRR, May, 2025

DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds.
CoRR, May, 2025

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models.
CoRR, May, 2025

Step1X-Edit: A Practical Framework for General Image Editing.
CoRR, April, 2025

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians.
CoRR, April, 2025

OmniSVG: A Unified Scalable Vector Graphics Generation Model.
CoRR, April, 2025

Adding Before Pruning: Sparse Filter Fusion for Deep Convolutional Neural Networks via Auxiliary Attention.
IEEE Trans. Neural Networks Learn. Syst., March, 2025

FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding.
CoRR, March, 2025

Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model.
CoRR, March, 2025

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model.
CoRR, February, 2025

MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent.
CoRR, February, 2025

Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
MikuDance: Animating Character Art with Mixed Motion Dynamics.
CoRR, 2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models.
CoRR, 2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Paint3D: Paint Anything 3D With Lighting-Less Texture Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Real-Time Audio-Guided Multi-Face Reenactment.
IEEE Signal Process. Lett., 2022

Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
A Learning Framework for n-Bit Quantized Neural Networks Toward FPGAs.
IEEE Trans. Neural Networks Learn. Syst., 2021

Deep Superpixel Convolutional Network for Image Recognition.
IEEE Signal Process. Lett., 2021

Pruning by Training: A Novel Deep Neural Network Compression Framework for Image Processing.
IEEE Signal Process. Lett., 2021

Unpaired salient object translation via spatial attention prior.
Neurocomputing, 2021

SelFSR: Self-Conditioned Face Super-Resolution in the Wild via Flow Field Degradation Network.
CoRR, 2021

2020
APB2FaceV2: Real-Time Audio-Guided Multi-Face Reenactment.
CoRR, 2020

Semantic Graph Based Place Recognition for 3D Point Clouds.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

FReeNet: Multi-Identity Face Reenactment.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
FaceSwapNet: Landmark Guided Many-to-Many Face Reenactment.
CoRR, 2019

