Zhucun Xue

Orcid: 0009-0005-2627-4397

According to our database, Zhucun Xue authored at least 45 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark.

Int. J. Comput. Vis., May, 2026

PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset.

CoRR, May, 2026

Advancing Narrative Long Video Generation via Training-Free Identity-Aware Memory.

CoRR, May, 2026

SPIKE: An Adaptive Dual Controller Framework for Cost-Efficient Long-Horizon Game Agents.

CoRR, May, 2026

Evolution of Optimization Methods: Algorithms, Scenarios, and Evaluations.

CoRR, April, 2026

Evolution of Video Generative Foundations.

CoRR, April, 2026

UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy.

CoRR, March, 2026

Large-Scale Multidimensional Knowledge Profiling of Scientific Literature.

CoRR, January, 2026

M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding.

CoRR, January, 2026

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation.

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

LLM-Oriented Token-Adaptive Knowledge Distillation.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation.

CoRR, December, 2025

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10×.

CoRR, December, 2025

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing.

CoRR, December, 2025

ImitDiff: Transferring Foundation-Model Priors for Distraction-Robust Visuomotor Policy.

IEEE Robotics Autom. Lett., November, 2025

EMOv2: Pushing 5M Vision Model Frontier.

IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

InstanceV: Instance-Level Video Generation.

CoRR, November, 2025

IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment.

CoRR, October, 2025

LLM-Oriented Token-Adaptive Knowledge Distillation.

CoRR, October, 2025

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow.

CoRR, September, 2025

Semantic Frame Interpolation.

CoRR, July, 2025

HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding.

CoRR, July, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions.

CoRR, June, 2025

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding.

CoRR, June, 2025

Image Inversion: A Survey from GANs to Diffusion and Beyond.

CoRR, February, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Multi-Modal Retrieval Augmented Visual Understanding and Generation.

Zhucun Xue

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

A Comprehensive Library for Benchmarking Multi-Class Visual Anomaly Detection.

Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

EMOv2: Pushing 5M Vision Model Frontier.

CoRR, 2024

MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning.

CoRR, 2024

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection.

CoRR, 2024

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark.

CoRR, 2024

2023

Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection.

CoRR, 2023

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation.

CoRR, 2023

Rethinking Mobile Block for Efficient Neural Models.

CoRR, 2023

Rethinking Mobile Block for Efficient Attention-based Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2020

Fisheye Distortion Rectification from Deep Straight Lines.

Zhucun Xue

Nan Xue

Gui-Song Xia

CoRR, 2020

Texture Mixing by Interpolating Deep Statistics via Gaussian Models.

Zhucun Xue

Ziming Wang

IEEE Access, 2020

APB2FACE: Audio-Guided Face Reenactment with Auxiliary Pose and Blink Signals.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Learning to Calibrate Straight Lines for Fisheye Image Rectification.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

A survey on vision-based UAV navigation.

Geo spatial Inf. Sci., 2018
