Zhucun Xue

Orcid: 0009-0005-2627-4397

According to our database, Zhucun Xue authored at least 45 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark.
Int. J. Comput. Vis., May, 2026

PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset.
CoRR, May, 2026

Advancing Narrative Long Video Generation via Training-Free Identity-Aware Memory.
CoRR, May, 2026

SPIKE: An Adaptive Dual Controller Framework for Cost-Efficient Long-Horizon Game Agents.
CoRR, May, 2026

Evolution of Optimization Methods: Algorithms, Scenarios, and Evaluations.
CoRR, April, 2026

Evolution of Video Generative Foundations.
CoRR, April, 2026

UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy.
CoRR, March, 2026

Large-Scale Multidimensional Knowledge Profiling of Scientific Literature.
CoRR, January, 2026

M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding.
CoRR, January, 2026

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

LLM-Oriented Token-Adaptive Knowledge Distillation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
UltraLBM-UNet: Ultralight Bidirectional Mamba-based Model for Skin Lesion Segmentation.
CoRR, December, 2025

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10×.
CoRR, December, 2025

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing.
CoRR, December, 2025

ImitDiff: Transferring Foundation-Model Priors for Distraction-Robust Visuomotor Policy.
IEEE Robotics Autom. Lett., November, 2025

EMOv2: Pushing 5M Vision Model Frontier.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

InstanceV: Instance-Level Video Generation.
CoRR, November, 2025

IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment.
CoRR, October, 2025

LLM-Oriented Token-Adaptive Knowledge Distillation.
CoRR, October, 2025

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow.
CoRR, September, 2025

Semantic Frame Interpolation.
CoRR, July, 2025

HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding.
CoRR, July, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions.
CoRR, June, 2025

AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding.
CoRR, June, 2025

Image Inversion: A Survey from GANs to Diffusion and Beyond.
CoRR, February, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Multi-Modal Retrieval Augmented Visual Understanding and Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

A Comprehensive Library for Benchmarking Multi-Class Visual Anomaly Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
EMOv2: Pushing 5M Vision Model Frontier.
CoRR, 2024

MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning.
CoRR, 2024

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection.
CoRR, 2024

Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark.
CoRR, 2024

2023
Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection.
CoRR, 2023

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation.
CoRR, 2023

Rethinking Mobile Block for Efficient Neural Models.
CoRR, 2023

Rethinking Mobile Block for Efficient Attention-based Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2020
Fisheye Distortion Rectification from Deep Straight Lines.
CoRR, 2020

Texture Mixing by Interpolating Deep Statistics via Gaussian Models.
IEEE Access, 2020

APB2FACE: Audio-Guided Face Reenactment with Auxiliary Pose and Blink Signals.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Learning to Calibrate Straight Lines for Fisheye Image Rectification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A survey on vision-based UAV navigation.
Geo spatial Inf. Sci., 2018

