Yuechen Zhang

Orcid: 0009-0000-9112-0216

According to our database¹, Yuechen Zhang authored at least 38 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

SwiftI2V: Efficient High-Resolution Image-to-Video Generation via Conditional Segment-wise Generation.

CoRR, May, 2026

Mini-Gemini: Mining the Potential of Multi-Modality Vision Language Models.

IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

Order Is Not Layout: Order-to-Space Bias in Image Generation.

CoRR, March, 2026

Utonia: Toward One Encoder for All Point Clouds.

CoRR, March, 2026

RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry.

CoRR, March, 2026

A visual Attention-Based model for VR sickness assessment.

Displays, 2026

2025

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation.

CoRR, December, 2025

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist.

CoRR, November, 2025

DreamOmni2: Multimodal Instruction-based Editing and Generation.

CoRR, October, 2025

DreamVE: Unified Instruction-based Image and Video Editing.

CoRR, August, 2025

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.

IEEE Trans. Vis. Comput. Graph., February, 2025

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers.

CoRR, January, 2025

MambaGuard: A CLIP-Mamba Approach for OOD Generated Image Detection.

Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Training-Free Efficient Video Generation via Dynamic Token Carving.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LYRA: An Efficient and Speech-Centric Framework for Omni-Cognition.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DreamOmni: Unified Image Generation and Editing.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

ControlNeXt: Powerful and Efficient Control for Image and Video Generation.

CoRR, 2024

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance.

CoRR, 2024

LDD-YOLO: An Improved Lightweight Detection Method for Steel Surface Defects Based on YOLOv8.

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

RTS-DETR: Efficient Real-Time DETR for Small Object Detection.

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

RFSD-YOLO: An Enhanced X-Ray Object Detection Model for Prohibited Items.

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

HSD-YOLO: A Lightweight and Accurate Method for PCB Defect Detection.

Proceedings of the International Joint Conference on Neural Networks, 2024

Objective Evaluation of VR Sickness and Analysis of Its Relationship with VR Presence.

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Prompt Highlighter: Interactive Control for Multi-Modal LLMs.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Video-P2P: Video Editing with Cross-Attention Control.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FSD-YOLO: An Improved Method for Steel Surface Defect Detection Based on YOLOv5.

Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

MF-YOLO: Multimodal Fusion for Remote Sensing Object Detection Based on YOLOv5s.

Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

Progressively Knowledge Distillation via Re-parameterizing Diffusion Reverse Process.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Real-World Image Variation by Aligning Diffusion Inversion Chain.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

BEW-YOLO: An Improved Method for PCB Defect Detection Based on YOLOv7.

Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields.

CoRR, 2022

PCL: Proxy-based Contrastive Learning for Domain Generalization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

High Quality Segmentation for Ultra High-resolution Images.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Flow-aware synthesis: A generic motion model for video frame interpolation.

Comput. Vis. Media, 2021
