Yuechen Zhang

ORCID: 0009-0000-9112-0216

According to our database, Yuechen Zhang authored at least 38 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
SwiftI2V: Efficient High-Resolution Image-to-Video Generation via Conditional Segment-wise Generation.
CoRR, May, 2026

Mini-Gemini: Mining the Potential of Multi-Modality Vision Language Models.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2026

Order Is Not Layout: Order-to-Space Bias in Image Generation.
CoRR, March, 2026

Utonia: Toward One Encoder for All Point Clouds.
CoRR, March, 2026

RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry.
CoRR, March, 2026

A visual Attention-Based model for VR sickness assessment.
Displays, 2026

2025
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation.
CoRR, December, 2025

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist.
CoRR, November, 2025

DreamOmni2: Multimodal Instruction-based Editing and Generation.
CoRR, October, 2025

DreamVE: Unified Instruction-based Image and Video Editing.
CoRR, August, 2025

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.
IEEE Trans. Vis. Comput. Graph., February, 2025

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers.
CoRR, January, 2025

MambaGuard: A CLIP-Mamba Approach for OOD Generated Image Detection.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Training-Free Efficient Video Generation via Dynamic Token Carving.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LYRA: An Efficient and Speech-Centric Framework for Omni-Cognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DreamOmni: Unified Image Generation and Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
ControlNeXt: Powerful and Efficient Control for Image and Video Generation.
CoRR, 2024

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance.
CoRR, 2024

LDD-YOLO: An Improved Lightweight Detection Method for Steel Surface Defects Based on YOLOv8.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

RTS-DETR: Efficient Real-Time DETR for Small Object Detection.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

RFSD-YOLO: An Enhanced X-Ray Object Detection Model for Prohibited Items.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

HSD-YOLO: A Lightweight and Accurate Method for PCB Defect Detection.
Proceedings of the International Joint Conference on Neural Networks, 2024

Objective Evaluation of VR Sickness and Analysis of Its Relationship with VR Presence.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Prompt Highlighter: Interactive Control for Multi-Modal LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Video-P2P: Video Editing with Cross-Attention Control.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FSD-YOLO: An Improved Method for Steel Surface Defect Detection Based on YOLOv5.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

MF-YOLO: Multimodal Fusion for Remote Sensing Object Detection Based on YOLOv5s.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

Progressively Knowledge Distillation via Re-parameterizing Diffusion Reverse Process.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Real-World Image Variation by Aligning Diffusion Inversion Chain.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

BEW-YOLO: An Improved Method for PCB Defect Detection Based on YOLOv7.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields.
CoRR, 2022

PCL: Proxy-based Contrastive Learning for Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

High Quality Segmentation for Ultra High-resolution Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Flow-aware synthesis: A generic motion model for video frame interpolation.
Comput. Vis. Media, 2021

