Zijian Zhou

Orcid: 0000-0003-3315-3962

Affiliations:
  • Meta AI
  • King's College London, UK


According to our database, Zijian Zhou authored at least 25 papers between 2019 and 2026.

Bibliography

2026
Neural Computers.
CoRR, April, 2026

TransText: Alpha-as-RGB Representation for Transparent Text Animation.
CoRR, March, 2026

VecGlypher: Unified Vector Glyph Generation with Language Models.
CoRR, February, 2026

DC-RRG: Diagnosis-centered cascaded radiology report generation.
Expert Syst. Appl., 2026

2025
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming.
CoRR, December, 2025

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory.
CoRR, December, 2025

Scaling Zero-Shot Reference-to-Video Generation.
CoRR, December, 2025

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models.
CoRR, December, 2025

VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation.
Int. J. Comput. Vis., November, 2025

Mixture of States: Routing Token-Level Dynamics for Multimodal Generation.
CoRR, November, 2025

Scaling Sequence-to-Sequence Generative Neural Rendering.
CoRR, October, 2025

MarDini: Masked Auto-regressive Diffusion for Video Generation at Scale.
Trans. Mach. Learn. Res., 2025

Learning Flow Fields in Attention for Controllable Person Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation.
CoRR, 2024

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale.
CoRR, 2024

CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery.
CoRR, 2024

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning.
CoRR, 2024

OpenPSG: Open-Set Panoptic Scene Graph Generation via Large Multimodal Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation.
CoRR, 2023

Text Promptable Surgical Instrument Segmentation with Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2020
Gaussian Vector: An Efficient Solution for Facial Landmark Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
A New Parallel Detection-Recognition Approach for End-to-End Scene Text Extraction.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

