Peizhao Zhang

Orcid: 0000-0001-7128-191X

According to our database, Peizhao Zhang authored at least 57 papers between 2010 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Non-Markov Multi-Round Conversational Image Generation with History-Conditioned MLLMs.

CoRR, January, 2026

Conversational Image Generation: Towards Multi-Round Personalized Generation with Multi-Modal Language Models.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025

Populate-A-Scene: Affordance-Aware Human Video Generation.

CoRR, July, 2025

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models.

CoRR, April, 2025

Movie Weaver: Tuning-Free Multi-Concept Video Personalization with Anchored Prompts.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

An Investigation on Hardware-Aware Vision Transformer Scaling.

ACM Trans. Embed. Comput. Syst., May, 2024

DirectorLLM for Human-Centric Video Generation.

CoRR, 2024

Imagine yourself: Tuning-Free Personalized Image Generation.

CoRR, 2024

An Analysis on Quantizing Diffusion Transformers.

CoRR, 2024

REFA: Real-time Egocentric Facial Animations for Virtual Reality.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cache Me if You Can: Accelerating Diffusion Models through Block Caching.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ControlRoom3D: Room Generation Using Semantic Proxy Rooms.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Efficient Quantization Strategies for Latent Diffusion Models.

CoRR, 2023

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack.

CoRR, 2023

Pruning Compact ConvNets for Efficient Inference.

CoRR, 2023

XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse.

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Token Merging: Your ViT But Faster.

Christoph Feichtenhofer

Judy Hoffman

Proceedings of the Eleventh International Conference on Learning Representations, 2023

DIME-FM: DIstilling Multimodal and Efficient Foundation Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile Telepresence.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference.

CoRR, 2022

3D-Aware Encoding for Style-based Neural Radiance Fields.

CoRR, 2022

UmeTrack: Unified multi-view end-to-end hand tracking for VR.

Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation.

Proceedings of the Tenth International Conference on Learning Representations, 2022

INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors.

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Hydra Attention: Efficient Attention with Many Heads.

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021

Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation.

CoRR, 2021

FBNetV5: Neural Architecture Search for Multiple Tasks in One Run.

CoRR, 2021

Unbiased Teacher for Semi-Supervised Object Detection.

Proceedings of the 9th International Conference on Learning Representations, 2021

Visual Transformers: Where Do Transformers Really Belong in Vision Models?

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

FP-NAS: Fast Probabilistic Neural Architecture Search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Low Bandwidth Video-Chat Compression Using Deep Generative Models.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Rethinking the Self-Attention in Vision Transformers.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Data-Efficient Language-Supervised Zero-Shot Learning With Self-Distillation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

One shot 3D photography.

ACM Trans. Graph., 2020

MEgATrack: monochrome egocentric articulated hand-tracking for virtual reality.

ACM Trans. Graph., 2020

Low Bandwidth Video-Chat Compression using Deep Generative Models.

CoRR, 2020

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge.

CoRR, 2020

Visual Transformers: Token-based Image Representation and Processing for Computer Vision.

CoRR, 2020

FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function.

CoRR, 2020

MLPerf Inference Benchmark.

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild.

Proceedings of the Computer Vision - ECCV 2020, 2020

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Machine Learning at Facebook: Understanding Inference at the Edge.

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search.

CoRR, 2018

2014

Leveraging depth cameras and wearable pressure sensors for full-body kinematics and dynamics capture.

ACM Trans. Graph., 2014

2012

Accurate realtime full-body motion capture using a single depth camera.

Xiaolin K. Wei

Peizhao Zhang

Jinxiang Chai

ACM Trans. Graph., 2012

2011

On Combining Fractional-Pixel Interpolation and Motion Estimation: A Cost-Effective Approach.

IEEE Trans. Circuits Syst. Video Technol., 2011

2010

Inter-mode decision with varied computational complexity.

Proceedings of the Visual Communications and Image Processing 2010, 2010

A split and merge algorithm for inter-mode decision in extended macroblocks.

Jiyuan Lu

Peizhao Zhang

Hongyang Chao

Proceedings of the International Conference on Image Processing, 2010

An Integrated Algorithm for Fractional Pixel Interpolation and Motion Estimation of H.264.

Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010
