Peizhao Zhang

According to our database1, Peizhao Zhang authored at least 46 papers between 2010 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis.
CoRR, 2023

Efficient Quantization Strategies for Latent Diffusion Models.
CoRR, 2023

ControlRoom3D: Room Generation using Semantic Proxy Rooms.
CoRR, 2023

Cache Me if You Can: Accelerating Diffusion Models through Block Caching.
CoRR, 2023

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack.
CoRR, 2023

Pruning Compact ConvNets for Efficient Inference.
CoRR, 2023

Token Merging: Your ViT But Faster.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DIME-FM : DIstilling Multimodal and Efficient Foundation Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile Telepresence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference.
CoRR, 2022

XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse.
CoRR, 2022

3D-Aware Encoding for Style-based Neural Radiance Fields.
CoRR, 2022

UmeTrack: Unified multi-view end-to-end hand tracking for VR.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Hydra Attention: Efficient Attention with Many Heads.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
Data Efficient Language-supervised Zero-shot Recognition with Optimal Transport Distillation.
CoRR, 2021

FBNetV5: Neural Architecture Search for Multiple Tasks in One Run.
CoRR, 2021

Unbiased Teacher for Semi-Supervised Object Detection.
Proceedings of the 9th International Conference on Learning Representations, 2021

Visual Transformers: Where Do Transformers Really Belong in Vision Models?
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

FP-NAS: Fast Probabilistic Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Low Bandwidth Video-Chat Compression Using Deep Generative Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Rethinking the Self-Attention in Vision Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

FBNetV3: Joint Architecture-Recipe Search Using Predictor Pretraining.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Data-Efficient Language-Supervised Zero-Shot Learning With Self-Distillation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
One shot 3D photography.
ACM Trans. Graph., 2020

MEgATrack: monochrome egocentric articulated hand-tracking for virtual reality.
ACM Trans. Graph., 2020

Low Bandwidth Video-Chat Compression using Deep Generative Models.
CoRR, 2020

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge.
CoRR, 2020

Visual Transformers: Token-based Image Representation and Processing for Computer Vision.
CoRR, 2020

FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function.
CoRR, 2020


Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Machine Learning at Facebook: Understanding Inference at the Edge.
Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Mixed Precision Quantization of ConvNets via Differentiable Neural Architecture Search.
CoRR, 2018

2014
Leveraging depth cameras and wearable pressure sensors for full-body kinematics and dynamics capture.
ACM Trans. Graph., 2014

2012
Accurate realtime full-body motion capture using a single depth camera.
ACM Trans. Graph., 2012

2011
On Combining Fractional-Pixel Interpolation and Motion Estimation: A Cost-Effective Approach.
IEEE Trans. Circuits Syst. Video Technol., 2011

2010
Inter-mode decision with varied computational complexity.
Proceedings of the Visual Communications and Image Processing 2010, 2010

A split and merge algorithm for inter-mode decision in extended macroblocks.
Proceedings of the International Conference on Image Processing, 2010

An Integrated Algorithm for Fractional Pixel Interpolation and Motion Estimation of H.264.
Proceedings of the 2010 Data Compression Conference (DCC 2010), 2010


  Loading...