Zheng Zhang

Affiliations:

Microsoft Research Asia, Beijing, China
Huazhong University of Science and Technology, School of Electronic Information and Communications, Wuhan, China (former)

According to our database¹, Zheng Zhang authored at least 44 papers between 2015 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2023

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks.

[BibT_eX]

[DOI]

CoRR, 2023

DeepMIM: Deep Supervision for Masked Image Modeling.

[BibT_eX]

[DOI]

CoRR, 2023

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token.

[BibT_eX]

[DOI]

CoRR, 2023

Side Adapter Network for Open-Vocabulary Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

On Data Scaling in Masked Image Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revealing the Dark Secrets of Masked Image Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Exploring Discrete Diffusion Models for Image Captioning.

[BibT_eX]

[DOI]

CoRR, 2022

Could Giant Pretrained Image Models Extract Universal Representations?

[BibT_eX]

[DOI]

CoRR, 2022

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation.

[BibT_eX]

[DOI]

CoRR, 2022

iCAR: Bridging Image Classification and Image-text Alignment for Visual Recognition.

[BibT_eX]

[DOI]

CoRR, 2022

Could Giant Pre-trained Image Models Extract Universal Representations?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

A Simple Approach and Benchmark for 21, 000-Category Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

SimMIM: a Simple Framework for Masked Image Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Video Swin Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Swin Transformer V2: Scaling Up Capacity and Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model.

[BibT_eX]

[DOI]

CoRR, 2021

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Self-Supervised Learning with Swin Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Bootstrap Your Object Detector via Mixed Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Leveraging Batch Normalization for Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

End-to-End Semi-Supervised Object Detection with Soft Teacher.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-Free 3D Object Detection via Transformers.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

RepPoints v2: Verification Meets Regression for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Parametric Instance Classification for Unsupervised Visual Feature learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Disentangled Non-local Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Dense RepPoints: Representing Visual Objects with Dense Point Sets.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

A Closer Look at Local Aggregation Operators in Point Cloud Analysis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Negative Margin Matters: Understanding Margin in Few-Shot Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

An Empirical Study of Spatial Attention Mechanisms in Deep Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Spatial-Temporal Relation Networks for Multi-Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Local Relation Networks for Image Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

Integrated Object Detection and Tracking with Tracklet-Conditioned Detection.

[BibT_eX]

[DOI]

CoRR, 2018

Relation Networks for Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Directional Edge Boxes: Exploiting Inner Normal Direction Cues for Effective Object Proposal Generation.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2017

2016

Symmetry-based object proposal for text detection.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Multi-oriented Text Detection with Fully Convolutional Networks.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Symmetry-based text line detection in natural scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Zheng Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...