Zheng Zhang

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Huazhong University of Science and Technology, School of Electronic Information and Communications, Wuhan, China (former)


According to our database1, Zheng Zhang authored at least 44 papers between 2015 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks.
CoRR, 2023

DeepMIM: Deep Supervision for Masked Image Modeling.
CoRR, 2023

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token.
CoRR, 2023

Side Adapter Network for Open-Vocabulary Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

On Data Scaling in Masked Image Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revealing the Dark Secrets of Masked Image Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-training for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Exploring Discrete Diffusion Models for Image Captioning.
CoRR, 2022

Could Giant Pretrained Image Models Extract Universal Representations?
CoRR, 2022

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation.
CoRR, 2022

iCAR: Bridging Image Classification and Image-text Alignment for Visual Recognition.
CoRR, 2022

Could Giant Pre-trained Image Models Extract Universal Representations?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model.
Proceedings of the Computer Vision - ECCV 2022, 2022

A Simple Approach and Benchmark for 21, 000-Category Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

SimMIM: a Simple Framework for Masked Image Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Video Swin Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Swin Transformer V2: Scaling Up Capacity and Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model.
CoRR, 2021

Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning.
CoRR, 2021

Self-Supervised Learning with Swin Transformers.
CoRR, 2021

Bootstrap Your Object Detector via Mixed Training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Leveraging Batch Normalization for Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

End-to-End Semi-Supervised Object Detection with Soft Teacher.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-Free 3D Object Detection via Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
RepPoints v2: Verification Meets Regression for Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Parametric Instance Classification for Unsupervised Visual Feature learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Disentangled Non-local Neural Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dense RepPoints: Representing Visual Objects with Dense Point Sets.
Proceedings of the Computer Vision - ECCV 2020, 2020

Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Closer Look at Local Aggregation Operators in Point Cloud Analysis.
Proceedings of the Computer Vision - ECCV 2020, 2020

Negative Margin Matters: Understanding Margin in Few-Shot Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Spatial-Temporal Relation Networks for Multi-Object Tracking.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Local Relation Networks for Image Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Integrated Object Detection and Tracking with Tracklet-Conditioned Detection.
CoRR, 2018

Relation Networks for Object Detection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Directional Edge Boxes: Exploiting Inner Normal Direction Cues for Effective Object Proposal Generation.
J. Comput. Sci. Technol., 2017

2016
Symmetry-based object proposal for text detection.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Multi-oriented Text Detection with Fully Convolutional Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Symmetry-based text line detection in natural scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015


  Loading...