Yousong Zhu

Orcid: 0000-0001-8544-410X

According to our database1, Yousong Zhu authored at least 26 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring.
CoRR, 2024

2023
Mitigating Hallucination in Visual Language Models with Visual Supervision.
CoRR, 2023

Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models.
CoRR, 2023

Efficient Masked Autoencoders with Self-Consistency.
CoRR, 2023

Exploring Stochastic Autoregressive Image Modeling for Visual Representation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval.
CoRR, 2022

Part-Aware Self-Supervised Pre-Training for Person Re-Identification.
CoRR, 2022

Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

C2AM Loss: Chasing a Better Decision Boundary for Long-Tail Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Cross-Dataset Collaborative Learning for Semantic Segmentation.
CoRR, 2021

MST: Masked Self-Supervised Transformer for Visual Representation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DPT: Deformable Patch-based Transformer for Visual Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Attention-Guided Knowledge Distillation for Efficient Single-Stage Detector.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Adaptive Class Suppression Loss for Long-Tail Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Food det: Detecting foods in refrigerator with supervised transformer network.
Neurocomputing, 2020

A novel data augmentation scheme for pedestrian detection with attribute preserving GAN.
Neurocomputing, 2020

Large Batch Optimization for Object Detection: Training COCO in 12 minutes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dual Super-Resolution Learning for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Attention CoupleNet: Fully Convolutional Attention Coupling Network for Object Detection.
IEEE Trans. Image Process., 2019

Elite Loss for scene text detection.
Neurocomputing, 2019

Mask Guided Knowledge Distillation for Single Shot Detector.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Improved Single Shot Object Detector Using Enhanced Features and Predicting Heads.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
CoupleNet: Coupling Global Structure with Local Parts for Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Scale-Adaptive Deconvolutional Regression Network for Pedestrian Detection.
Proceedings of the Computer Vision - ACCV 2016, 2016


  Loading...