Xiaowei Hu

Affiliations:

Microsoft Corporation, Redmond, USA

According to our database¹, Xiaowei Hu authored at least 18 papers between 2020 and 2022.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2022

GIT: A Generative Image-to-text Transformer for Vision and Language.

Trans. Mach. Learn. Res., 2022

GLIPv2: Unifying Localization and Vision-Language Understanding.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

K-LITE: Learning Transferable Visual Models with External Knowledge.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling.

Proceedings of the Computer Vision - ECCV 2022, 2022

Injecting Semantic Concepts into End-to-End Image Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scaling Up Vision-Language Pretraining for Image Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Scaling Up Vision-Language Pre-training for Image Captioning.

CoRR, 2021

Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling.

CoRR, 2021

UFO: A UniFied TransfOrmer for Vision-Language Representation Learning.

CoRR, 2021

VinVL: Making Visual Representations Matter in Vision-Language Models.

CoRR, 2021

Compressing Visual-linguistic Model via Knowledge Distillation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

VinVL: Revisiting Visual Representations in Vision-Language Models.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

MiniVLM: A Smaller and Faster Vision-Language Model.

CoRR, 2020

VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training.

CoRR, 2020

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks.

Proceedings of the Computer Vision - ECCV 2020, 2020
