Ronghang Hu

Orcid: 0000-0002-5060-9485

According to our database, Ronghang Hu authored at least 29 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Scaling Language-Image Pre-Training via Masking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Exploring Long-Sequence Masked Autoencoders.
CoRR, 2022

FLAVA: A Foundational Language And Vision Alignment Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Transformer is All You Need: Multimodal Multitask Learning with a Unified Transformer.
CoRR, 2021

UniT: Multimodal Multitask Learning with a Unified Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Structured Models for Vision-and-Language Reasoning.
PhD thesis, 2020

Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image.
CoRR, 2020

TextCaps: A Dataset for Image Captioning with Reading Comprehension.
Proceedings of the Computer Vision - ECCV 2020, 2020

Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Language-Conditioned Graph Networks for Relational Reasoning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Generating Counterfactual Explanations with Natural Language.
CoRR, 2018

Speaker-Follower Models for Vision-and-Language Navigation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Explainable Neural Computation via Stack Neural Module Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Grounding Visual Explanations.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning to Segment Every Thing.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Grounding Visual Explanations (Extended Abstract).
CoRR, 2017

Learning to Reason: End-to-End Module Networks for Visual Question Answering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Modeling Relationships in Referential Expressions with Compositional Modular Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions.
CoRR, 2016

Grounding of Textual Phrases in Images by Reconstruction.
Proceedings of the Computer Vision - ECCV 2016, 2016

Segmentation from Natural Language Expressions.
Proceedings of the Computer Vision - ECCV 2016, 2016

Natural Language Object Retrieval.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Spatial Semantic Regularisation for Large Scale Object Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
LSDA: Large Scale Detection through Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Robust Head-Shoulder Detection Using a Two-Stage Cascade Framework.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

