Josiah Wang

Orcid: 0000-0003-0048-3893

According to our database1, Josiah Wang authored at least 33 papers between 2009 and 2022.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
MultiSubs: A Large-scale Multimodal and Multilingual Dataset.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
Read, spot and translate.
Mach. Transl., 2021

MultiSubs: A Large-scale Multimodal and Multilingual Dataset.
CoRR, 2021

2020
Grounded Sequence to Sequence Transduction.
IEEE J. Sel. Top. Signal Process., 2020

2019
Imperial College London Submission to VATEX Video Captioning Task.
CoRR, 2019

Predicting Actions to Help Predict Translations.
CoRR, 2019

Transformer-based Cascaded Multimodal Speech Translation.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Phrase Localization Without Paired Training Examples.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Automatic Image Annotation at ImageCLEF.
Proceedings of the Information Retrieval Evaluation in a Changing World, 2019

2018
Visual and Semantic Knowledge Transfer for Large Scale Semi-Supervised Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

The role of image representations in vision to language tasks.
Nat. Lang. Eng., 2018

End-to-end Image Captioning Exploits Multimodal Distributional Similarity.
CoRR, 2018

Object Counts! Bringing Explicit Detections Back into Image Captioning.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Defoiling Foiled Image Captions.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

End-to-end Image Captioning Exploits Distributional Similarity in Multimodal Space.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

2017
Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation.
Prague Bull. Math. Linguistics, 2017

Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation.
Proceedings of the Second Conference on Machine Translation, 2017

2016
SHEF-Multimodal: Grounding Machine Translation on Images.
Proceedings of the First Conference on Machine Translation, 2016

Cross-validating Image Description Datasets and Evaluation Metrics.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation.
Proceedings of the INLG 2016, 2016

Harvesting Training Images for Fine-Grained Object Categories Using Visual Descriptions.
Proceedings of the Advances in Information Retrieval, 2016

Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016


Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task.
Proceedings of the Working Notes of CLEF 2016, 2016

2015
Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines.
Proceedings of the ENLG 2015, 2015

Combining Geometric, Textual and Visual Features for Predicting Prepositions in Image Descriptions.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015


Overview of the ImageCLEF 2015 Scalable Image Annotation, Localization and Sentence Generation task.
Proceedings of the Working Notes of CLEF 2015, 2015

Defining Visually Descriptive Language.
Proceedings of the Fourth Workshop on Vision and Language, 2015

2014
A Poodle or a Dog? Evaluating Automatic Image Annotation Using Human Descriptions at Different Levels of Granularity.
Proceedings of the Third Workshop on Vision and Language, 2014

2013
Learning visual recognition of fine-grained object categories from textual descriptions.
PhD thesis, 2013

2009
Learning Models for Object Recognition from Natural Language Descriptions.
Proceedings of the British Machine Vision Conference, 2009


  Loading...