Josiah Wang

Josiel Figueiredo

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021

Read, spot and translate.

[BibT_eX]

[DOI]

Mach. Transl., 2021

MultiSubs: A Large-scale Multimodal and Multilingual Dataset.

[BibT_eX]

[DOI]

CoRR, 2021

2020

Grounded Sequence to Sequence Transduction.

[BibT_eX]

[DOI]

Loïc Barrault

Ozan Caglayan

Amanda Cardoso Duarte

IEEE J. Sel. Top. Signal Process., 2020

2019

Imperial College London Submission to VATEX Video Captioning Task.

[BibT_eX]

[DOI]

CoRR, 2019

Predicting Actions to Help Predict Translations.

[BibT_eX]

[DOI]

CoRR, 2019

Transformer-based Cascaded Multimodal Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Phrase Localization Without Paired Training Examples.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions.

[BibT_eX]

[DOI]

Pranava Madhyastha

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Automatic Image Annotation at ImageCLEF.

[BibT_eX]

[DOI]

Proceedings of the Information Retrieval Evaluation in a Changing World, 2019

2018

Visual and Semantic Knowledge Transfer for Large Scale Semi-Supervised Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

The role of image representations in vision to language tasks.

[BibT_eX]

[DOI]

Nat. Lang. Eng., 2018

End-to-end Image Captioning Exploits Multimodal Distributional Similarity.

[BibT_eX]

[DOI]

CoRR, 2018

Object Counts! Bringing Explicit Detections Back into Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Defoiling Foiled Image Captions.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

End-to-end Image Captioning Exploits Distributional Similarity in Multimodal Space.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2018, 2018

2017

Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation.

[BibT_eX]

[DOI]

Prague Bull. Math. Linguistics, 2017

Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Second Conference on Machine Translation, 2017

2016

SHEF-Multimodal: Grounding Machine Translation on Images.

[BibT_eX]

[DOI]

Kashif Shah

Proceedings of the First Conference on Machine Translation, 2016

Cross-validating Image Description Datasets and Evaluation Metrics.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation.

[BibT_eX]

[DOI]

Proceedings of the INLG 2016, 2016

Harvesting Training Images for Fine-Grained Object Categories Using Visual Descriptions.

[BibT_eX]

[DOI]

Alba Garcia Seco de Herrera

Katja Markert

Mark Everingham

Proceedings of the Advances in Information Retrieval, 2016

Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

General Overview of ImageCLEF at the CLEF 2016 Labs.

[BibT_eX]

[DOI]

Mauricio Villegas

Henning Müller

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2016

Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2016, 2016

2015

Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines.

[BibT_eX]

[DOI]

Proceedings of the ENLG 2015, 2015

Combining Geometric, Textual and Visual Features for Predicting Prepositions in Image Descriptions.

[BibT_eX]

[DOI]

Francesc Moreno-Noguer

Alba Garcia Seco de Herrera

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

General Overview of ImageCLEF at the CLEF 2015 Labs.

[BibT_eX]

[DOI]

Stefano Bromuri

M. Ashraful Amin

Mahmood Kazi Mohammed

Burak Acar

Suzan Üsküdarli

Neda Barzegar Marvasti

José Francisco Aldana-Montes

María del Mar Roldán-García

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

Overview of the ImageCLEF 2015 Scalable Image Annotation, Localization and Sentence Generation task.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2015, 2015

Defining Visually Descriptive Language.

[BibT_eX]

[DOI]

Arnau Ramisa

Proceedings of the Fourth Workshop on Vision and Language, 2015

2014

A Poodle or a Dog? Evaluating Automatic Image Annotation Using Human Descriptions at Different Levels of Granularity.

[BibT_eX]

[DOI]

Proceedings of the Third Workshop on Vision and Language, 2014

2013

Learning visual recognition of fine-grained object categories from textual descriptions.

[BibT_eX]

[DOI]

PhD thesis, 2013

2009

Learning Models for Object Recognition from Natural Language Descriptions.

[BibT_eX]

[DOI]