Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

What is More Likely to Happen Next? Video-and-Language Future Event Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Attention-Based Query Expansion Learning.

[BibT_eX]

[DOI]

Albert Gordo

Filip Radenovic

Tamara L. Berg

Proceedings of the Computer Vision - ECCV 2020, 2020

TVQA+: Spatio-Temporal Grounding for Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Combining Multiple Cues for Visual Madlibs Question Answering.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2019

Dance Dance Generation: Motion Transfer for Internet Videos.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

IMP: Instance Mask Projection for High Accuracy Semantic Segmentation of Things.

[BibT_eX]

[DOI]

Cheng-Yang Fu

Tamara L. Berg

Alexander C. Berg

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-Target Embodied Question Answering.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Physics-Inspired Garment Recovery from a Single-View Image.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2018

From image to language and back again.

[BibT_eX]

[DOI]

Anya Belz

Tamara L. Berg

Licheng Yu

Nat. Lang. Eng., 2018

Image2GIF: Generating Cinemagraphs Using Recurrent Deep Q-Networks.

[BibT_eX]

[DOI]

Yipin Zhou

Yale Song

Tamara L. Berg

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

TVQA: Localized, Compositional Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Visual to Sound: Generating Natural Sound for Videos in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

MAttNet: Modular Attention Network for Referring Expression Comprehension.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

When Was That Made?

[BibT_eX]

[DOI]

Sirion Vittayakorn

Alexander C. Berg

Tamara L. Berg

Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Hierarchically-Attentive RNN for Album Summarization and Storytelling.

[BibT_eX]

[DOI]

Licheng Yu

Mohit Bansal

Tamara L. Berg

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Large Scale Retrieval and Generation of Image Descriptions.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Detailed Garment Recovery from a Single-View Image.

[BibT_eX]

[DOI]

CoRR, 2016

Learning to name objects.

[BibT_eX]

[DOI]

Commun. ACM, 2016

Combining multiple sources of knowledge in deep CNNs for action recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Learning Temporal Transformations from Time-Lapse Videos.

[BibT_eX]

[DOI]

Yipin Zhou

Tamara L. Berg

Proceedings of the Computer Vision - ECCV 2016, 2016

Modeling Context in Referring Expressions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Solving VIsual Madlibs with Multiple Cues.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2016, 2016

Auto-Illustrating Poems and Songs with Style.

[BibT_eX]

[DOI]

Katharina Schwarz

Tamara L. Berg

Hendrik P. A. Lensch

Proceedings of the Computer Vision - ACCV 2016, 2016

2015

Retrieving Similar Styles to Parse Clothing.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Predicting Entry-Level Categories.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2015

Visual Madlibs: Fill in the blank Image Generation and Question Answering.

[BibT_eX]

[DOI]

CoRR, 2015

Runway to Realway: Visual Analysis of Fashion.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Temporal Perception and Prediction in Ego-Centric Video.

[BibT_eX]

[DOI]

Yipin Zhou

Tamara L. Berg

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Visual Madlibs: Fill in the Blank Description Generation and Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Where to Buy It: Matching Street Clothing Photos in Online Shops.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Refer-to-as Relations as Semantic Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

TREETALK: Composition and Compression of Trees for Image Descriptions.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2014

Materials discovery: Fine-grained classification of X-ray scattering images.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Chic or Social: Visual Popularity Analysis in Online Fashion Networks.

[BibT_eX]

[DOI]

Kota Yamaguchi

Tamara L. Berg

Luis E. Ortiz

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

ReferItGame: Referring to Objects in Photographs of Natural Scenes.

[BibT_eX]

[DOI]

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Learning High-Level Judgments of Urban Perception.

[BibT_eX]

[DOI]

Vicente Ordonez

Tamara L. Berg

Proceedings of the Computer Vision - ECCV 2014, 2014

Hipster Wars: Discovering Elements of Fashion Styles.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

2013

BabyTalk: Understanding and Generating Simple Image Descriptions.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2013

Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items.

[BibT_eX]

[DOI]

Kota Yamaguchi

M. Hadi Kiapour

Tamara L. Berg

Proceedings of the IEEE International Conference on Computer Vision, 2013

From Large Scale Image Categorization to Entry-Level Categories.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Studying Relationships between Human Gaze, Description, and Computer Vision.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Generalizing Image Captions for Image-Text Parallel Corpus.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012

Detecting Visual Text.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Midge: Generating Image Descriptions From Computer Vision Detections.

[BibT_eX]

[DOI]

Proceedings of the EACL 2012, 2012

Two-person interaction detection using body-pose features and multiple instance learning.

[BibT_eX]

[DOI]

Kiwon Yun

Jean Honorio

Debaleena Chattopadhyay

Tamara L. Berg

Dimitris Samaras

Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

Parsing clothing in fashion photographs.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Understanding and predicting importance in images.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Collective Generation of Natural Image Descriptions.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011

Can Computers Master the Art of Communication?: A Focus on Visual Analytics.

[BibT_eX]

[DOI]

IEEE Computer Graphics and Applications, 2011

Iconizer: A Framework to Identify and Create Effective Representations for Visual Information Encoding.

[BibT_eX]

[DOI]

Supriya Garg

Tamara L. Berg

Klaus Mueller

Proceedings of the Smart Graphics - 11th International Symposium, 2011

Im2Text: Describing Images Using 1 Million Captioned Photographs.

[BibT_eX]

[DOI]

Vicente Ordonez

Girish Kulkarni

Tamara L. Berg

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Who are you with and where are you going?

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Baby talk: Understanding and generating simple image descriptions.

[BibT_eX]

[DOI]

Girish Kulkarni