Ali Farhadi

CoRR, 2020

Probing Text Models for Common Ground with Visual Representations.

CoRR, 2020

Visual Commonsense Graphs: Reasoning about the Dynamic Context of a Still Image.

CoRR, 2020

Evaluating Machines by their Real-World Language Use.

CoRR, 2020

Watching the World Go By: Representation Learning from Unlabeled Videos.

CoRR, 2020

Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping.

CoRR, 2020

Enabling AI at the edge with XNOR-networks.

Commun. ACM, 2020

Supermasks in Superposition.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Soft Threshold Weight Reparameterization for Learnable Sparsity.

Proceedings of the 37th International Conference on Machine Learning, 2020

Grounded Situation Recognition.

Proceedings of the Computer Vision - ECCV 2020, 2020

VisualCOMET: Reasoning About the Dynamic Context of a Still Image.

Proceedings of the Computer Vision - ECCV 2020, 2020

A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks.

Proceedings of the Computer Vision - ECCV 2020, 2020

Visual Reaction: Learning to Play Catch With Your Drone.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

What's Hidden in a Randomly Weighted Neural Network?

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Use the Force, Luke! Learning to Predict Physical Forces by Simulating Effects.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Butterfly Transform: An Efficient FFT Based Neural Architecture Design.

Keivan Alizadeh-Vahid

Anish Prabhu

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game.

CoRR, 2019

Butterfly Transform: An Efficient FFT Based Neural Architecture Design.

Keivan Alizadeh

CoRR, 2019

What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning.

Daniel Gordon

Dieter Fox

CoRR, 2019

Defending Against Neural Fake News.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Discovering Neural Wirings.

Mitchell Wortsman

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Visual Semantic Navigation using Scene Priors.

Proceedings of the 7th International Conference on Learning Representations, 2019

From Recognition to Cognition: Visual Commonsense Reasoning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ELASTIC: Improving CNNs With Dynamic Scaling Policies.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph.

Yao-Hung Hubert Tsai

Louis-Philippe Morency

Ruslan Salakhutdinov

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Two Body Problem: Collaborative Visual Task Completion.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Conditional Driving from Natural Language Instructions.

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

HellaSwag: Can a Machine Really Finish Your Sentence?

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index.

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

PhotoShape: photorealistic materials for large-scale shape collections.

ACM Trans. Graph., 2018

Re³: Real-Time Recurrent Regression Networks for Visual Tracking of Generic Objects.

Daniel Gordon

Dieter Fox

IEEE Robotics Autom. Lett., 2018

ELASTIC: Improving CNNs with Instance Specific Scaling Policies.

CoRR, 2018

Label Refinery: Improving ImageNet Classification through Label Progression.

CoRR, 2018

Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos.

CoRR, 2018

YOLOv3: An Incremental Improvement.

Joseph Redmon

CoRR, 2018

Transferring Common-Sense Knowledge for Object Detection.

Krishna Kumar Singh

Yong Jae Lee

CoRR, 2018

Neural Speed Reading via Skim-RNN.

Proceedings of the 6th International Conference on Learning Representations, 2018

Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

DOCK: Detecting Objects by Transferring Common-Sense Knowledge.

Krishna Kumar Singh

Yong Jae Lee

Proceedings of the Computer Vision - ECCV 2018, 2018

Imagine This! Scripts to Compositions to Videos.

Proceedings of the Computer Vision - ECCV 2018, 2018

Actor and Observer: Joint Modeling of First and Third-Person Videos.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

IQA: Visual Question Answering in Interactive Environments.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

SeGAN: Segmenting and Generating the Invisible.

Kiana Ehsani

Roozbeh Mottaghi

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Who Let the Dogs Out? Modeling Dog Behavior From Visual Data.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Structured Set Matching Networks for One-Shot Part Labeling.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

AJILE Movement Prediction: Multimodal Deep Learning for Natural Human Neural Recordings and Video.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Semantic Highlight Retrieval and Term Prediction.

IEEE Trans. Image Process., 2017

Summarizing Unconstrained Videos Using Salient Montages.

IEEE Trans. Pattern Anal. Mach. Intell., 2017

AI2-THOR: An Interactive 3D Environment for Visual AI.

CoRR, 2017

Re3: Real-Time Recurrent Regression Networks for Object Tracking.

Daniel Gordon

Dieter Fox

CoRR, 2017

Toward visual intelligence.

Proceedings of the Workshop on Trends in Machine-Learning (and impact on computer architecture), 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Query-Reduction Networks for Question Answering.

Proceedings of the 5th International Conference on Learning Representations, 2017

Bidirectional Attention Flow for Machine Comprehension.

Proceedings of the 5th International Conference on Learning Representations, 2017

Visual Semantic Planning Using Deep Successor Representations.

Proceedings of the IEEE International Conference on Computer Vision, 2017

See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Commonly Uncommon: Semantic Sparsity in Situation Recognition.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Asynchronous Temporal Fields for Action Recognition.

Gunnar A. Sigurdsson

Abhinav Gupta

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

YOLO9000: Better, Faster, Stronger.

Joseph Redmon

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

LCNN: Lookup-Based Convolutional Neural Network.

Hessam Bagherinezhad

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Ranking Highlights in Personal Videos by Analyzing Edited Videos.

IEEE Trans. Image Process., 2016

Query-Regression Networks for Machine Comprehension.

Min Joon Seo

CoRR, 2016

NCAM: Near-Data Processing for Nearest Neighbor Search.

CoRR, 2016

Stating the Obvious: Extracting Visual Common Sense Knowledge.

Mark Yatskar

Vicente Ordonez

Proceedings of the NAACL HLT 2016, 2016

Unsupervised Deep Embedding for Clustering Analysis.

Junyuan Xie

Ross B. Girshick

Proceedings of the 33rd International Conference on Machine Learning, 2016

Semantic highlight retrieval.

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Much Ado About Time: Exhaustive Annotation of Temporal Data.

Proceedings of the Fourth AAAI Conference on Human Computation and Crowdsourcing, 2016

Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks.

Junyuan Xie

Ross B. Girshick

Proceedings of the Computer Vision - ECCV 2016, 2016

Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding.

Proceedings of the Computer Vision - ECCV 2016, 2016

FigureSeer: Parsing Result-Figures in Research Papers.

Noah Siegel

Zachary Horvitz

Roie Levin

Proceedings of the Computer Vision - ECCV 2016, 2016

XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks.

Proceedings of the Computer Vision - ECCV 2016, 2016

"What Happens If..." Learning to Predict the Effect of Forces in Images.

Proceedings of the Computer Vision - ECCV 2016, 2016

A Diagram is Worth a Dozen Images.

Proceedings of the Computer Vision - ECCV 2016, 2016

Situation Recognition: Visual Semantic Role Labeling for Image Understanding.

Mark Yatskar

Luke Zettlemoyer

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Actions ~ Transformations.

Xiaolong Wang

Abhinav Gupta

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

You Only Look Once: Unified, Real-Time Object Detection.

Joseph Redmon

Ross B. Girshick

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Task-Oriented Approach for Cost-Sensitive Recognition.

Roozbeh Mottaghi

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Toward a Taxonomy and Computational Models of Abnormalities in Images.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects.

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

On the Application of Genetic Programming for New Generation of Ground Motion Prediction Equations.

Proceedings of the Handbook of Genetic Programming Applications, 2015

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing.

Hamid Izadinia

Yejin Choi

CoRR, 2015

Learning to Select and Order Vacation Photographs.

Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Visalogy: Answering Visual Analogy Questions.

C. Lawrence Zitnick

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Deep Classifiers from Image Tags in the Wild.

Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!

Bilge Soran

Linda G. Shapiro

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing.

Hamid Izadinia

Yejin Choi

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Solving Geometry Problems: Combining Text and Diagram Interpretation.

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Discriminative and consistent similarities in instance-level Multiple Instance Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Abnormal Object Recognition: A Comprehensive Study.

Babak Saleh

Ahmed M. Elgammal

CoRR, 2014

Image Classification and Retrieval from User-Supplied Tags.

CoRR, 2014

Multi-Resolution Language Grounding with Weak Supervision.

Rik Koncel-Kedziorski

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Salient Montages from Unconstrained Videos.

Proceedings of the Computer Vision - ECCV 2014, 2014

Ranking Domain-Specific Highlights by Analyzing Edited Videos.

Min Sun

Steven M. Seitz

Proceedings of the Computer Vision - ECCV 2014, 2014

Towards Transparent Systems: Semantic Characterization of Failure Modes.

Aayush Bansal

Devi Parikh

Proceedings of the Computer Vision - ECCV 2014, 2014

Predicting Failures of Vision Systems.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Incorporating Scene Context and Object Layout into Appearance Modeling.

Hamid Izadinia

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Learning Everything about Anything: Webly-Supervised Visual Concept Learning.

Carlos Guestrin

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Action Recognition in the Presence of One Egocentric and Multiple Static Cameras.

Bilge Soran

Linda G. Shapiro

Proceedings of the Computer Vision - ACCV 2014, 2014

Diagram Understanding in Geometry Questions.

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Phrasal Recognition.

Mohammad Amin Sadeghi

IEEE Trans. Pattern Anal. Mach. Intell., 2013

Object-Centric Anomaly Detection by Attribute-Based Reasoning.

Babak Saleh

Ahmed M. Elgammal

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multi-attribute Queries: To Merge or Not to Merge?

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Adding Unlabeled Samples to Categories by Learned Attributes.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Semantic Understanding of Professional Soccer Commentaries.

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Attribute Discovery via Predictable Discriminative Binary Codes.

Proceedings of the Computer Vision - ECCV 2012, 2012

Building a dictionary of image fragments.

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Designing representational architectures in recognition.

PhD thesis, 2011

Using Classification to Protect the Integrity of Spectrum Measurements in White Space Networks.

Proceedings of the Network and Distributed System Security Symposium, 2011

Understanding egocentric activities.

Alireza Fathi

James M. Rehg

Proceedings of the IEEE International Conference on Computer Vision, 2011

Recognition using visual phrases.

Mohammad Amin Sadeghi

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

It's All About the Data.

Proc. IEEE, 2010

Every Picture Tells a Story: Generating Sentences from Images.

Seyyed Mohammad Mohsen Hejrati

Mohammad Amin Sadeghi

Proceedings of the Computer Vision - ECCV 2010, 2010

Attribute-centric recognition for cross-category generalization.

Ian Endres

Derek Hoiem

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

The benefits and challenges of collecting richer object annotations.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Unlabeled data improves word prediction.

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

A latent model of discriminative aspect.

Mostafa Kamali Tabrizi

Ian Endres

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Describing objects by their attributes.

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008

Scene Discovery by Matrix Factorization.

Nicolas Loeff

Proceedings of the Computer Vision - ECCV 2008, 2008

Learning to Recognize Activities from the Wrong View Point.

Mostafa Kamali Tabrizi

Proceedings of the Computer Vision - ECCV 2008, 2008

2007

Transfer Learning in Sign language.

Ryan White

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006

How to tell the difference between a cat and a dog?

Int. J. Imaging Syst. Technol., 2006

An application of linear predictive coding and computational geometry to iris recognition.

Masoud Alipour

Nima Razavi

Int. J. Imaging Syst. Technol., 2006

Aligning ASL for Statistical Translation Using a Discriminative Word Model.

Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2003

Image segmentation via local higher order statistics.
