Dhruv Batra

Prithvijit Chattopadhyay

Proceedings of the Computer Vision - ECCV 2018, 2018

Visual Coreference Resolution in Visual Dialog Using Neural Module Networks.

Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Baby Talk.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Embodied Question Answering.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition.

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Neural Modular Control for Embodied Question Answering.

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Diverse Beam Search for Improved Description of Complex Scenes.

Ashwin K. Vijayakumar

Michael Cogswell

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Empirical Minimum Bayes Risk Prediction.

IEEE Trans. Pattern Anal. Mach. Intell., 2017

VQA: Visual Question Answering - www.visualqa.org.

Int. J. Comput. Vis., 2017

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Comput. Vis. Image Underst., 2017

Resolving vision and language ambiguities together: Joint segmentation & prepositional attachment resolution in captioned scenes.

Comput. Vis. Image Underst., 2017

CoDraw: Visual Dialog for Collaborative Drawing.

CoRR, 2017

C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset.

CoRR, 2017

Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation.

Proceedings of the 5th International Conference on Learning Representations, 2017

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Evaluating Visual Conversational Agents via Cooperative Human-AI Games.

Prithvijit Chattopadhyay

Proceedings of the Fifth AAAI Conference on Human Computation and Crowdsourcing, 2017

ParlAI: A Dialog Research Software Platform.

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

The Promise of Premise: Harnessing Question Premises in Visual Question Answering.

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Deal or No Deal? End-to-End Learning of Negotiation Dialogues.

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog.

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning.

Qing Sun

Stefan Lee

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Visual Dialog.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Counting Everyday Objects in Everyday Scenes.

Prithvijit Chattopadhyay

Ramakrishna Vedantam

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Gender Classification of Walkers via Underfloor Accelerometer Measurements.

IEEE Internet Things J., 2016

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models.

Ashwin K. Vijayakumar

Michael Cogswell

CoRR, 2016

Grad-CAM: Why did you say that?

CoRR, 2016

Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization.

CoRR, 2016

A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories.

CoRR, 2016

Interpreting Visual Question Answering Models.

CoRR, 2016

Reducing Overfitting in Deep Networks by Decorrelating Representations.

Proceedings of the 4th International Conference on Learning Representations, 2016

Measuring Machine Intelligence Through Visual Question Answering.

AI Mag., 2016

Pose tracking by efficiently exploiting global features.

Ratnesh Kumar

Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Hierarchical Question-Image Co-Attention for Visual Question Answering.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles.

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories.

Proceedings of the NAACL HLT 2016, 2016

Visual Storytelling.

Ting-Hao (Kenneth) Huang

Proceedings of the NAACL HLT 2016, 2016

Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions.

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Resolving Language and Vision Ambiguities Together: Joint Segmentation & Prepositional Attachment Resolution in Captioned Scenes.

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Sort Story: Sorting Jumbled Images and Captions into Stories.

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Analyzing the Behavior of Visual Question Answering Models.

Aishwarya Agrawal

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Yin and Yang: Balancing and Answering Binary Visual Questions.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Joint Unsupervised Learning of Deep Representations and Image Clusters.

Jianwei Yang

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Object-Proposal Evaluation Protocol is 'Gameable'.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

We are Humor Beings: Understanding and Predicting Visual Humor.

Arjun Chandrasekaran

Ashwin K. Vijayakumar

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Radio transformer networks: Attention models for learning to synchronize in wireless systems.

Proceedings of the 50th Asilomar Conference on Signals, Systems and Computers, 2016

2015

Human pose estimation via multi-layer composite models.

Kun Duan

Clint Solomon Mathialagan

David J. Crandall

Signal Process., 2015

Guest Editors' Introduction: Special Section on Higher Order Graphical Models in Computer Vision.

IEEE Trans. Pattern Anal. Mach. Intell., 2015

A Comparative Study of Modern Inference Techniques for Structured Discrete Energy Minimization Problems.

Int. J. Comput. Vis., 2015

Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks.

CoRR, 2015

CloudCV: Large Scale Distributed Computer Vision as a Cloud Service.

Harsh Agrawal

CoRR, 2015

SubmodBoxes: Near-Optimal Search for a Set of Diverse Object Proposals.

Qing Sun

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

VQA: Visual Question Answering.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Optimizing Expected Intersection-Over-Union with Candidate-Constrained CRFs.

Faruk Ahmed

Daniel Tarlow

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Active learning for structured probabilistic models with histogram approximation.

Qing Sun

Ankit Laddha

Clint Solomon Mathialagan

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

VIP: Finding important people in images.

Andrew C. Gallagher

Clint Solomon Mathialagan

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service.

Harsh Agrawal

Proceedings of the Mobile Cloud Visual Media Computing - From Interaction to Service, 2015

2014

Putting the User in the Loop for Image-Based Modeling.

Int. J. Comput. Vis., 2014

Combining the Best of Graphical Models and ConvNets for Semantic Segmentation.

CoRR, 2014

Candidate Constrained CRFs for Loss-Aware Structured Prediction.

Faruk Ahmed

Daniel Tarlow

CoRR, 2014

Submodular meets Structured: Finding Diverse Subsets in Exponentially-Large Structured Item Sets.

Adarsh Prasad

Stefanie Jegelka

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Empirical Minimum Bayes Risk Prediction: How to Extract an Extra Few % Performance from Vision Models with Just Three More Parameters.

Vittal Premachandran

Daniel Tarlow

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Multimodal Learning in Loosely-Organized Web Images.

Kun Duan

David J. Crandall

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Efficiently Enforcing Diversity in Multi-Output Structured Prediction.

Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014

2013

Group Norm for Learning Structured SVMs with Unstructured Latent Variables.

Daozheng Chen

William T. Freeman

Proceedings of the IEEE International Conference on Computer Vision, 2013

A Systematic Exploration of Diversity in Machine Translation.

Kevin Gimpel

Chris Dyer

Gregory Shakhnarovich

Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Discriminative Re-ranking of Diverse Segmentations.

Payman Yadollahpour

Gregory Shakhnarovich

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

A Comparative Study of Modern Inference Techniques for Discrete Energy Minimization Problems.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

DivMCuts: Faster Training of Structural SVMs with Diverse M-Best Cutting-Planes.

Abner Guzmán-Rivera

Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

2012

An Efficient Message-Passing Algorithm for the M-Best MAP Problem.

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Multiple Choice Learning: Learning to Produce Multiple Structured Outputs.

Abner Guzmán-Rivera

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Diverse M-Best Solutions in Markov Random Fields.

Payman Yadollahpour

Abner Guzmán-Rivera

Gregory Shakhnarovich

Proceedings of the Computer Vision - ECCV 2012, 2012

Learning the right model: Efficient max-margin learning in Laplacian CRFs.

Ashutosh Saxena

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

MaxFlow Revisited: An Empirical Comparison of Maxflow Algorithms for Dense Vision Problems.

Tanmay Verma

Proceedings of the British Machine Vision Conference, 2012

A Multi-layer Composite Model for Human Pose Estimation.

Kun Duan

David J. Crandall

Proceedings of the British Machine Vision Conference, 2012

2011

Interactive Co-segmentation of Objects in Image Collections.

Springer Briefs in Computer Science, Springer, ISBN: 978-1-4614-1915-0, 2011

Tighter Relaxations for MAP-MRF Inference: A Local Primal-Dual Gap based Separation Algorithm.

Sebastian Nowozin

Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance.

Int. J. Comput. Vis., 2011

Dynamic Tree Block Coordinate Ascent.

Proceedings of the 28th International Conference on Machine Learning, 2011

Scribble based interactive 3D reconstruction via scene co-segmentation.

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Inference for order reduction in Markov random fields.

Andrew C. Gallagher

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Making the right moves: Guiding alpha-expansion using local primal-dual gaps.

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

iModel: Interactive Co-segmentation for Object of Interest 3D Modeling.

Proceedings of the Trends and Topics in Computer Vision, 2010

iCoseg: Interactive co-segmentation with intelligent scribble guidance.

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Beyond trees: MRF inference via outer-planar decomposition.

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Seed Image Selection in interactive cosegmentation.

Proceedings of the International Conference on Image Processing, 2009

Cutout-search: Putting a name to the picture.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2008

Learning class-specific affinities for image labelling.

Rahul Sukthankar

Tsuhan Chen

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Semi-Supervised Clustering via Learnt Codeword Distances.
