Devi Parikh

ORCID: 0000-0002-3779-6706

According to our database, Devi Parikh authored at least 204 papers between 2004 and 2024.

Bibliography

2024
Video Editing via Factorized Diffusion Distillation.
CoRR, 2024

2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning.
CoRR, 2023

Emu Edit: Precise Image Editing via Recognition and Generation Tasks.
CoRR, 2023

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack.
CoRR, 2023

Text-Conditional Contextualized Avatars For Zero-Shot Personalization.
CoRR, 2023

Text-To-4D Dynamic Scene Generation.
Proceedings of the International Conference on Machine Learning, 2023

Make-A-Video: Text-to-Video Generation without Text-Video Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AudioGen: Textually Guided Audio Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SpaText: Spatio-Textual Representation for Controllable Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration.
Proceedings of the Computer Vision - ECCV 2022, 2022

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors.
Proceedings of the Computer Vision - ECCV 2022, 2022

Episodic Memory Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Telling Creative Stories Using Generative Visual Aids.
CoRR, 2021

Dance2Music: Automatic Dance-driven Music Generation.
CoRR, 2021

Building Bridges: Generative Artworks to Explore AI Ethics.
CoRR, 2021

Human-Adversarial Visual Question Answering.
CoRR, 2021

ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations.
CoRR, 2021

Human-Adversarial Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Creative Sketch Generation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Contrast and Classify: Training Robust VQA Models.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Visual Conceptual Blending with Large-Scale Language and Vision Models.
Proceedings of the Twelfth International Conference on Computational Creativity, 2021

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

AI-assisted Human creativity.
Proceedings of the 4th Workshop on Affective Content Analysis (AffCon 2021) co-located with Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

2020
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization.
Int. J. Comput. Vis., 2020

Object-Centric Diagnosis of Visual Reasoning.
CoRR, 2020

The Open Catalyst 2020 (OC20) Dataset and Community Challenges.
CoRR, 2020

An Introduction to Electrocatalyst Design using Machine Learning for Renewable Energy Storage.
CoRR, 2020

Contrast and Classify: Alternate Training for Robust VQA.
CoRR, 2020

Are we pretraining it right? Digging deeper into visio-linguistic pretraining.
CoRR, 2020

SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions.
CoRR, 2020


Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Embodied Multimodal Multitask Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames.
Proceedings of the 8th International Conference on Learning Representations, 2020

Feel The Music: Automatically Generating A Dance For An Input Song.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Exploring Crowd Co-creation Scenarios for Sketches.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Predicting A Creator's Preferences In, and From, Interactive Generative Art.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Lemotif: An Affective Visual Journal Using Deep Neural Networks.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Neuro-Symbolic Generative Art: A Preliminary Study.
Proceedings of the Eleventh International Conference on Computational Creativity, 2020

Where Are You? Localization from Embodied Dialog.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Large-Scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline.
Proceedings of the Computer Vision - ECCV 2020, 2020

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web.
Proceedings of the Computer Vision - ECCV 2020, 2020

Spatially Aware Multimodal Transformers for TextVQA.
Proceedings of the Computer Vision - ECCV 2020, 2020

SQuINTing at VQA Models: Introspecting VQA Models With Sub-Questions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

12-in-1: Multi-Task Vision and Language Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents.
Proceedings of the 4th Conference on Robot Learning, 2020

Sim-to-Real Transfer for Vision-and-Language Navigation.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Visual Dialog.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering.
Int. J. Comput. Vis., 2019

Decentralized Distributed PPO: Solving PointGoal Navigation.
CoRR, 2019

Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning.
CoRR, 2019

Emergence of Compositional Language with Deep Generational Transmission.
CoRR, 2019

Embodied Visual Recognition.
CoRR, 2019

Lemotif: Abstract Visual Depictions of your Emotional States in Life.
CoRR, 2019

Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future.
CoRR, 2019

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.
CoRR, 2019

Response to "Visual Dialogue without Vision or Dialogue" (Massiceti et al., 2018).
CoRR, 2019

Dialog System Technology Challenge 7.
CoRR, 2019

Cross-channel Communication Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

RUBi: Reducing Unimodal Biases for Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Chasing Ghosts: Instruction Following as Bayesian State Tracking.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering.
Proceedings of the 36th International Conference on Machine Learning, 2019

Counterfactual Visual Explanations.
Proceedings of the 36th International Conference on Machine Learning, 2019

TarMAC: Targeted Multi-Agent Communication.
Proceedings of the 36th International Conference on Machine Learning, 2019

Modeling the Long Term Future in Model-Based Reinforcement Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Embodied Amodal Recognition: Learning to Move to Perceive Objects.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Habitat: A Platform for Embodied AI Research.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Fashion++: Minimal Edits for Outfit Improvement.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

nocaps: novel object captioning at scale.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Trick or TReAT : Thematic Reinforcement for Artistic Typography.
Proceedings of the Tenth International Conference on Computational Creativity, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Generative Visual Dialog by Answering Diverse Questions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Embodied Question Answering in Photorealistic Environments With Point Cloud Perception.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Towards VQA Models That Can Read.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Cycle-Consistency for Robust Visual Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Pythia v0.1: the Winning Entry to the VQA Challenge 2018.
CoRR, 2018

Talk the Walk: Navigating New York City through Grounded Dialogue.
CoRR, 2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7.
CoRR, 2018

Punny Captions: Witty Wordplay in Image Descriptions.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Do explanations make VQA models more predictable to a human?
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Graph R-CNN for Scene Graph Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Choose Your Neuron: Incorporating Domain Knowledge Through Neuron-Importance.
Proceedings of the Computer Vision - ECCV 2018, 2018

Visual Coreference Resolution in Visual Dialog Using Neural Module Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Baby Talk.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Embodied Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Neural Modular Control for Embodied Question Answering.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

2017
VQA: Visual Question Answering - www.visualqa.org.
Int. J. Comput. Vis., 2017

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Comput. Vis. Image Underst., 2017

CoDraw: Visual Dialog for Collaborative Drawing.
CoRR, 2017

Active Learning for Visual Question Answering: An Empirical Study.
CoRR, 2017

It Takes Two to Tango: Towards Theory of AI's Mind.
CoRR, 2017

Cooperative Learning with Visual Attributes.
CoRR, 2017

C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset.
CoRR, 2017

Best of Both Worlds: Transferring Knowledge from Discriminative Learning to a Generative Visual Dialog Model.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation.
Proceedings of the 5th International Conference on Learning Representations, 2017

Evaluating Visual Conversational Agents via Cooperative Human-AI Games.
Proceedings of the Fifth AAAI Conference on Human Computation and Crowdsourcing, 2017

Sound-Word2Vec: Learning Word Representations Grounded in Sounds.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

ParlAI: A Dialog Research Software Platform.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Deal or No Deal? End-to-End Learning of Negotiation Dialogues.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Context-Aware Captions from Context-Agnostic Supervision.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Visual Dialog.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Counting Everyday Objects in Everyday Scenes.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Adopting Abstract Images for Semantic Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Human-Machine CRFs for Identifying Bottlenecks in Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Grad-CAM: Why did you say that?
CoRR, 2016

Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization.
CoRR, 2016

A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories.
CoRR, 2016

Interpreting Visual Question Answering Models.
CoRR, 2016

Measuring Machine Intelligence Through Visual Question Answering.
AI Mag., 2016

Hierarchical Question-Image Co-Attention for Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories.
Proceedings of the NAACL HLT 2016, 2016


Knowing who to listen to: Prioritizing experts from a diverse ensemble for attribute personalization.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Sort Story: Sorting Jumbled Images and Captions into Stories.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Analyzing the Behavior of Visual Question Answering Models.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Leveraging Visual Question Answering for Image-Caption Ranking.
Proceedings of the Computer Vision - ECCV 2016, 2016

Deep Learning the City: Quantifying Urban Perception at a Global Scale.
Proceedings of the Computer Vision - ECCV 2016, 2016

Yin and Yang: Balancing and Answering Binary Visual Questions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Joint Unsupervised Learning of Deep Representations and Image Clusters.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

We are Humor Beings: Understanding and Predicting Visual Humor.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
WhittleSearch: Interactive Image Search with Relative Attribute Feedback.
Int. J. Comput. Vis., 2015

Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes.
CoRR, 2015

Semantic classification of spacecraft's status: integrating system intelligence and human knowledge.
Proceedings of the 9th IEEE International Conference on Semantic Computing, 2015

Learning Common Sense through Visual Abstraction.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

VQA: Visual Question Answering.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

CIDEr: Consensus-based image description evaluation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Don't just listen, use your imagination: Leveraging visual common sense for non-visual tasks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Image specificity.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Understanding image virality.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
What Makes a Photograph Memorable?
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Collecting Image Description Datasets using Crowdsourcing.
CoRR, 2014

Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding.
CoRR, 2014

Interactively Guiding Semi-Supervised Clustering via Attribute-Based Explanations.
Proceedings of the Computer Vision - ECCV 2014, 2014

Towards Transparent Systems: Semantic Characterization of Failure Modes.
Proceedings of the Computer Vision - ECCV 2014, 2014

Zero-Shot Learning via Visual Abstraction.
Proceedings of the Computer Vision - ECCV 2014, 2014

Predicting Failures of Vision Systems.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Predicting User Annoyance Using Visual Attributes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Which Edges Matter?
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Learning the Visual Interpretation of Sentences.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Attribute Dominance: What Pops Out?
Proceedings of the IEEE International Conference on Computer Vision, 2013

Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Implied Feedback: Learning Nuances of User Behavior in Image Search.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Bringing Semantics into Focus Using Visual Abstraction.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Multi-attribute Queries: To Merge or Not to Merge?
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Simultaneous Active Learning of Classifiers & Attributes via Relative Feedback.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Visual attributes for enhanced human-machine communication.
Proceedings of the 51st Annual Allerton Conference on Communication, 2013

2012
Exploring Tiny Images: The Roles of Appearance and Contextual Information for Machine and Human Object Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Attributes for Classifier Feedback.
Proceedings of the Computer Vision - ECCV 2012, 2012

The role of image understanding in contour detection.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Automatic discovery of groups of objects for scene understanding.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

WhittleSearch: Image search with relative attribute feedback.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Discovering localized attributes for fine-grained recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Relative Attributes for Enhanced Human-Machine Communication.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Interactive Co-segmentation of Objects in Image Collections.
Springer Briefs in Computer Science, Springer, ISBN: 978-1-4614-1915-0, 2011

Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance.
Int. J. Comput. Vis., 2011

Understanding the Intrinsic Memorability of Images.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Relative attributes.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Recognizing jumbled images: The role of local and global information in image classification.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Extracting adaptive contextual cues from unlabeled regions.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Finding the weakest link in person detectors.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Interactively building a discriminative vocabulary of nameable attributes.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Inference for order reduction in Markov random fields.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
The role of features, algorithms and data in visual recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

iCoseg: Interactive co-segmentation with intelligent scribble guidance.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Beyond trees: MRF inference via outer-planar decomposition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Unsupervised Modeling of Objects and Their Hierarchical Contextual Interactions.
EURASIP J. Image Video Process., 2009

Semi-supervised co-training and active learning based approach for multi-view intrusion detection.
Proceedings of the 2009 ACM Symposium on Applied Computing (SAC), 2009

Seed Image Selection in interactive cosegmentation.
Proceedings of the International Conference on Image Processing, 2009

Unsupervised learning of hierarchical spatial structures in images.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Cutout-search: Putting a name to the picture.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2008
Data Fusion and Cost Minimization for Intrusion Detection.
IEEE Trans. Inf. Forensics Secur., 2008

An ensemble based data fusion approach for early diagnosis of Alzheimer's disease.
Inf. Fusion, 2008

Localization and Segmentation of A 2D High Capacity Color Barcode.
Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

Bringing diverse classifiers to common grounds: dtransform.
Proceedings of the IEEE International Conference on Acoustics, 2008

Determining Patch Saliency Using Low-Level Context.
Proceedings of the Computer Vision, 2008

From appearance to context-based recognition: Dense labeling in small images.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
An Ensemble-Based Incremental Learning Approach to Data Fusion.
IEEE Trans. Syst. Man Cybern. Part B, 2007

Feature-based Part Retrieval for Interactive 3D Reassembly.
Proceedings of the 8th IEEE Workshop on Applications of Computer Vision (WACV 2007), 2007

Hierarchical Semantics of Objects (hSOs).
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Unsupervised Learning of Hierarchical Semantics of Objects (hSOs).
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Unsupervised Identification of Multiple Objects of Interest from Multiple Images: dISCOVER.
Proceedings of the Computer Vision, 2007

2004
Combining classifiers for multisensor data fusion.
Proceedings of the IEEE International Conference on Systems, 2004

