Abhinav Gupta

Orcid: 0000-0003-2298-3063

  • Facebook AI Research (FAIR), Pittsburgh, PA, USA
  • Carnegie Mellon University, Robotics Institute, Pittsburgh, PA, USA

According to our database, Abhinav Gupta authored at least 210 papers between 2004 and 2024.


HRP: Human Affordances for Robotic Pre-Training.
CoRR, 2024

Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning.
CoRR, 2024

Track2Act: Predicting Point Tracks from Internet Videos enables Diverse Zero-shot Robot Manipulation.
CoRR, 2024

G-HOP: Generative Hand-Object Prior for Interaction Reconstruction and Grasp Synthesis.
CoRR, 2024

Exploitation-Guided Exploration for Semantic Embodied Navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Hearing Touch: Audio-Visual Pretraining for Contact-Rich Manipulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

All the Feels: A Dexterous Hand With Large-Area Tactile Sensing.
IEEE Robotics Autom. Lett., December, 2023

Guest Editorial: Introduction to the Special Section on Graphs in Vision and Pattern Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking.
CoRR, 2023

Zero-Shot Robot Manipulation from Passive Human Videos.
CoRR, 2023

DragonClaw: A low-cost pneumatic gripper with integrated magnetic sensing.
Proceedings of the IEEE International Conference on Soft Robotics, 2023

Real World Offline Reinforcement Learning with Realistic Data Source.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Train Offline, Test Online: A Real Robot Learning Benchmark.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Visual Affordance Prediction for Guiding Robot Exploration.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Affordance Diffusion: Synthesizing Hand-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

An Unbiased Look at Datasets for Visuo-Motor Pre-Training.
Proceedings of the Conference on Robot Learning, 2023

Evaluating Continual Learning on a Home Robot.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Last-Mile Embodied Visual Navigation.
CoRR, 2022

All the Feels: A dexterous hand with large area sensing.
CoRR, 2022

Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction.
CoRR, 2022

Human-to-Robot Imitation in the Wild.
Proceedings of the Robotics: Science and Systems XVIII, New York City, NY, USA, June 27, 2022

Learning State-Aware Visual Representations from Audible Interactions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?
Proceedings of the Learning for Dynamics and Control Conference, 2022

The Unsurprising Effectiveness of Pre-Trained Vision Models for Control.
Proceedings of the International Conference on Machine Learning, 2022

The Challenges of Continuous Self-Supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

What's in your hands? 3D Reconstruction of Generic Objects in Hands.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pretrain, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

R3M: A Universal Visual Representation for Robot Manipulation.
Proceedings of the Conference on Robot Learning, 2022

CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning Agents.
Proceedings of the Conference on Lifelong Learning Agents, 2022

Self-Activating Neural Ensembles for Continual Reinforcement Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2022

Editorial: Introduction to the Special Section on CVPR2019 Best Papers.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Hierarchical Neural Dynamic Policies.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021

Interesting Object, Curious Agent: Learning Task-Agnostic Exploration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

No RL, No Simulation: Learning to Navigate without Navigating.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

RB2: Robotic Manipulation Benchmarking with a Twist.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

droidlet: modular, heterogenous, multi-modal agents.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

PixelTransformer: Sample Conditioned Signal Generation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Audio-Visual Floorplan Reconstruction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Where2Act: From Pixels to Actions for Articulated 3D Objects.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

The Functional Correspondence Problem.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learn-to-Race: A Multimodal Control Environment for Autonomous Racing.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Shelf-Supervised Mesh Prediction in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Robots on Demand: A Democratized Robotics Research Cloud.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

ReSkin: versatile, replaceable, lasting tactile skins.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

Implicit Mesh Reconstruction from Unannotated Image Collections.
CoRR, 2020

Empirically Verifying Hypotheses Using Reinforcement Learning.
CoRR, 2020

Beyond the Camera: Neural Networks in World Coordinates.
CoRR, 2020

Swoosh! Rattle! Thump! - Actions that Sound.
Proceedings of the Robotics: Science and Systems XVI, 2020

Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

See, Hear, Explore: Curiosity via Audio-Visual Association.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Object Goal Navigation using Goal-Oriented Semantic Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Dynamic Policies for End-to-End Sensorimotor Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Efficient Bimanual Manipulation Using Learned Task Schemas.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Learning Robot Skills with Temporal Variational Inference.
Proceedings of the 37th International Conference on Machine Learning, 2020

Dynamics-Aware Embeddings.
Proceedings of the 8th International Conference on Learning Representations, 2020

Discovering Motor Programs by Recomposing Demonstrations.
Proceedings of the 8th International Conference on Learning Representations, 2020

Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Intrinsic Motivation for Encouraging Synergistic Behavior.
Proceedings of the 8th International Conference on Learning Representations, 2020

Learning To Explore Using Active Neural SLAM.
Proceedings of the 8th International Conference on Learning Representations, 2020

Aligning Videos in Space and Time.
Proceedings of the Computer Vision - ECCV 2020, 2020

Semantic Curiosity for Active Visual Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

ClusterFit: Improving Generalization of Visual Representations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Articulation-Aware Canonical Surface Mapping.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Use the Force, Luke! Learning to Predict Physical Forces by Simulating Effects.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Topological SLAM for Visual Navigation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Visual Imitation Made Easy.
Proceedings of the 4th Conference on Robot Learning, 2020

Same Object, Different Grasps: Data and Semantic Knowledge for Task-Oriented Grasping.
Proceedings of the 4th Conference on Robot Learning, 2020

Transformers for One-Shot Visual Imitation.
Proceedings of the 4th Conference on Robot Learning, 2020

From Images to 3D Shape Attributes.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

PyRobot: An Open-source Robotics Framework for Research and Benchmarking.
CoRR, 2019

Third-Person Visual Imitation Learning via Decoupled Hierarchical Controller.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Self-Supervised Exploration via Disagreement.
Proceedings of the 36th International Conference on Machine Learning, 2019

Environment Probing Interaction Policies.
Proceedings of the 7th International Conference on Learning Representations, 2019

Visual Semantic Navigation using Scene Priors.
Proceedings of the 7th International Conference on Learning Representations, 2019

Bounce and Learn: Modeling Scene Dynamics with Real-World Bounces.
Proceedings of the 7th International Conference on Learning Representations, 2019

Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies.
Proceedings of the 7th International Conference on Learning Representations, 2019

Learning Exploration Policies for Navigation.
Proceedings of the 7th International Conference on Learning Representations, 2019

Compositional Video Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Task-Driven Modular Networks for Zero-Shot Compositional Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Canonical Surface Mapping via Geometric Cycle Consistency.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

3D-RelNet: Joint Object and Relational Network for 3D Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Scaling and Benchmarking Self-Supervised Visual Representation Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Object-centric Forward Modeling for Model Predictive Control.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

BOLD5000: A public fMRI dataset of 5000 images.
CoRR, 2018

Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos.
CoRR, 2018

Never-ending learning.
Commun. ACM, 2018

Beyond Grids: Learning Graph Representations for Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Hardware Conditioned Policies for Multi-Robot Transfer Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Grasp Without Seeing.
Proceedings of the 2018 International Symposium on Experimental Robotics, 2018

Learning 6-DOF Grasping Interaction via Deep Geometry-Aware 3D Representations.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

CASSL: Curriculum Accelerated Self-Supervised Learning.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Interpretable Intuitive Physics Model.
Proceedings of the Computer Vision - ECCV 2018, 2018

Videos as Space-Time Region Graphs.
Proceedings of the Computer Vision - ECCV 2018, 2018

Compositional Learning for Human Object Interaction.
Proceedings of the Computer Vision - ECCV 2018, 2018

Actor and Observer: Joint Modeling of First and Third-Person Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning by Asking Questions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Iterative Visual Reasoning Beyond Convolutions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Non-Local Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data for Imitation.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

AI2-THOR: An Interactive 3D Environment for Visual AI.
CoRR, 2017

Learning Grasping Interaction with Geometry-aware 3D Representations.
CoRR, 2017

WebVision Challenge: Visual Learning and Understanding With Web Data.
CoRR, 2017

An Implementation of Faster RCNN with Study for Region Sampling.
CoRR, 2017

PixelNet: Representation of the pixels, by the pixels, and for the pixels.
CoRR, 2017

Learning to fly by crashing.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Learning to push by grasping: Using multiple tasks for effective learning.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Supervision via competition: Robot adversaries for learning tasks.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Robust Adversarial Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Visual Semantic Planning Using Deep Successor Representations.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

The Pose Knows: Video Forecasting by Generating Pose Futures.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era.
Proceedings of the IEEE International Conference on Computer Vision, 2017

What Actions are Needed for Understanding Human Actions in Videos?
Proceedings of the IEEE International Conference on Computer Vision, 2017

Spatial Memory for Context Reasoning in Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Transitive Invariance for Self-Supervised Visual Representation Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning from Noisy Large-Scale Datasets with Minimal Supervision.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Asynchronous Temporal Fields for Action Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

From Red Wine to Red Tomato: Composition with Context.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

The More You Know: Using Knowledge Graphs for Image Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

What's in a Question: Using Visual Questions as a Form of Supervision.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Binge Watching: Scaling Affordance Learning from Sitcoms.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Beyond Skip Connections: Top-Down Modulation for Object Detection.
CoRR, 2016

Pose from Action: Unsupervised Learning of Pose Features based on Motion.
CoRR, 2016

Understanding Higher-Order Shape via 3D Shape Attributes.
CoRR, 2016

PixelNet: Towards a General Pixel-level Architecture.
CoRR, 2016

Cutting through the clutter: Task-relevant features for image matching.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Supersizing self-supervision: Learning to grasp from 50K tries and 700 robot hours.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Much Ado About Time: Exhaustive Annotation of Temporal Data.
Proceedings of the Fourth AAAI Conference on Human Computation and Crowdsourcing, 2016

Generative Image Modeling Using Style and Structure Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

An Uncertain Future: Forecasting from Static Images Using Variational Autoencoders.
Proceedings of the Computer Vision - ECCV 2016, 2016

Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding.
Proceedings of the Computer Vision - ECCV 2016, 2016

Learning Visual Storylines with Skipping Recurrent Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Contextual Priming and Feedback for Faster R-CNN.
Proceedings of the Computer Vision - ECCV 2016, 2016

The Curious Robot: Learning Visual Representations via Physical Interactions.
Proceedings of the Computer Vision - ECCV 2016, 2016

"What Happens If..." Learning to Predict the Effect of Forces in Images.
Proceedings of the Computer Vision - ECCV 2016, 2016

The Visual Object Tracking VOT2016 Challenge Results.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Learning a Predictable and Generative Vector Representation for Objects.
Proceedings of the Computer Vision - ECCV 2016, 2016

Actions ~ Transformations.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Training Region-Based Object Detectors with Online Hard Example Mining.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Cross-Stitch Networks for Multi-task Learning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

3D Shape Attributes.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Marr Revisited: 2D-3D Alignment via Surface Normal Prediction.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Applying artificial vision models to human scene understanding.
Frontiers Comput. Neurosci., 2015

Transferring Rich Feature Hierarchies for Robust Visual Tracking.
CoRR, 2015

In Defense of the Direct Perception of Affordances.
CoRR, 2015

Mid-level Elements for Object Detection.
CoRR, 2015

What makes Paris look like Paris?
Commun. ACM, 2015

Unsupervised Learning of Visual Representations Using Videos.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Dense Optical Flow Prediction from a Static Image.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Single Image 3D without a Single 3D Image.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Unsupervised Visual Representation Learning by Context Prediction.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Webly Supervised Learning of Convolutional Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Designing deep networks for surface normal estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Sense discovery via co-clustering on images and text.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

People Watching: Human Actions as a Cue for Single View Geometry.
Int. J. Comput. Vis., 2014

BBN VISER TRECVID 2014 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Unfolding an Indoor Origami World.
Proceedings of the Computer Vision - ECCV 2014, 2014

Context as Supervisory Signal: Discovering Objects with Predictable Context.
Proceedings of the Computer Vision - ECCV 2014, 2014

Patch to the Future: Unsupervised Visual Prediction.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Enriching Visual Knowledge Bases via Object Discovery and Segmentation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

BBN VISER TRECVID 2013 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Mid-level Visual Element Discovery as Discriminative Mode Seeking.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Building Part-Based Object Detectors via 3D Geometry.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Data-Driven 3D Primitives for Single Image Understanding.
Proceedings of the IEEE International Conference on Computer Vision, 2013

NEIL: Extracting Visual Knowledge from Web Data.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Representing Videos Using Mid-level Discriminative Patches.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

BBNVISER : BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Exemplar-SVMs for Visual Object Detection, Label Transfer and Image Retrieval.
Proceedings of the 29th International Conference on Machine Learning, 2012

Unsupervised Discovery of Mid-Level Discriminative Patches.
Proceedings of the Computer Vision - ECCV 2012, 2012

Constrained Semi-Supervised Learning Using Attributes and Comparative Attributes.
Proceedings of the Computer Vision - ECCV 2012, 2012

Scene Semantics from Long-Term Observation of People.
Proceedings of the Computer Vision - ECCV 2012, 2012

Data-driven visual similarity for cross-domain image matching.
ACM Trans. Graph., 2011

Demonstration of Integrated Micro-Electro-Mechanical Relay Circuits for VLSI Applications.
IEEE J. Solid State Circuits, 2011

Ensemble of exemplar-SVMs for object detection and beyond.
Proceedings of the IEEE International Conference on Computer Vision, 2011

From 3D scene geometry to human workspace.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Piecing together the segmentation jigsaw using context.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Estimating Spatial Layout of Rooms using Volumetric Reasoning about Objects and Surfaces.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Demonstration of integrated micro-electro-mechanical switch circuits for VLSI applications.
Proceedings of the IEEE International Solid-State Circuits Conference, 2010

Learning What and How of Contextual Models for Scene Labeling.
Proceedings of the Computer Vision - ECCV 2010, 2010

Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics.
Proceedings of the Computer Vision - ECCV 2010, 2010

Beyond active noun tagging: Modeling contextual interactions for multi-class active learning.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Beyond Nouns and Verbs.
PhD thesis, 2009

Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

A 90 nm CMOS Low-Power 60 GHz Transceiver With Integrated Baseband Circuitry.
IEEE J. Solid State Circuits, 2009

A 90nm CMOS low-power 60GHz transceiver with integrated baseband circuitry.
Proceedings of the IEEE International Solid-State Circuits Conference, 2009

Understanding videos, constructing plots: learning a visually grounded storyline model from annotated videos.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Constraint Integration for Efficient Multiview Pose Estimation with Self-Occlusions.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

A "Shape Aware" Model for semi-supervised Learning of Objects and its Context.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers.
Proceedings of the Computer Vision - ECCV 2008, 2008

Context and observation driven latent variable model for human pose estimation.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

COST: An Approach for Camera Selection and Multi-Object Inference Ordering in Dynamic Scenes.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Objects in Action: An Approach for Combining Action Understanding and Object Perception.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Constraint Integration for Multiview Pose Estimation of Humans with Self-Occlusions.
Proceedings of the 3rd International Symposium on 3D Data Processing, 2006

Extracting regions of symmetry.
Proceedings of the 2005 International Conference on Image Processing, 2005

Non-linear Dimensionality Reduction by Locally Linear Isomaps.
Proceedings of the Neural Information Processing, 11th International Conference, 2004

Watermarking of MPEG-4 Videos.
Proceedings of the Biometric Authentication, First International Conference, 2004
