Jitendra Malik

Orcid: 0000-0003-3695-1580

Affiliations:
  • University of California, Berkeley, USA


According to our database1, Jitendra Malik authored at least 364 papers between 1983 and 2024.

Collaborative distances:
  • Dijkstra number2 of three.
  • Erdős number3 of two.

Awards

ACM Fellow

ACM Fellow 2008, "For contributions to computer vision.".

IEEE Fellow

IEEE Fellow 2006, "For contributions to computer vision and image analysis.".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
AutoEval Done Right: Using Synthetic Data for Model Evaluation.
CoRR, 2024

Twisting Lids Off with Two Hands.
CoRR, 2024

xT: Nested Tokenization for Larger Context in Large Images.
CoRR, 2024

Humanoid Locomotion as Next Token Prediction.
CoRR, 2024

Synthesizing Moving People with 3D Control.
CoRR, 2024

Dr<sup>2</sup>Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning.
CoRR, 2024

2023
Navigating to objects in the real world.
Sci. Robotics, June, 2023

Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation.
CoRR, 2023

Adaptive Human Trajectory Prediction via Latent Corridors.
CoRR, 2023

Reconstructing Hands in 3D with Transformers.
CoRR, 2023

Sequential Modeling Enables Scalable Learning for Large Vision Models.
CoRR, 2023

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
CoRR, 2023

GOAT: GO to Any Thing.
CoRR, 2023

Conformal Policy Learning for Sensorimotor Control Under Distribution Shifts.
CoRR, 2023

Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots.
CoRR, 2023

Interactive Task Planning with Language Models.
CoRR, 2023

What Matters to You? Towards Visual Representation Alignment for Robot Learning.
CoRR, 2023

Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions.
CoRR, 2023

Learning Vision-based Pursuit-Evasion Robot Policies.
CoRR, 2023

Learning Space-Time Semantic Correspondences.
CoRR, 2023

More Than an Arm: Using a Manipulator as a Tail for Enhanced Stability in Legged Locomotion.
CoRR, 2023

Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
CoRR, 2023

Learning Humanoid Locomotion with Transformers.
CoRR, 2023

Big Little Transformer Decoder.
CoRR, 2023

CA<sup>2</sup>T-Net: Category-Agnostic 3D Articulation Transfer from Single Image.
CoRR, 2023

EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Speculative Decoding with Big Little Decoder.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MAViL: Masked Audio-Video Learners.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning a Single Near-hover Position Controller for Vastly Different Quadcopters.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Learning Visual Locomotion with Cross-Modal Supervision.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles.
Proceedings of the International Conference on Machine Learning, 2023

Multi-skill Mobile Manipulation for Object Rearrangement.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Navigating to Objects Specified by Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Humans in 4D: Reconstructing and Tracking Humans with Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Decoupling Human and Camera Motion from Videos in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multiview Compressive Coding for 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

On the Benefits of 3D Pose and Tracking for Human Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robot Learning with Sensorimotor Pre-training.
Proceedings of the Conference on Robot Learning, 2023

General In-hand Object Rotation with Vision and Touch.
Proceedings of the Conference on Robot Learning, 2023

2022
Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Does unsupervised grammar induction need pixels?
CoRR, 2022

Instance-Specific Image Goal Navigation: Training Embodied Agents to Find Object Instances.
CoRR, 2022

Learning to Imitate Object Interactions from Internet Videos.
CoRR, 2022

Learning to Learn with Generative Models of Neural Network Checkpoints.
CoRR, 2022

A Zero-Shot Adaptive Quadcopter Controller.
CoRR, 2022

Masked Visual Pre-training for Motor Control.
CoRR, 2022

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adapting Rapid Motor Adaptation for Bipedal Robots.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging.
Proceedings of the International Conference on Machine Learning, 2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Tracking People by Predicting 3D Appearance, Location and Pose.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Human Mesh Recovery from Multiple Shots.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reversible Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MViTv2: Improved Multiscale Vision Transformers for Classification and Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


Coupling Vision and Proprioception for Navigation of Legged Robots.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ABO: Dataset and Benchmarks for Real-World 3D Object Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Differentiable Stereopsis: Meshes from multiple views using differentiable rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Real-World Robot Learning with Masked Visual Pre-training.
Proceedings of the Conference on Robot Learning, 2022

In-Hand Object Rotation via Rapid Motor Adaptation.
Proceedings of the Conference on Robot Learning, 2022

Legged Locomotion in Challenging Terrains using Egocentric Vision.
Proceedings of the Conference on Robot Learning, 2022

2021
Distribution-free, Risk-controlling Prediction Sets.
J. ACM, 2021

Tracking People by Predicting 3D Appearance, Location & Pose.
CoRR, 2021

Improved Multiscale Vision Transformers for Classification and Detection.
CoRR, 2021

Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

ABO: Dataset and Benchmarks for Real-World 3D Object Understanding.
CoRR, 2021

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans.
CoRR, 2021

Active 3D Shape Reconstruction from Vision and Touch.
CoRR, 2021

RMA: Rapid Motor Adaptation for Legged Robots.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021

Habitat 2.0: Training Home Assistants to Rearrange their Habitat.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Active 3D Shape Reconstruction from Vision and Touch.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Tracking People with 3D Representations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

PyTorchVideo: A Deep Learning Library for Video Understanding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

State-Only Imitation Learning for Dexterous Manipulation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Differentiable Spatial Planning using Transformers.
Proceedings of the 38th International Conference on Machine Learning, 2021

Learning Long-term Visual Dynamics with Region Proposal Interaction Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Uncertainty Sets for Image Classifiers using Conformal Prediction.
Proceedings of the 9th International Conference on Learning Representations, 2021

From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Reconstructing Hand-Object Interactions in the Wild.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multiscale Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Minimizing Energy Consumption Leads to the Emergence of Gaits in Legged Robots.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020
Hierarchical Surface Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Multimodal Image Synthesis with Conditional Implicit Maximum Likelihood Estimation.
Int. J. Comput. Vis., 2020

Cognitive Mapping and Planning for Visual Navigation.
Int. J. Comput. Vis., 2020

Better Knowledge Retention through Metric Learning.
CoRR, 2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation.
CoRR, 2020

Rearrangement: A Challenge for Embodied AI.
CoRR, 2020

Robust Learning Through Cross-Task Consistency.
CoRR, 2020

Audiovisual SlowFast Networks for Video Recognition.
CoRR, 2020

3D Shape Reconstruction from Vision and Touch.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Which Tasks Should Be Learned Together in Multi-task Learning?
Proceedings of the 37th International Conference on Machine Learning, 2020

Deep Isometric Learning for Visual Recognition.
Proceedings of the 37th International Conference on Machine Learning, 2020

Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Perceiving 3D Human-Object Spatial Arrangements from a Single Image in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

Inclusive GAN: Improving Data and Minority Coverage in Generative Models.
Proceedings of the Computer Vision - ECCV 2020, 2020

It Is Not the Journey But the Destination: Endpoint Conditioned Trajectory Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Shape and Viewpoint Without Keypoints.
Proceedings of the Computer Vision - ECCV 2020, 2020

Long-Term Human Motion Prediction with Scene Context.
Proceedings of the Computer Vision - ECCV 2020, 2020

Robust Learning Through Cross-Task Consistency.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Side-Tuning: Network Adaptation via Additive Side Networks.
CoRR, 2019

Learning Navigation Subroutines by Watching Videos.
CoRR, 2019

Trajectory Normalized Gradients for Distributed Optimization.
CoRR, 2019

Approximate Feature Collisions in Neural Nets.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Predicting 3D Human Dynamics From Video.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Habitat: A Platform for Embodied AI Research.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Diverse Image Synthesis From Semantic Layouts via Conditional IMLE.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Mesh R-CNN.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SlowFast Networks for Video Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning 3D Human Dynamics From Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Non-Adversarial Image Synthesis With Generative Latent Nearest Neighbors.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Individual Styles of Conversational Gesture.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Independent Object Motion From Unlabelled Stereoscopic Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning to Navigate Using Mid-Level Visual Priors.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Learning Navigation Subroutines from Egocentric Videos.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Combining Optimal Control and Learning for Visual Navigation in Novel Environments.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
SFV: reinforcement learning of physical skills from videos.
ACM Trans. Graph., 2018

More Than a Feeling: Learning to Grasp and Regrasp Using Vision and Touch.
IEEE Robotics Autom. Lett., 2018

Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Active Tasks.
CoRR, 2018

Non-Adversarial Image Synthesis with Generative Latent Nearest Neighbors.
CoRR, 2018

Are All Training Examples Created Equal? An Empirical Study.
CoRR, 2018

On the Implicit Assumptions of GANs.
CoRR, 2018

Super-Resolution via Conditional Implicit Maximum Likelihood Estimation.
CoRR, 2018

Implicit Maximum Likelihood Estimation.
CoRR, 2018

On Evaluation of Embodied Navigation Agents.
CoRR, 2018

PatchFCN for Intracranial Hemorrhage Detection.
CoRR, 2018

Visual Memory for Robust Path Following.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Cost-Sensitive Active Learning for Intracranial Hemorrhage Detection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Learning Category-Specific Mesh Reconstruction from Image Collections.
Proceedings of the Computer Vision - ECCV 2018, 2018

Taskonomy: Disentangling Task Transfer Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Gibson Env: Real-World Perception for Embodied Agents.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Instance Segmentation by Interaction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Zero-Shot Visual Imitation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

End-to-End Recovery of Human Shape and Pose.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

From Lifestyle Vlogs to Everyday Interactions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Learning Category-Specific Deformable 3D Models for Object Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Object Instance Segmentation and Fine-Grained Localization Using Hypercolumns.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Editorial- Deep Learning for Computer Vision.
Comput. Vis. Image Underst., 2017

Unifying Map and Landmark Based Representations for Visual Navigation.
CoRR, 2017

Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55.
CoRR, 2017

Learning to Optimize Neural Nets.
CoRR, 2017

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions.
CoRR, 2017

Technical Perspective: What led computer vision to deep learning?
Commun. ACM, 2017

Learning a Multi-View Stereo Machine.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Combining self-supervised learning and imitation for vision-based rope manipulation.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Fast k-Nearest Neighbour Search via Prioritized DCI.
Proceedings of the 34th International Conference on Machine Learning, 2017

Learning to Optimize.
Proceedings of the 5th International Conference on Learning Representations, 2017

What will Happen Next? Forecasting Player Moves in Sports Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Feedback Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning Shape Abstractions by Assembling Volumetric Primitives.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Cognitive Mapping and Planning for Visual Navigation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Hierarchical Surface Prediction for 3D Object Reconstruction.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
The three R's of computer vision: Recognition, reconstruction and reorganization.
Pattern Recognit. Lett., 2016

Depth Estimation and Specular Removal for Glossy Surfaces Using Point and Line Consistency with Light-Field Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Region-Based Convolutional Networks for Accurate Object Detection and Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Intrinsic Scene Properties from a Single RGB-D Image.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Feedback Networks.
CoRR, 2016

Beyond Skip Connections: Top-Down Modulation for Object Detection.
CoRR, 2016

Learning Visual Predictive Models of Physics for Playing Billiards.
Proceedings of the 4th International Conference on Learning Representations, 2016

Learning to Poke by Poking: Experiential Learning of Intuitive Physics.
CoRR, 2016

Fast k-Nearest Neighbour Search via Dynamic Continuous Indexing.
Proceedings of the 33nd International Conference on Machine Learning, 2016

View Synthesis by Appearance Flow.
Proceedings of the Computer Vision - ECCV 2016, 2016

Generic 3D Representation via Pose Estimation and Matching.
Proceedings of the Computer Vision - ECCV 2016, 2016

Amodal Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Iterative Instance Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Cross Modal Distillation for Supervision Transfer.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Human Pose Estimation with Iterative Error Feedback.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Shape, Illumination, and Reflectance from Shading.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation.
Int. J. Comput. Vis., 2015

Shape and Symmetry Induction for 3D Objects.
CoRR, 2015

Bandit Label Inference for Weakly Supervised Learning.
CoRR, 2015

Visual Semantic Role Labeling.
CoRR, 2015

Exploring Person Context and Local Scene Context for Object Detection.
CoRR, 2015

Inferring 3D Object Pose in RGB-D Images.
CoRR, 2015

Recurrent Network Models for Kinematic Tracking.
CoRR, 2015

Detecting people in Cubist art.
AI Matters, 2015

Pose Induction for Novel Object Categories.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

DeepBox: Learning Objectness with Convolutional Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Amodal Completion and Size Constancy in Natural Scenes.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Actions and Attributes from Wholes and Parts.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Contextual Action Recognition with R*CNN.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Recurrent Network Models for Human Dynamics.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning to See by Moving.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Viewpoints and keypoints.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Depth from shading, defocus, and correspondence using light-field angular coherence.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Category-specific object reconstruction from a single image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Hypercolumns for object segmentation and fine-grained localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Aligning 3D models to RGB-D images of cluttered scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Finding action tubes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deformable part models are convolutional neural networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning to segment moving objects in videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Virtual view networks for object reconstruction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Segmentation of Moving Objects by Long Term Video Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

R-CNNs for Pose Estimation and Action Detection.
CoRR, 2014

Spatio-Temporal Moving Object Proposals.
CoRR, 2014

Pixels to Voxels: Modeling Visual Representation in the Human Brain.
CoRR, 2014

Grouping-Based Low-Rank Trajectory Completion and 3D Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Depth Estimation for Glossy Surfaces with Light-Field Cameras.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Simultaneous Detection and Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Learning Rich Features from RGB-D Images for Object Detection and Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Analyzing the Performance of Multilayer Neural Networks for Object Recognition.
Proceedings of the Computer Vision - ECCV 2014, 2014

Using k-Poselets for Detecting People and Localizing Their Keypoints.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Multiscale Combinatorial Grouping.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Efficient Classification for Additive Kernel SVMs.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Sharpening Out of Focus Images using High-Frequency Transfer.
Comput. Graph. Forum, 2013

Depth from Combining Defocus and Correspondence Using Light-Field Cameras.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Training Deformable Part Models with Decorrelated Features.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Volumetric Semantic Segmentation Using Pyramid Context Features.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Classification of sidewalks in street view images.
Proceedings of the International Green Computing Conference, 2013

Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Articulated Pose Estimation Using Discriminative Armlet Classifiers.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Automated Tuberculosis Diagnosis Using Fluorescence Images from a Mobile Microscope.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2012, 2012

Discriminative Decorrelation for Clustering and Classification.
Proceedings of the Computer Vision - ECCV 2012, 2012

Multi-component Models for Object Detection.
Proceedings of the Computer Vision - ECCV 2012, 2012

Color Constancy, Intrinsic Images, and Shape Estimation.
Proceedings of the Computer Vision - ECCV 2012, 2012

Shape, albedo, and illumination from a single image of an unknown object.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Semantic segmentation using regions and parts.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Contour Detection and Hierarchical Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Semantic contours from inverse detectors.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Describing people: A poselet-based approach to attribute classification.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Occlusion boundary detection and figure/ground assignment from optical flow.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Biased normalized cuts.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Action recognition from a distributed representation of pose and appearance.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Object segmentation by alignment of poselet activations to image contours.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

High-frequency shape and albedo from shading using natural image statistics.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2010

Coupling visualization and data analysis for knowledge discovery from multi-dimensional scientific data.
Proceedings of the International Conference on Computational Science, 2010

Object Segmentation by Long Term Analysis of Point Trajectories.
Proceedings of the Computer Vision - ECCV 2010, 2010

Detecting People Using Mutually Consistent Poselet Activations.
Proceedings of the Computer Vision - ECCV 2010, 2010

2009
Visual Exploration of Three-Dimensional Gene Expression Using Physical Views and Linked Abstract Views.
IEEE ACM Trans. Comput. Biol. Bioinform., 2009

Multiple-view object recognition in band-limited distributed camera networks.
Proceedings of the Third ACM/IEEE International Conference on Distributed Smart Cameras, 2009

Multi-scale object detection by clustering lines.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Context by region ancestry.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Poselets: Body part detectors trained using 3D human pose annotations.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Object detection using a max-margin Hough transform.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Recognition using regions.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Large displacement optical flow.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

From contours to regions: An empirical evaluation.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Learning Probabilistic Models for Contour Completion in Natural Images.
Int. J. Comput. Vis., 2008

Learning to Locate Informative Features for Visual Identification.
Int. J. Comput. Vis., 2008

Recovering high dynamic range radiance maps from photographs.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2008

The future of image search.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Inferring spatial layout from a single image via depth-ordered grouping.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

Classification using intersection kernel support vector machines is efficient.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Using contours to detect and localize junctions in natural images.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Parsing Images of Architectural Scenes.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

PointCloudXplore 2: Visual Exploration of 3D Gene Expression.
Proceedings of the Visualization of Large and Unstructured Data Sets: Second workshop of the DFG's International Research Training Group "Visualization of Large and Unstructured Data Sets, 2007

Tracking as Repeated Figure/Ground Segmentation.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Recovering 3D Human Body Configurations Using Shape Contexts.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

PointCloudXplore: A Visualization Tool for 3D Gene Expression Data.
Proceedings of the Visualization of Large and Unstructured Data Sets: Applications in Geospatial Planning, Modeling and Engineering, 2006

PointCloudXplore: Visual Analysis of 3D Gene Expression Data Using Physical Views and Parallel Coordinates.
Proceedings of the 8th Joint Eurographics - IEEE VGTC Symposium on Visualization, 2006

Detecting Categories in News Video Using Acoustic, Speech, and Image Features.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Image Retrieval and Classification Using Local Distance Functions.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Figure/Ground Assignment in Natural Images.
Proceedings of the Computer Vision, 2006

SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Shape Guided Object Segmentation.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Shape Matching and Object Recognition.
Proceedings of the Toward Category-Level Object Recognition, 2006

Matching with Shape Contexts.
Proceedings of the Statistics and Analysis of Shapes, 2006

2005
Efficient Shape Matching Using Shape Contexts.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Cue Integration for Figure/Ground Labeling.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Scale-Invariant Contour Completion Using Conditional Random Fields.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Recovering Human Body Configurations Using Pairwise Constraints between Parts.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Building a Classification Cascade for Visual Identification from One Example.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Shape Matching and Object Recognition Using Low Distortion Correspondences.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Registering Drosophila Embryos at Cellular Resolution to Build a Quantitative 3D Atlas of Gene Expression Patterns and Morphology.
Proceedings of the Fourth International IEEE Computer Society Computational Systems Bioinformatics Conference Workshops & Poster Abstracts, 2005

2004
Learning to Detect Natural Image Boundaries Using Local Brightness, Color, and Texture Cues.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Spectral Grouping Using the Nyström Method.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Twist Based Acquisition and Tracking of Animal and Human Kinematics.
Int. J. Comput. Vis., 2004

Recognition and synthesis of human actions from video.
Proceedings of the 9th International Fall Workshop on Vision, Modeling, and Visualization, 2004

An Information Maximization Model of Eye Movements.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Learning Hyper-Features for Visual Identification.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Recognizing Objects in Range Data Using Regional Point Descriptors.
Proceedings of the Computer Vision, 2004

04021 Abstracts Collection - Content-Based Retrieval.
Proceedings of the Content-Based Retrieval, 4.-9. January 2004, 2004

Recovering Human Body Configurations: Combining Segmentation and Recognition.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

2003
Learning a Classification Model for Segmentation.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Fast Vehicle Detection with Probabilistic Feature Grouping and its Application to Vehicle Tracking.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Recognizing Action at a Distance.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Learning a discriminative classifier using shape context distances.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Learning Affinity Functions for Image Segmentation: Combining Patch-based and Gradient-based Approaches.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

2002
Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

Shape Matching and Object Recognition Using Shape Contexts.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

Learning to Detect Natural Image Boundaries Using Brightness and Texture.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

A Probabilistic Multi-scale Model for Contour Completion Based on Image Statistics.
Proceedings of the Computer Vision, 2002

Estimating Human Body Configurations Using Shape Context Matching.
Proceedings of the Computer Vision, 2002

Spectral Partitioning with Indefinite Kernels Using the Nyström Extension.
Proceedings of the Computer Vision, 2002

2001
Extracting Objects from Range and Radiance Images.
IEEE Trans. Vis. Comput. Graph., 2001

Contour and Texture Analysis for Image Segmentation.
Int. J. Comput. Vis., 2001

Editorial.
Int. J. Comput. Vis., 2001

Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons.
Int. J. Comput. Vis., 2001

Visual Grouping and Object Recognition.
Proceedings of the 11th International Conference on Image Analysis and Processing (ICIAP 2001), 2001

A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Matching Shapes.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Shape contexts enable efficient retrieval of similar shapes.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Efficient Spatiotemporal Grouping Using the Nystro"m Method.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

Geometric Blur for Template Matching.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

2000
Normalized Cuts and Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2000

Shape Context: A New Descriptor for Shape Matching and Object Recognition.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

1999
A Comparative Study of Vision-Based Lateral Control Strategies for Autonomous Highway Driving.
Int. J. Robotics Res., 1999

Blobworld: A System for Region-Based Image Indexing and Retrieval.
Proceedings of the Visual Information and Information Systems, 1999

Inverse Global Illumination: Recovering Reflectance Models of Real Scenes from Photographs.
Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques, 1999

Grouping in the Normalized Cut Framework.
Proceedings of the Shape, Contour and Grouping in Computer Vision, 1999

Summary of the Panel Session.
Proceedings of the Vision Algorithms: Theory and Practice, 1999

Textons, Contours and Regions: Cue Integration in Image Segmentation.
Proceedings of the International Conference on Computer Vision, 1999

Recognizing Surfaces using Three-Dimensional Textons.
Proceedings of the International Conference on Computer Vision, 1999

Region-Based Image Retrieval (Eingeladener Vortrag).
Proceedings of the Mustererkennung 1999, 1999

1998
Recovering Photometric Properties of Architectural Scenes from Photographs.
Proceedings of the 25th Annual Conference on Computer Graphics and Interactive Techniques, 1998

Image and Video Segmentation: The Normalized Cut Framework.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Motion Segmentation and Tracking Using Normalized Cuts.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Color- and Texture-based Image Segmentation Using the Expectation-Maximization Algorithm and its Application to Content-Based Image Retrieval.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Self Inducing Relational Distance and Its Application to Image Segmentation.
Proceedings of the Computer Vision, 1998

Contour Continuity in Region Based Image Segmentation.
Proceedings of the Computer Vision, 1998

Finding Boundaries in Natural Images: A New Method Using Point Descriptors and Area Completion.
Proceedings of the Computer Vision, 1998

Tracking People with Twists and Exponential Maps.
Proceedings of the 1998 Conference on Computer Vision and Pattern Recognition (CVPR '98), 1998

1997
Rigid Body Segmentation and Shape Description from Dense Optical Flow Under Weak Perspective.
IEEE Trans. Pattern Anal. Mach. Intell., 1997

Computing Local Surface Orientation and Shape from Texture for Curved Surfaces.
Int. J. Comput. Vis., 1997

Image-based rendering: really new or déjà vu? (panel).
Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, 1997

A Real-time Computer Vision System for Measuring Traffic Parameters.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

On Perpendicular Texture: Why do we see more flowers in the distance?
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

Vision for Longitudinal Vehicle Control.
Proceedings of the British Machine Vision Conference 1997, 1997

1996
Modeling and Rendering Architecture from Photographs: A Hybrid Geometry- and Image-Based Approach.
Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, 1996

Learning Appearance Based Models: Mixtures of Second Moment Experts.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

Finding objects in image databases by grouping.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

Reconstructing Polyhedral Models of Architectural Scenes from Photographs.
Proceedings of the Computer Vision, 1996

On Binocularly Viewed Occlusion Junctions.
Proceedings of the Computer Vision, 1996

Detecting, localizing and grouping repeated scene elements from an image.
Proceedings of the Computer Vision, 1996

Finding Pictures of Objects in Large Collections of Images.
Proceedings of the Object Representation in Computer Vision II, 1996

Finding Pictures of Objects in Large Collections of Images.
Proceedings of the Data Processing Clinic: Digital Image Access & Retrieval, 1996

1995
Robust computation of optical flow in a multi-scale differential framework.
Int. J. Comput. Vis., 1995

An Integrated Stereo-Based Approach to Automatic Vehicle Guidance.
Proceedings of the Procedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

Smart Cars and Smart Roads.
Proceedings of the British Machine Vision Conference, 1995

1994
Anisotropic Diffusion.
Proceedings of the Geometry-Driven Diffusion in Computer Vision, 1994

Distinctive Representations for the Recognition of Curved Surfaces Using Outlines and Markings.
Proceedings of the Object Representation in Computer Vision, 1994

Towards robust automatic traffic scene analysis in real-time.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

Recovering Surface Curvature and Orientation From Texture Distortion: A Least Squares Algorithm and Sensitivity Analysis.
Proceedings of the Computer Vision, 1994

Robust Multiple Car Tracking with Occlusion Reasoning.
Proceedings of the Computer Vision, 1994

Automatic Symbolic Traffic Scene Analysis Using Belief Networks.
Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31, 1994

1993
Action Representation and Purpose: Re-evaluating the Foundations of Computational Vision.
Proceedings of the 13th International Joint Conference on Artificial Intelligence. Chambéry, France, August 28, 1993

A differential method for computing local shape-from-texture for planar and curved surfaces.
Proceedings of the Conference on Computer Vision and Pattern Recognition, 1993

1992
Computational framework for determining stereo correspondence from a set of linear spatial filters.
Image Vis. Comput., 1992

Determining Three-Dimensional Shape from Orientation and Spatial Frequency Disparities.
Proceedings of the Computer Vision, 1992

1990
Scale-Space and Edge Detection Using Anisotropic Diffusion.
IEEE Trans. Pattern Anal. Mach. Intell., 1990

Computing the Aspect Graph for Line Drawings of Polyhedral Objects.
IEEE Trans. Pattern Anal. Mach. Intell., 1990

Detecting and localizing edges composed of steps, peaks and roofs.
Proceedings of the Third International Conference on Computer Vision, 1990

1989
Recovering Three-Dimensional Shape from a Single Image of Curved Objects.
IEEE Trans. Pattern Anal. Mach. Intell., 1989

A computational model of texture segmentation.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1989

1987
Interpreting line drawings of curved objects.
Int. J. Comput. Vis., 1987

Recovering Three Dimensional Shape from a Single Image of Curved Objects.
Proceedings of the 10th International Joint Conference on Artificial Intelligence. Milan, 1987

1983
Reasoning in Time and Space.
Proceedings of the 8th International Joint Conference on Artificial Intelligence. Karlsruhe, 1983


  Loading...