Fei-Fei Li

According to our database, Fei-Fei Li authored at least 261 papers between 2003 and 2019.

Bibliography

2019
Scene Graph Prediction with Limited Labels.
CoRR, 2019

Information Maximizing Visual Question Generation.
CoRR, 2019

Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks.
CoRR, 2019

Audio-Linguistic Embeddings for Spoken Sentences.
CoRR, 2019

Peeking into the Future: Predicting Future Person Activities and Locations in Videos.
CoRR, 2019

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion.
CoRR, 2019

Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation.
CoRR, 2019

D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation.
CoRR, 2019

2018
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos.
International Journal of Computer Vision, 2018

Composing Text and Image for Image Retrieval - An Empirical Odyssey.
CoRR, 2018

Vision-Based Gait Analysis for Senior Care.
CoRR, 2018

Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference.
CoRR, 2018

A Fully Private Pipeline for Deep Learning on Electronic Health Records.
CoRR, 2018

Privacy-Preserving Action Recognition for Smart Hospitals using Low-Resolution Depth Images.
CoRR, 2018

Measuring Depression Symptom Severity from Spoken Language and 3D Facial Expressions.
CoRR, 2018

RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation.
CoRR, 2018

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks.
CoRR, 2018

HiDDeN: Hiding Data With Deep Networks.
CoRR, 2018

Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration.
CoRR, 2018

Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision.
CoRR, 2018

Flexible Neural Representation for Physics Prediction.
CoRR, 2018

Learning to Decompose and Disentangle Representations for Video Prediction.
CoRR, 2018

Image Generation from Scene Graphs.
CoRR, 2018

DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer.
CoRR, 2018

Iterative Visual Reasoning Beyond Convolutions.
CoRR, 2018

Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks.
CoRR, 2018

Referring Relationships.
CoRR, 2018

Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks.
CoRR, 2018

Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation.
CoRR, 2018

Learning to Play with Intrinsically-Motivated Self-Aware Agents.
CoRR, 2018

Scaling Human-Object Interaction Recognition Through Zero-Shot Learning.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Engagement Learning: Expanding Visual Knowledge by Engaging Online Participants.
Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology Adjunct Proceedings, 2018

Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision.
Proceedings of the Robotics: Science and Systems XIV, 2018

Flexible neural representation for physics prediction.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Decompose and Disentangle Representations for Video Prediction.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Play With Intrinsically-Motivated, Self-Aware Agents.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

3D Point Cloud-Based Visual Prediction of ICU Mobility Care Activities.
Proceedings of the Machine Learning for Healthcare Conference, 2018

Neural Task Programming: Learning to Generalize Across Hierarchical Tasks.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go?
Proceedings of the 35th International Conference on Machine Learning, 2018

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels.
Proceedings of the 35th International Conference on Machine Learning, 2018

HiDDeN: Hiding Data With Deep Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Graph Distillation for Action Detection with Privileged Modalities.
Proceedings of the Computer Vision - ECCV 2018, 2018

Progressive Neural Architecture Search.
Proceedings of the Computer Vision - ECCV 2018, 2018

Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Dynamic Task Prioritization for Multitask Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Graph Matching Networks for Fewshot 3D Action Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Thoracic Disease Identification and Localization With Limited Supervision.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Referring Relationships.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Image Generation From Scene Graphs.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Iterative Visual Reasoning Beyond Convolutions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017
Deep Visual-Semantic Alignments for Generating Image Descriptions.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Evidence for similar patterns of neural activity elicited by picture- and word-based representations of natural scenes.
NeuroImage, 2017

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations.
International Journal of Computer Vision, 2017

MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels.
CoRR, 2017

Progressive Neural Architecture Search.
CoRR, 2017

Label Efficient Learning of Transferable Representations across Domains and Tasks.
CoRR, 2017

Graph Distillation for Action Detection with Privileged Information.
CoRR, 2017

Thoracic Disease Identification and Localization with Limited Supervision.
CoRR, 2017

Neural Task Programming: Learning to Generalize Across Hierarchical Tasks.
CoRR, 2017

Scalable Annotation of Fine-Grained Categories Without Experts.
CoRR, 2017

Fine-Grained Car Detection for Visual Census Estimation.
CoRR, 2017

Fine-grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach.
CoRR, 2017

Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance.
CoRR, 2017

Visual Semantic Planning using Deep Successor Representations.
CoRR, 2017

Learning to Learn from Noisy Web Videos.
CoRR, 2017

Tackling Over-pruning in Variational Autoencoders.
CoRR, 2017

Scene Graph Generation by Iterative Message Passing.
CoRR, 2017

Unsupervised Learning of Long-Term Motion Dynamics for Videos.
CoRR, 2017

Dense-Captioning Events in Videos.
CoRR, 2017

Inferring and Executing Programs for Visual Reasoning.
CoRR, 2017

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos.
CoRR, 2017

ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems.
CoRR, 2017

Characterizing and Improving Stability in Neural Style Transfer.
CoRR, 2017

Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US.
CoRR, 2017

Label Efficient Learning of Transferable Representations across Domains and Tasks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance.
Proceedings of the Machine Learning for Health Care Conference, 2017

Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Unsupervised camera localization in crowded spaces.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Visual Semantic Planning Using Deep Successor Representations.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Dense-Captioning Events in Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Inferring and Executing Programs for Visual Reasoning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Characterizing and Improving Stability in Neural Style Transfer.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Knowledge Acquisition for Visual Question Answering via Iterative Querying.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning to Learn from Noisy Web Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Scene Graph Generation by Iterative Message Passing.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Learning of Long-Term Motion Dynamics for Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

A Hierarchical Approach for Generating Descriptive Image Paragraphs.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality.
Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, 2017

Scalable Annotation of Fine-Grained Categories Without Experts.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

End-to-End, Single-Stream Temporal Action Detection in Untrimmed Videos.
Proceedings of the British Machine Vision Conference 2017, 2017

Computer Vision-based Approach to Maintain Independent Living for Seniors.
Proceedings of the AMIA 2017, 2017

Fine-Grained Car Detection for Visual Census Estimation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Learning to Predict Human Behavior in Crowded Scenes.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

Tracking Millions of Humans in Crowded Spaces.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016
Leveraging the Wisdom of the Crowd for Fine-Grained Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Typicality sharpens category representations in object-selective cortex.
NeuroImage, 2016

Crowdsourcing in Computer Vision.
Foundations and Trends in Computer Graphics and Vision, 2016

Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning.
CoRR, 2016

Visual Relationship Detection with Language Priors.
CoRR, 2016

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations.
CoRR, 2016

Embracing Error to Enable Rapid Crowdsourcing.
CoRR, 2016

A Hierarchical Approach for Generating Descriptive Image Paragraphs.
CoRR, 2016

Crowdsourcing in Computer Vision.
CoRR, 2016

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning.
CoRR, 2016

Perceptual Losses for Real-Time Style Transfer and Super-Resolution.
CoRR, 2016

Connectionist Temporal Modeling for Weakly Supervised Action Labeling.
CoRR, 2016

A Glimpse Far into the Future: Understanding Long-term Crowd Worker Accuracy.
CoRR, 2016

Viewpoint Invariant 3D Human Pose Estimation with Recurrent Error Feedback.
CoRR, 2016

Recurrent Attention Models for Depth-Based Person Identification.
CoRR, 2016

Toward More Gender Diversity in CS through an Artificial Intelligence Summer Program for High School Girls.
Proceedings of the 47th ACM Technical Symposium on Computing Science Education, Memphis, TN, USA, March 02, 2016

Vision-Based Classification of Developmental Disorders Using Eye-Movements.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

Visual Relationship Detection with Language Priors.
Proceedings of the Computer Vision - ECCV 2016, 2016

The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition.
Proceedings of the Computer Vision - ECCV 2016, 2016

Perceptual Losses for Real-Time Style Transfer and Super-Resolution.
Proceedings of the Computer Vision - ECCV 2016, 2016

Connectionist Temporal Modeling for Weakly Supervised Action Labeling.
Proceedings of the Computer Vision - ECCV 2016, 2016

Towards Viewpoint Invariant 3D Human Pose Estimation.
Proceedings of the Computer Vision - ECCV 2016, 2016

What's the Point: Semantic Segmentation with Point Supervision.
Proceedings of the Computer Vision - ECCV 2016, 2016

Visual7W: Grounded Question Answering in Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

End-to-End Learning of Action Detection from Frame Glimpses in Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Detecting Events and Key Actors in Multi-person Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DenseCap: Fully Convolutional Localization Networks for Dense Captioning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recurrent Attention Models for Depth-Based Person Identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Social LSTM: Human Trajectory Prediction in Crowded Spaces.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Embracing Error to Enable Rapid Crowdsourcing.
Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 2016

Vision-Based Hand Hygiene Monitoring in Hospitals.
Proceedings of the AMIA 2016, 2016

2015
Basic Level Category Structure Emerges Gradually across Human Ventral Visual Cortex.
J. Cognitive Neuroscience, 2015

ImageNet Large Scale Visual Recognition Challenge.
International Journal of Computer Vision, 2015

Building a Large-scale Multimodal Knowledge Base for Visual Question Answering.
CoRR, 2015

Visual7W: Grounded Question Answering in Images.
CoRR, 2015

End-to-end Learning of Action Detection from Frame Glimpses in Videos.
CoRR, 2015

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos.
CoRR, 2015

Improving Image Classification with Location Context.
CoRR, 2015

What's the point: Semantic segmentation with point supervision.
CoRR, 2015

Learning Temporal Embeddings for Complex Video Analysis.
CoRR, 2015

Detecting events and key actors in multi-person videos.
CoRR, 2015

The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition.
CoRR, 2015

Visualizing and Understanding Recurrent Networks.
CoRR, 2015

DenseCap: Fully Convolutional Localization Networks for Dense Captioning.
CoRR, 2015

Love Thy Neighbors: Image Annotation by Exploiting Image Metadata.
CoRR, 2015

SentenceRacer: A Game with a Purpose for Image Sentence Annotation.
CoRR, 2015

Adaptive mesh method for topology optimization of fluid flow.
Appl. Math. Lett., 2015

Improving Image Classification with Location Context.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning Temporal Embeddings for Complex Video Analysis.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Love Thy Neighbors: Image Annotation by Exploiting Image Metadata.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

RGB-W: When Vision Meets Wireless.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Best of both worlds: Human-machine collaboration for object annotation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning semantic relationships for better action retrieval in images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Fine-grained recognition without part annotations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deep visual-semantic alignments for generating image descriptions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Image retrieval using scene graphs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Generating Semantically Precise Scene Graphs from Textual Descriptions for Improved Image Retrieval.
Proceedings of the Fourth Workshop on Vision and Language, 2015

2014
Object Bank: An Object-Level Image Representation for High-Level Visual Recognition.
International Journal of Computer Vision, 2014

VideoSET: Video Summary Evaluation through Text.
CoRR, 2014

ImageNet Large Scale Visual Recognition Challenge.
CoRR, 2014

Deep Fragment Embeddings for Bidirectional Image Sentence Mapping.
CoRR, 2014

Deep Visual-Semantic Alignments for Generating Image Descriptions.
CoRR, 2014

Affordances Provide a Fundamental Categorization Principle for Visual Scenes.
CoRR, 2014

Visual Noise from Natural Scene Statistics Reveals Human Scene Category Representations.
CoRR, 2014

Understanding the 3D layout of a cluttered room from multiple images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Deep Fragment Embeddings for Bidirectional Image Sentence Mapping.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Learning Features and Parts for Fine-Grained Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Reasoning about Object Affordances in a Knowledge Base Representation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Linking People in Videos with "Their" Names Using Coreference Resolution.
Proceedings of the Computer Vision - ECCV 2014, 2014

Efficient Image and Video Co-localization with Frank-Wolfe Algorithm.
Proceedings of the Computer Vision - ECCV 2014, 2014

Co-localization in Real-World Images.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Large-Scale Video Classification with Convolutional Neural Networks.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Socially-Aware Large-Scale Crowd Forecasting.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Discovering the Signatures of Joint Attention in Child-Caregiver Interaction.
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Scalable multi-label annotation.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2014

Social Role Recognition for Human Event Understanding.
Proceedings of the Human-Centered Social Media Analytics, 2014

Integrating Randomization and Discrimination for Classifying Human-Object Interaction Activities.
Proceedings of the Human-Centered Social Media Analytics, 2014

2013
Differential connectivity within the Parahippocampal Place Area.
NeuroImage, 2013

Object discovery in 3D scenes via shape analysis.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

3D Object Representations for Fine-Grained Categorization.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Discovering Object Functionality.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Combining the Right Features for Complex Event Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Detecting Avocados to Zucchinis: What Have We Done, and Where Are We Going?
Proceedings of the IEEE International Conference on Computer Vision, 2013

Video Event Understanding Using Natural Language Descriptions.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminative Segment Annotation in Weakly Labeled Video.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Social Role Discovery in Human Events.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Fine-Grained Crowdsourcing for Fine-Grained Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Free your Camera: 3D Indoor Scene Understanding from Arbitrary Camera Motion.
Proceedings of the British Machine Vision Conference, 2013

2012
Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Voxel-level functional connectivity using spatial regularization.
NeuroImage, 2012

Efficient Euclidean Projections onto the Intersection of Norm Balls.
CoRR, 2012

Shifting Weights: Adapting Object Detectors from Image to Video.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Web image prediction using multivariate point processes.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Efficient Euclidean Projections onto the Intersection of Norm Balls.
Proceedings of the 29th International Conference on Machine Learning, 2012

Crowdsourcing Annotations for Visual Object Detection.
Proceedings of the 4th Human Computation Workshop, 2012

Action Recognition with Exemplar Based 2.5D Graph Matching.
Proceedings of the Computer Vision - ECCV 2012, 2012

Object-Centric Spatial Pooling for Image Classification.
Proceedings of the Computer Vision - ECCV 2012, 2012

A codebook-free and annotation-free approach for fine-grained image categorization.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Learning latent temporal structure for complex event detection.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Multi-Level Structured Image Coding on High-Dimensional Image Representation.
Proceedings of the Computer Vision, 2012

2011
ReVision: automated classification, analysis and redesign of chart images.
Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, 2011

Large-Scale Category Structure Aware Image Categorization.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Human action recognition by learning bases of action attributes and parts.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Distributed cosegmentation via submodular optimization on anisotropic diffusion.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Online detection of unusual events in videos via dynamic sparse coding.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Combining randomization and discrimination for fine-grained image categorization.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Hierarchical semantic indexing for large scale image retrieval.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Multi-view Object Categorization and Pose Estimation.
Proceedings of the Computer Vision: Detection, Recognition and Reconstruction, 2010

What, Where and Who? Telling the Story of an Image by Activity Classification, Scene Recognition and Object Categorization.
Proceedings of the Computer Vision: Detection, Recognition and Reconstruction, 2010

Learning Object Categories From Internet Image Searches.
Proceedings of the IEEE, 2010

OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning.
International Journal of Computer Vision, 2010

Large Margin Learning of Upstream Scene Understanding Models.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Image Segmentation with Topic Random Field.
Proceedings of the Computer Vision - ECCV 2010, 2010

Attribute Learning in Large-Scale Datasets.
Proceedings of the Trends and Topics in Computer Vision, 2010

Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification.
Proceedings of the Computer Vision, 2010

Objects as Attributes for Scene Classification.
Proceedings of the Trends and Topics in Computer Vision, 2010

What Does Classifying More Than 10,000 Image Categories Tell Us?
Proceedings of the Computer Vision - ECCV 2010, 2010

Modeling mutual context of object and human pose in human-object interaction activities.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Grouplet: A structured image representation for recognizing human and object interactions.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Efficient extraction of human motion volumes by tracking.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Building and using a semantivisual image hierarchy.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Hierarchical Mixture of Classification Experts Uncovers Interactions between Brain Regions.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Exploring Functional Connectivities of the Human Brain using Multivariate Information Analysis.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Mining discriminative adjectives and prepositions for natural scene recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Simultaneous image classification and annotation.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

A multi-view probabilistic model for 3D object classes.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Towards total scene understanding: Classification, annotation and segmentation in an automatic framework.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

ImageNet: A large-scale hierarchical image database.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words.
International Journal of Computer Vision, 2008

Variational Transform Invariant Mixture of Probabilistic PCA.
Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

View Synthesis for Recognizing Unseen Poses of Object Classes.
Proceedings of the Computer Vision, 2008

Extracting Moving People from Internet Videos.
Proceedings of the Computer Vision, 2008

Towards Scalable Dataset Construction: An Active Learning Approach.
Proceedings of the Computer Vision, 2008

2007
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories.
Computer Vision and Image Understanding, 2007

3D generic object categorization, localization and pose estimation.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

What, where and who? Classifying events by scene and object recognition.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

A Hierarchical Model of Shape and Appearance for Human Action Classification.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

OPTIMOL: automatic Online Picture collecTion via Incremental MOdel Learning.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

OPTIMOL: A Framework for Online Picture Collection via Incremental Model Learning.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
One-Shot Learning of Object Categories.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Variational Shift Invariant Probabilistic PCA for Face Recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Audio-Visual Speaker Localization Using Graphical Models.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Using Dependent Regions for Object Categorization in a Generative Framework.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words.
Proceedings of the British Machine Vision Conference 2006, 2006

2005
Learning Object Categories from Google's Image Search.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

A Bayesian Hierarchical Model for Learning Natural Scene Categories.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

What do reflections tell us about the shape of a mirror?
Proceedings of the 1st Symposium on Applied Perception in Graphics and Visualization, 2004

2003
A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

