Amir Zamir

CoRR, 2024

Unraveling the Key Components of OOD Generalization via Diversification.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

4M: Massively Multimodal Masked Modeling.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Modality-invariant Visual Odometry for Embodied Vision.

Marius Memmel

Roman Bachmann

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

CLIPasso: semantically-aware object sketching.

Yael Vinker

Ehsan Pajouheshgar

Jessica Y. Bo

Roman Christian Bachmann

ACM Trans. Graph., 2022

Simple Control Baselines for Evaluating Transfer Learning.

CoRR, 2022

PALMER: Perception - Action Loop with Memory for Long-Horizon Planning.

Onur Beker

Mohammad Mohammadi

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Task Discovery: Finding the Tasks that Neural Networks Generalize on.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MultiMAE: Multi-modal Multi-task Masked Autoencoders.

Proceedings of the Computer Vision - ECCV 2022, 2022

3D Common Corruptions and Data Augmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans.

CoRR, 2021

Robustness via Cross-Domain Ensembles.

Teresa Yeo

Oguzhan Fatih Kar

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation.

CoRR, 2020

Robust Learning Through Cross-Task Consistency.

CoRR, 2020

Which Tasks Should Be Learned Together in Multi-task Learning?

Proceedings of the 37th International Conference on Machine Learning, 2020

Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks.

Proceedings of the Computer Vision - ECCV 2020, 2020

Robust Learning Through Cross-Task Consistency.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation.

Proceedings of the 4th Conference on Robot Learning, 2020

2019

Side-Tuning: Network Adaptation via Additive Side Networks.

CoRR, 2019

Taskonomy: Disentangling Task Transfer Learning.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

An Information-Theoretic Approach to Transferability in Task Transfer Learning.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning to Navigate Using Mid-Level Visual Priors.

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018

Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Active Tasks.

CoRR, 2018

On Evaluation of Embodied Navigation Agents.

Peter Anderson

Angel X. Chang

Devendra Singh Chaplot

CoRR, 2018

Gibson Env: Real-World Perception for Embodied Agents.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

The THUMOS challenge on action recognition for videos "in the wild".

Comput. Vis. Image Underst., 2017

Joint 2D-3D-Semantic Data for Indoor Scene Understanding.

CoRR, 2017

Feedback Networks.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Introduction to Large-Scale Visual Geo-localization.

Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016

Feedback Networks.

CoRR, 2016

Unsupervised Semantic Action Discovery from Video Collections.

CoRR, 2016

Generic 3D Representation via Pose Estimation and Matching.

Proceedings of the Computer Vision - ECCV 2016, 2016

Structural-RNN: Deep Learning on Spatio-Temporal Graphs.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

3D Semantic Parsing of Large-Scale Indoor Spaces.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Semantic Cross-View Matching.

Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Unsupervised Semantic Parsing of Video Collections.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Action Recognition by Hierarchical Mid-Level Action Elements.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Image Geo-Localization Based on Multiple Nearest Neighbor Feature Matching Using Generalized Graphs.

IEEE Trans. Pattern Anal. Mach. Intell., 2014

DaMN - Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition.

Proceedings of the Computer Vision - ECCV 2014, 2014

GIS-Assisted Object Detection and Geospatial Localization.

Proceedings of the Computer Vision - ECCV 2014, 2014

GPS-Tag Refinement Using Random Walks with an Adaptive Damping Factor.

Shervin Ardeshir

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Video Classification Using Semantic Concept Co-occurrences.

Shayan Modiri Assari

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013

Visual business recognition: a multimodal approach.

Afshin Dehghan

Proceedings of the ACM Multimedia Conference, 2013

2012

UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild.

Khurram Soomro

CoRR, 2012

GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs.

Afshin Dehghan

Proceedings of the Computer Vision - ECCV 2012, 2012

City scale geo-spatial trajectory estimation of a moving camera.

Gonzalo Vaca-Castano

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Street View Challenge: Identification of Commercial Entities in Street View Imagery.

Alexander Darino

Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

2010

Accurate Image Localization Based on Google Maps Street View.
