Amir Zamir

ORCID: 0000-0002-5559-1843

Affiliations:
  • Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
  • Stanford University, CA, USA (former)
  • University of California, Berkeley, CA, USA (former)
  • University of Central Florida, Department of Electrical Engineering and Computer Science, Orlando, FL, USA (former, PhD 2014)


According to our database, Amir Zamir authored at least 51 papers between 2010 and 2024.

Bibliography

2024
Controlled Training Data Generation with Diffusion Models.
CoRR, 2024

2023
Unraveling the Key Components of OOD Generalization via Diversification.
CoRR, 2023

4M: Massively Multimodal Masked Modeling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Modality-invariant Visual Odometry for Embodied Vision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CLIPasso: semantically-aware object sketching.
ACM Trans. Graph., 2022

Simple Control Baselines for Evaluating Transfer Learning.
CoRR, 2022

PALMER: Perception - Action Loop with Memory for Long-Horizon Planning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Task Discovery: Finding the Tasks that Neural Networks Generalize on.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MultiMAE: Multi-modal Multi-task Masked Autoencoders.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D Common Corruptions and Data Augmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans.
CoRR, 2021

Robustness via Cross-Domain Ensembles.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation.
CoRR, 2020

Robust Learning Through Cross-Task Consistency.
CoRR, 2020

Which Tasks Should Be Learned Together in Multi-task Learning?
Proceedings of the 37th International Conference on Machine Learning, 2020

Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Robust Learning Through Cross-Task Consistency.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Side-Tuning: Network Adaptation via Additive Side Networks.
CoRR, 2019

An Information-Theoretic Approach to Transferability in Task Transfer Learning.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning to Navigate Using Mid-Level Visual Priors.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Active Tasks.
CoRR, 2018

On Evaluation of Embodied Navigation Agents.
CoRR, 2018

Taskonomy: Disentangling Task Transfer Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Gibson Env: Real-World Perception for Embodied Agents.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
The THUMOS challenge on action recognition for videos "in the wild".
Comput. Vis. Image Underst., 2017

Joint 2D-3D-Semantic Data for Indoor Scene Understanding.
CoRR, 2017

Feedback Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Introduction to Large-Scale Visual Geo-localization.
Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016

Feedback Networks.
CoRR, 2016

Unsupervised Semantic Action Discovery from Video Collections.
CoRR, 2016

Generic 3D Representation via Pose Estimation and Matching.
Proceedings of the Computer Vision - ECCV 2016, 2016

Structural-RNN: Deep Learning on Spatio-Temporal Graphs.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

3D Semantic Parsing of Large-Scale Indoor Spaces.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Semantic Cross-View Matching.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Unsupervised Semantic Parsing of Video Collections.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Action Recognition by Hierarchical Mid-Level Action Elements.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Image Geo-Localization Based on Multiple Nearest Neighbor Feature Matching Using Generalized Graphs.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

DaMN - Discriminative and Mutually Nearest: Exploiting Pairwise Category Proximity for Video Action Recognition.
Proceedings of the Computer Vision - ECCV 2014, 2014

GIS-Assisted Object Detection and Geospatial Localization.
Proceedings of the Computer Vision - ECCV 2014, 2014

GPS-Tag Refinement Using Random Walks with an Adaptive Damping Factor.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Video Classification Using Semantic Concept Co-occurrences.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Visual business recognition: a multimodal approach.
Proceedings of the ACM Multimedia Conference, 2013

2012
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild.
CoRR, 2012

GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs.
Proceedings of the Computer Vision - ECCV 2012, 2012

City scale geo-spatial trajectory estimation of a moving camera.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Street View Challenge: Identification of Commercial Entities in Street View Imagery.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

2010
Accurate Image Localization Based on Google Maps Street View.
Proceedings of the Computer Vision - ECCV 2010, 2010

