Hartwig Adam

Orcid: 0000-0003-1258-4341

According to our database1, Hartwig Adam authored at least 60 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
VideoPrism: A Foundational Visual Encoder for Video Understanding.
CoRR, 2024

Distilling Vision-Language Models on Millions of Videos.
CoRR, 2024

PolyMaX: General Dense Prediction with Mask Transformer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation.
CoRR, 2023

SANPO: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset.
CoRR, 2023

VideoGLUE: Video General Understanding Evaluation of Foundation Models.
CoRR, 2023

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unified Visual Relationship Detection with Vision and Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Zero-shot Generalization and Robustness of Multi-Modal Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose.
Int. J. Comput. Vis., 2022

2.5D visual relationship detection.
Comput. Vis. Image Underst., 2022

Surrogate Gap Minimization Improves Sharpness-Aware Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

k-means Mask Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing.
Proceedings of the Computer Vision - ECCV 2022, 2022

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset.
Proceedings of the Computer Vision - ECCV 2022, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TubeFormer-DeepLab: Video Mask Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Computational Media Intelligence: Human-Centered Machine Analysis of Media.
Proc. IEEE, 2021

Exploring Temporal Granularity in Self-Supervised Video Representation Learning.
CoRR, 2021

DeepLab2: A TensorFlow Library for Deep Labeling.
CoRR, 2021

STEP: Segmenting and Tracking Every Pixel.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation.
CoRR, 2020

EEV Dataset: Predicting Expressions Evoked by Diverse Videos.
CoRR, 2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

View-Invariant Probabilistic Embedding for Human Pose.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset.
Proceedings of the Computer Vision - ECCV 2020, 2020

Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MnasFPN: Learning Latency-Aware Pyramid Architecture for Object Detection on Mobile Devices.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Panoptic-DeepLab.
CoRR, 2019

The iMaterialist Fashion Attribute Dataset.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Geo-Aware Networks for Fine-Grained Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Searching for MobileNetV3.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FEELVOS: Fast End-To-End Embedding Learning for Video Object Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications.
CoRR, 2018

Searching for Efficient Multi-Scale Architectures for Dense Image Prediction.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications.
Proceedings of the Computer Vision - ECCV 2018, 2018

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

The INaturalist Species Classification and Detection Dataset.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

MaskLab: Instance Segmentation by Refining Object Detection With Semantic and Direction Features.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Improving Smiling Detection with Race and Gender Diversity.
CoRR, 2017

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.
CoRR, 2017

The iNaturalist Challenge 2017 Dataset.
CoRR, 2017

Rethinking Atrous Convolution for Semantic Image Segmentation.
CoRR, 2017

Learning Unified Embedding for Apparel Recognition.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

BranchOut: Regularization for Online Ensemble Tracking with Convolutional Neural Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2014
Large-Scale Object Classification Using Label Relation Graphs.
Proceedings of the Computer Vision - ECCV 2014, 2014

2009
Tour the world: a technical demonstration of a web-scale landmark recognition engine.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Large-scale privacy protection in Google Street View.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Tour the world: Building a web-scale landmark recognition engine.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Large scale learning and recognition of faces in web videos.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008


  Loading...