Hartwig Adam

Orcid: 0000-0003-1258-4341

According to our database¹, Hartwig Adam authored at least 64 papers between 2008 and 2025.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

SANPO: A Scene Understanding, Accessibility and Human Navigation Dataset.

[BibT_eX]

[DOI]

Cattalyya Nuengsigkapian

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Epsilon-VAE: Denoising as Visual Decoding.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

VideoGLUE: Video General Understanding Evaluation of Foundation Models.

[BibT_eX]

[DOI]

Liangzhe Yuan

Nitesh Bharadwaj Gundavarapu

Trans. Mach. Learn. Res., 2024

Video Creation by Demonstration.

[BibT_eX]

[DOI]

CoRR, 2024

ε-VAE: Denoising as Visual Decoding.

[BibT_eX]

[DOI]

CoRR, 2024

PolyMaX: General Dense Prediction with Mask Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

VideoPoet: A Large Language Model for Zero-Shot Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding.

[BibT_eX]

[DOI]

Long Zhao

Nitesh Bharadwaj Gundavarapu

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Distilling Vision-Language Models on Millions of Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

SANPO: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset.

[BibT_eX]

[DOI]

Cattalyya Nuengsigkapian

CoRR, 2023

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unified Visual Relationship Detection with Vision and Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Zero-shot Generalization and Robustness of Multi-Modal Models.

[BibT_eX]

[DOI]

Balaji Lakshminarayanan

Jiaping Zhao

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2022

2.5D visual relationship detection.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2022

Surrogate Gap Minimization Improves Sharpness-Aware Training.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

k-means Mask Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TubeFormer-DeepLab: Video Mask Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021

Computational Media Intelligence: Human-Centered Machine Analysis of Media.

[BibT_eX]

[DOI]

Proc. IEEE, 2021

Exploring Temporal Granularity in Self-Supervised Video Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2021

DeepLab2: A TensorFlow Library for Deep Labeling.

[BibT_eX]

[DOI]

CoRR, 2021

STEP: Segmenting and Tracking Every Pixel.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation.

[BibT_eX]

[DOI]

Liang-Chieh Chen

Raphael Gontijo Lopes

CoRR, 2020

EEV Dataset: Predicting Expressions Evoked by Diverse Videos.

[BibT_eX]

[DOI]

CoRR, 2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

View-Invariant Probabilistic Embedding for Human Pose.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation.

[BibT_eX]

[DOI]

Liang-Chieh Chen

Raphael Gontijo Lopes

Proceedings of the Computer Vision - ECCV 2020, 2020

Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MnasFPN: Learning Latency-Aware Pyramid Architecture for Object Detection on Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Panoptic-DeepLab.

[BibT_eX]

[DOI]

CoRR, 2019

The iMaterialist Fashion Attribute Dataset.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Geo-Aware Networks for Fine-Grained Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Searching for MobileNetV3.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FEELVOS: Fast End-To-End Embedding Learning for Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications.

[BibT_eX]

[DOI]

CoRR, 2018

Searching for Efficient Multi-Scale Architectures for Dense Image Prediction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

The INaturalist Species Classification and Detection Dataset.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

MaskLab: Instance Segmentation by Refining Object Detection With Semantic and Direction Features.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Improving Smiling Detection with Race and Gender Diversity.

[BibT_eX]

[DOI]

Hee Jung Ryu

Margaret Mitchell

Hartwig Adam

CoRR, 2017

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.

[BibT_eX]

[DOI]

CoRR, 2017

The iNaturalist Challenge 2017 Dataset.

[BibT_eX]

[DOI]

CoRR, 2017

Rethinking Atrous Convolution for Semantic Image Segmentation.

[BibT_eX]

[DOI]

CoRR, 2017

Learning Unified Embedding for Apparel Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

BranchOut: Regularization for Online Ensemble Tracking with Convolutional Neural Networks.

[BibT_eX]

[DOI]

Bohyung Han

Jack Sim

Hartwig Adam

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2014

Large-Scale Object Classification Using Label Relation Graphs.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

2009

Tour the world: a technical demonstration of a web-scale landmark recognition engine.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Large-scale privacy protection in Google Street View.

[BibT_eX]

[DOI]

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Tour the world: Building a web-scale landmark recognition engine.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008

Large scale learning and recognition of faces in web videos.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Hartwig Adam

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...