An automated method for tendon image segmentation on ultrasound using grey-level co-occurrence matrix features and hidden Gaussian Markov random fields.

Isabelle Scott

David Connell

Comput. Biol. Medicine, February, 2024

Towards Optimal Adapter Placement for Efficient Transfer Learning.

Aleksandra Irena Nowak

CoRR, 2024

Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Mixture of Nested Experts: Adaptive Processing of Visual Tokens.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

PLANTED: A Dataset for Planted Forest Identification from Multi-Satellite Time Series.

Luis Miguel Pazos-Outón

Cristina Nader Vasconcelos

Proceedings of the IGARSS 2024, 2024

VIEWS: Entity-Aware News Video Captioning.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition.

Shreyank N. Gowda

Anurag Arnab

Jonathan Huang

Proceedings of the Computer Vision - ECCV 2024, 2024

Streaming Dense Video Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Pixel Aligned Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Time-, Memory- and Parameter-Efficient Visual Adaptation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VicTR: Video-conditioned Text Representations for Activity Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

End-to-End Spatio-Temporal Action Localisation with Video Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On Scaling Up a Multilingual Vision and Language Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Dynamic Graph Message Passing Networks.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

PolyViT: Co-training Vision Transformers on Images, Videos and Audio.

Valerii Likhosherstov

Anurag Arnab

Krzysztof Marcin Choromanski

Mario Lucic

Yi Tay

Mostafa Dehghani

Trans. Mach. Learn. Res., 2023

Video Summarization: Towards Entity-Aware Captions.

CoRR, 2023

Optimizing ViViT Training: Time and Memory Reduction for Action Recognition.

Shreyank N. Gowda

Anurag Arnab

Jonathan Huang

CoRR, 2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.

CoRR, 2023

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation.

CoRR, 2023

Does Visual Pretraining Help End-to-End Reasoning?

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adaptive Computation with Elastic Input Sequence.

Fuzhao Xue

Valerii Likhosherstov

Proceedings of the International Conference on Machine Learning, 2023

Scaling Vision Transformers to 22 Billion Parameters.

Proceedings of the International Conference on Machine Learning, 2023

UnLoc: A Unified Framework for Video Localization Tasks.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Audiovisual Masked Autoencoders.

Mariana-Iuliana Georgescu

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

How can objects help action recognition?

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Token Turing Machines.

Michael S. Ryoo

Keerthana Gopalakrishnan

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Dynamic Graph Message Passing Networks for Visual Recognition.

CoRR, 2022

Beyond Transfer Learning: Co-finetuning for Action Localisation.

CoRR, 2022

M&M Mix: A Multimodal Multiview Transformer Ensemble.

CoRR, 2022

Simple Open-Vocabulary Object Detection with Vision Transformers.

CoRR, 2022

The Efficiency Misnomer.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Simple Open-Vocabulary Object Detection.

Proceedings of the Computer Vision - ECCV 2022, 2022

Multiview Transformers for Video Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

End-to-end Generative Pretraining for Multimodal Video Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning with Neighbor Consistency for Noisy Labels.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SCENIC: A JAX Library for Computer Vision Research and Beyond.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

PolyViT: Co-training Vision Transformers on Images, Videos and Audio.

Valerii Likhosherstov

Anurag Arnab

Krzysztof Choromanski

CoRR, 2021

TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?

CoRR, 2021

TokenLearner: Adaptive Space-Time Tokenization for Videos.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Attention Bottlenecks for Multimodal Fusion.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Compressive Visual Representations.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unified Graph Structured Models for Video Understanding.

Anurag Arnab

Chen Sun

Cordelia Schmid

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ViViT: A Video Vision Transformer.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Simplifying TugGraph using zipping algorithms.

Pattern Recognit., 2020

Meta-Learning Deep Visual Words for Fast Video Object Segmentation.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos.

Proceedings of the Computer Vision - ECCV 2020, 2020

Dynamic Graph Message Passing Networks.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection.

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019

Pixel-level scene understanding with deep structured models.

Anurag Arnab

PhD thesis, 2019

Exploiting Temporal Context for 3D Human Pose Estimation in the Wild.

Anurag Arnab

Carl Doersch

Andrew Zisserman

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dual Graph Convolutional Network for Semantic Segmentation.

Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018

Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation: Combining Probabilistic Graphical Models with Deep Learning for Structured Prediction.

Anurag Arnab

Shuai Zheng

Sadeep Jayasumana

Bernardino Romera-Paredes