Anurag Arnab

Orcid: 0000-0002-5216-4838

According to our database1, Anurag Arnab authored at least 60 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
An automated method for tendon image segmentation on ultrasound using grey-level co-occurrence matrix features and hidden Gaussian Markov random fields.
Comput. Biol. Medicine, February, 2024

Time-, Memory- and Parameter-Efficient Visual Adaptation.
CoRR, 2024

2023
Dynamic Graph Message Passing Networks.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

PolyViT: Co-training Vision Transformers on Images, Videos and Audio.
Trans. Mach. Learn. Res., 2023

Pixel Aligned Language Models.
CoRR, 2023

Video Summarization: Towards Entity-Aware Captions.
CoRR, 2023

Dense Video Object Captioning from Disjoint Supervision.
CoRR, 2023

Optimizing ViViT Training: Time and Memory Reduction for Action Recognition.
CoRR, 2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.
CoRR, 2023

End-to-End Spatio-Temporal Action Localisation with Video Transformers.
CoRR, 2023

VicTR: Video-conditioned Text Representations for Activity Recognition.
CoRR, 2023

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation.
CoRR, 2023

Scaling Vision Transformers to 22 Billion Parameters.
CoRR, 2023

Does Visual Pretraining Help End-to-End Reasoning?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adaptive Computation with Elastic Input Sequence.
Proceedings of the International Conference on Machine Learning, 2023


UnLoc: A Unified Framework for Video Localization Tasks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Audiovisual Masked Autoencoders.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

How can objects help action recognition?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Token Turing Machines.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Dynamic Graph Message Passing Networks for Visual Recognition.
CoRR, 2022

Beyond Transfer Learning: Co-finetuning for Action Localisation.
CoRR, 2022

M&M Mix: A Multimodal Multiview Transformer Ensemble.
CoRR, 2022

Simple Open-Vocabulary Object Detection with Vision Transformers.
CoRR, 2022

The Efficiency Misnomer.
Proceedings of the Tenth International Conference on Learning Representations, 2022


Multiview Transformers for Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

End-to-end Generative Pretraining for Multimodal Video Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning with Neighbor Consistency for Noisy Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SCENIC: A JAX Library for Computer Vision Research and Beyond.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
PolyViT: Co-training Vision Transformers on Images, Videos and Audio.
CoRR, 2021

TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
CoRR, 2021

TokenLearner: Adaptive Space-Time Tokenization for Videos.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Attention Bottlenecks for Multimodal Fusion.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Compressive Visual Representations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unified Graph Structured Models for Video Understanding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ViViT: A Video Vision Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Simplifying TugGraph using zipping algorithms.
Pattern Recognit., 2020

On the Robustness of Semantic Segmentation Models to Adversarial Attacks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Meta-Learning Deep Visual Words for Fast Video Object Segmentation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dynamic Graph Message Passing Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Pixel-level scene understanding with deep structured models.
PhD thesis, 2019

Exploiting Temporal Context for 3D Human Pose Estimation in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dual Graph Convolutional Network for Semantic Segmentation.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation: Combining Probabilistic Graphical Models with Deep Learning for Structured Prediction.
IEEE Signal Process. Mag., 2018

Revisiting Deep Structured Models for Pixel-Level Labeling with Gradient-Based Inference.
SIAM J. Imaging Sci., 2018

Weakly- and Semi-supervised Panoptic Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Deep Fully-Connected Part-Based Models for Human Pose Estimation.
Proceedings of The 10th Asian Conference on Machine Learning, 2018

2017
Learning Arbitrary Potentials in CRFs with Gradient Descent.
CoRR, 2017

A Projected Gradient Descent Method for CRF Inference Allowing End-to-End Training of Arbitrary Pairwise Potentials.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2017

Pixelwise Instance Segmentation with a Dynamically Instantiated Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Holistic, Instance-level Human Parsing.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Higher Order Conditional Random Fields in Deep Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Bottom-up Instance Segmentation using Deep Higher-Order CRFs.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
SemanticPaint: A Framework for the Interactive Segmentation of 3D Scenes.
CoRR, 2015

Higher Order Potentials in End-to-End Trainable Conditional Random Fields.
CoRR, 2015

SemanticPaint: interactive segmentation and learning of 3D worlds.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2015

Joint Object-Material Category Segmentation from Audio-Visual Cues.
Proceedings of the British Machine Vision Conference 2015, 2015


  Loading...