Muzammal Naseer

Inf. Fusion, 2025

Enhancing Novel Object Detection via Cooperative Foundational Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Hierarchical Self-supervised Adversarial Training for Robust Vision Models in Histopathology.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

On Frequency Domain Adversarial Vulnerabilities of Volumetric Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection.

[BibT_eX]

[DOI]

Divya Velayudhan

Abdelfatah Hassan Ahmed

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Towards Evaluating the Robustness of Visual State Space Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

How Good is my Video-LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs.

[BibT_eX]

[DOI]

Muhammad Ferjad Naeem

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation.

[BibT_eX]

[DOI]

Maregu Assefa

Iyyakutti Iyappan Ganapathi

Syed Sadaf Ali

Mohamed L. Seghier

Naoufel Werghi

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Learning to Prompt with Text Only Supervision for Vision-Language Models.

[BibT_eX]

[DOI]

Muhammad Ferjad Naeem

Luc Van Gool

Federico Tombari

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Guidance Through Surrogate: Toward a Generic Diagnostic Attack.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., February, 2024

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities.

[BibT_eX]

[DOI]

CoRR, 2024

Y-CA-Net: A Convolutional Attention Based Network for Volumetric Medical Image Segmentation.

[BibT_eX]

[DOI]

Muhammad Hamza Sharif

CoRR, 2024

CDChat: A Large Multimodal Model for Remote Sensing Change Description.

[BibT_eX]

[DOI]

CoRR, 2024

Distillation-free Scaling of Large SSMs for Images and Videos.

[BibT_eX]

[DOI]

CoRR, 2024

STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-Granularity Language-Guided Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2024

Multi-modal Generation via Cross-Modal In-Context Learning.

[BibT_eX]

[DOI]

CoRR, 2024

VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.

[BibT_eX]

[DOI]

CoRR, 2024

Learnable weight initialization for volumetric medical image segmentation.

[BibT_eX]

[DOI]

Shahina K. Kunhimon

Abdelrahman M. Shaker

Fahad Shahbaz Khan

Artif. Intell. Medicine, 2024

Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

BAPLe: Backdoor Attacks on Medical Foundational Models Using Prompt Learning.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 Workshops, 2024

Language Guided Domain Generalized Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors.

[BibT_eX]

[DOI]

Fahad Shamshad

Karthik Nandakumar

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GeoChat: Grounded Large Vision-Language Model for Remote Sensing.

[BibT_eX]

[DOI]

Kartik Kuckreja

Muhammad Sohail Danish

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models.

[BibT_eX]

[DOI]

Proceedings of the 35th British Machine Vision Conference, 2024

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2024, 2024

S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Stylized Adversarial Defense.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.

[BibT_eX]

[DOI]

Jameel Hassan

Hanan Gani

Noor Hussein

Fahad Shahbaz Khan

CoRR, 2023

Videoprompter: an ensemble of foundational models for zero-shot video understanding.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.

[BibT_eX]

[DOI]

CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.

[BibT_eX]

[DOI]

CoRR, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.

[BibT_eX]

[DOI]

Jameel Abdul Samadh

Hanan Gani

Noor Hussein

Fahad Shahbaz Khan

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Boosting Adversarial Transferability using Dynamic Cues.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.

[BibT_eX]

[DOI]

Syed Talal Wasim

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FLIP: Cross-domain Face Anti-spoofing with Language Guidance.

[BibT_eX]

[DOI]

Koushik Srivatsan

Karthik Nandakumar

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-regulating Prompts: Foundational Model Adaptation without Forgetting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search.

[BibT_eX]

[DOI]

Fahad Shamshad

Karthik Nandakumar

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Transformers in Vision: A Survey.

[BibT_eX]

[DOI]

ACM Comput. Surv., January, 2022

Guidance Through Surrogate: Towards a Generic Diagnostic Attack.

[BibT_eX]

[DOI]

CoRR, 2022

On Improving Adversarial Transferability of Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Self-supervised Video Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations.

[BibT_eX]

[DOI]

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

How to Train Vision Transformer on Small-scale Datasets?

[BibT_eX]

[DOI]

Hanan Gani

Mohammad Yaqub

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Self-distilled Vision Transformer for Domain Generalization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2022, 2022

2021

Intriguing Properties of Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Orthogonal Projection Loss.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Generating Transferable Targeted Perturbations.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rich Semantics Improve Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

A Self-supervised Approach for Adversarial Robustness.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey.

[BibT_eX]

[DOI]

Fatih Porikli

IEEE Access, 2019

Local Gradients Smoothing: Defense Against Localized Adversarial Attacks.

[BibT_eX]

[DOI]

Fatih Porikli

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cross-Domain Transferability of Adversarial Perturbations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

Distorting Neural Representations to Generate Highly Transferable Adversarial Examples.

[BibT_eX]

[DOI]

CoRR, 2018

Indoor Scene Understanding in 2.5/3D: A Survey.

[BibT_eX]

[DOI]