Muzammal Naseer

Orcid: 0000-0001-7663-7161

According to our database1, Muzammal Naseer authored at least 76 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MixANT: Observation-dependent Memory Propagation for Stochastic Dense Action Anticipation.
CoRR, September, 2025

Foundation Models Defining a New Era in Vision: A Survey and Outlook.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2025

How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark.
CoRR, March, 2025

Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models.
CoRR, February, 2025

Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation.
CoRR, January, 2025

CLDTracker: A Comprehensive Language Description for visual Tracking.
Inf. Fusion, 2025

Enhancing Novel Object Detection via Cooperative Foundational Models.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Hierarchical Self-supervised Adversarial Training for Robust Vision Models in Histopathology.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

On Frequency Domain Adversarial Vulnerabilities of Volumetric Medical Image Segmentation.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Towards Evaluating the Robustness of Visual State Space Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

How Good is my Video-LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

DyCON: Dynamic Uncertainty-aware Consistency and Contrastive Learning for Semi-supervised Medical Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Learning to Prompt with Text Only Supervision for Vision-Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Guidance Through Surrogate: Toward a Generic Diagnostic Attack.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities.
CoRR, 2024

Y-CA-Net: A Convolutional Attention Based Network for Volumetric Medical Image Segmentation.
CoRR, 2024

CDChat: A Large Multimodal Model for Remote Sensing Change Description.
CoRR, 2024

Distillation-free Scaling of Large SSMs for Images and Videos.
CoRR, 2024

STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models.
CoRR, 2024

Multi-Granularity Language-Guided Multi-Object Tracking.
CoRR, 2024

Multi-modal Generation via Cross-Modal In-Context Learning.
CoRR, 2024

VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding.
CoRR, 2024

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.
CoRR, 2024

Learnable weight initialization for volumetric medical image segmentation.
Artif. Intell. Medicine, 2024

Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

BAPLe: Backdoor Attacks on Medical Foundational Models Using Prompt Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 Workshops, 2024

Language Guided Domain Generalized Medical Image Segmentation.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GeoChat: Grounded Large Vision-Language Model for Remote Sensing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models.
Proceedings of the 35th British Machine Vision Conference, 2024

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes.
Proceedings of the Computer Vision - ACCV 2024, 2024

S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Stylized Adversarial Defense.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.
CoRR, 2023

Videoprompter: an ensemble of foundational models for zero-shot video understanding.
CoRR, 2023

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.
CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.
CoRR, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Boosting Adversarial Transferability using Dynamic Cues.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FLIP: Cross-domain Face Anti-spoofing with Language Guidance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-regulating Prompts: Foundational Model Adaptation without Forgetting.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Transformers in Vision: A Survey.
ACM Comput. Surv., January, 2022

Guidance Through Surrogate: Towards a Generic Diagnostic Attack.
CoRR, 2022

On Improving Adversarial Transferability of Vision Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Self-supervised Video Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

How to Train Vision Transformer on Small-scale Datasets?
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Self-distilled Vision Transformer for Domain Generalization.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Intriguing Properties of Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Orthogonal Projection Loss.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Generating Transferable Targeted Perturbations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rich Semantics Improve Few-Shot Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
A Self-supervised Approach for Adversarial Robustness.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey.
IEEE Access, 2019

Local Gradients Smoothing: Defense Against Localized Adversarial Attacks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cross-Domain Transferability of Adversarial Perturbations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Distorting Neural Representations to Generate Highly Transferable Adversarial Examples.
CoRR, 2018

Indoor Scene Understanding in 2.5/3D: A Survey.
CoRR, 2018


  Loading...