Muhammad Maaz

Affiliations:
  • Mohamed bin Zayed University of AI, Abu Dhabi, UAE


According to our database, Muhammad Maaz authored at least 13 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
PALO: A Polyglot Large Multimodal Model for 5B People.
CoRR, 2024

2023
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models.
CoRR, 2023

GLaMM: Pixel Grounding Large Multimodal Model.
CoRR, 2023

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.
CoRR, 2023

SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Fine-tuned CLIP Models are Efficient Video Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MaPLe: Multi-modal Prompt Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation.
CoRR, 2022

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Class-Agnostic Object Detection with Multi-modal Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Multi-modal Transformers Excel at Class-agnostic Object Detection.
CoRR, 2021

Self-Supervised Learning for Fine-Grained Visual Categorization.
CoRR, 2021

