Hisham Cholakkal

Orcid: 0000-0002-8230-9065

According to our database, Hisham Cholakkal authored at least 70 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Robust Perception and Precise Segmentation for Scribble-Supervised RGB-D Saliency Detection.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection.
CoRR, 2024

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.
CoRR, 2024

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT.
CoRR, 2024

PALO: A Polyglot Large Multimodal Model for 5B People.
CoRR, 2024

BiMediX: Bilingual Medical Mixture of Experts LLM.
CoRR, 2024

TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

DDAM-PS: Diligent Domain Adaptive Mixer for Person Search.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Semi-supervised Open-World Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Transformers in Remote Sensing: A Survey.
Remote. Sens., April, 2023

SipMaskv2: Enhanced Fast Image and Video Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

GLaMM: Pixel Grounding Large Multimodal Model.
CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.
CoRR, 2023

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
CoRR, 2023

Remote Sensing Change Detection With Transformers Trained from Scratch.
CoRR, 2023

Video Instance Segmentation in an Open-World.
CoRR, 2023

SAT: Scale-Augmented Transformer for Person Search.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Data-Efficient Transformer-Based 3D Object Detection.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

Salient Mask-Guided Vision Transformer for Fine-Grained Classification.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

RadarFormer: Lightweight and Accurate Real-Time Radar Object Detection Model.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

Anchor-ReID: A Test Time Adaptation for Person Re-identification.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D Indoor Instance Segmentation in an Open-World.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cross-Modulated Few-Shot Image Generation for Colorectal Tissue Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generative Multiplane Neural Radiance for 3D-Aware Image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Arabic Mini-ClimateGPT: A Climate Change and Sustainability Tailored Arabic LLM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Person Image Synthesis via Denoising Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Towards Partial Supervision for Generic Object Counting in Natural Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.
CoRR, 2022

Multi-scale Feature Aggregation for Crowd Counting.
CoRR, 2022

3D Vision with Transformers: A Survey.
CoRR, 2022

CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

On the Robustness of 3D Object Detectors.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

DoodleFormer: Creative Sketch Drawing with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

PSTR: End-to-End One-Step Person Search With Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

PS-ARM: An End-to-End Attention-Aware Relation Mixer Network for Person Search.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
PSC-Net: learning part spatial co-occurrence for occluded pedestrian detection.
Sci. China Inf. Sci., 2021

D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Handwriting Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Structured Latent Embeddings for Recognizing Unseen Classes in Unseen Domains.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
PSC-Net: Learning Part Spatial Co-occurence for Occluded Pedestrian Detection.
CoRR, 2020

Count- and Similarity-Aware R-CNN for Pedestrian Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fixing Localization Errors to Improve Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

D2Det: Towards High Quality Object Detection and Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Fine-Grained Recognition: Accounting for Subtle Differences between Similar Classes.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CNN-based gender classification in near-infrared periocular images.
Pattern Anal. Appl., 2019

Learning Rich Features at High-Speed for Single-Shot Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Enriched Feature Guided Refinement Network for Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Object Counting and Instance Segmentation With Image-Level Supervision.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Backtracking Spatial Pyramid Pooling-Based Image Classifier for Weakly Supervised Top-Down Salient Object Detection.
IEEE Trans. Image Process., 2018

2017
Classifier-based approaches for top-down salient object detection.
PhD thesis, 2017

L1-Regularized Reconstruction Error as Alpha Matte.
IEEE Signal Process. Lett., 2017

2016
Sparse Coding for Alpha Matting.
IEEE Trans. Image Process., 2016

A classifier-guided approach for top-down salient object detection.
Signal Process. Image Commun., 2016

Weakly Supervised Top-down Salient Object Detection.
CoRR, 2016

Backtracking ScSPM Image Classifier for Weakly Supervised Top-Down Saliency.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Temporal trimap propagation using motion-assisted shape blending.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Top-down saliency with Locality-constrained Contextual Sparse Coding.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Sparse codes as Alpha Matte.
Proceedings of the British Machine Vision Conference, 2014

2013
High-level synthesis of multiple dependent CUDA kernels on FPGA.
Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

