Karan Sikka

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Dual-Key Multimodal Backdoors for Visual Question Answering.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Towards Solving Multimodal Comprehension.

Pritish Sahu

Niluthpol Chowdhury Mithun

CoRR, 2021

Online Defense of Trojaned Models using Misattributions.

CoRR, 2021

MISA: Online Defense of Trojaned Models using Misattributions.

Proceedings of the ACSAC '21: Annual Computer Security Applications Conference, Virtual Event, USA, December 6, 2021

2020

Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings.

CoRR, 2020

Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks.

CoRR, 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019

FoodX-251: A Dataset for Fine-grained Food Classification.

CoRR, 2019

Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks.

Lucas Van Bramer

CoRR, 2019

Learning User Preferences from Social Multimedia Analysis and Overview of the iFood2019 Challenge.

Proceedings of the 5th International Workshop on Multimedia Assisted Dietary Management, 2019

Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Semantically-Aware Attentive Neural Embeddings for 2D Long-Term Visual Localization.

Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018

Discriminatively Trained Latent Ordinal Model for Video Classification.

Gaurav Sharma

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Semantically-Aware Attentive Neural Embeddings for Image-based Visual Localization.

CoRR, 2018

Understanding Visual Ads by Aligning Symbols and Objects using Co-Attention.

CoRR, 2018

Zero-Shot Object Detection.

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

Deep active object recognition by joint label and action prediction.

Comput. Vis. Image Underst., 2017

Combining Weakly and Webly Supervised Learning for Classifying Food Images.

Parneet Kaur

CoRR, 2017

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Latent Dynamic Space-Time Volumes for Predicting Human Facial Behavior in Videos.

PhD thesis, 2016

LOMo: Latent Ordinal Model for Facial Analysis in Videos.

Gaurav Sharma

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

The more the merrier: Analysing the affect of a group of people in images.

Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Exemplar Hidden Markov Models for classification of facial expressions in videos.

Abhinav Dhall

Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Joint Clustering and Classification for Multiple Instance Learning.

Ritwik Giri

Proceedings of the British Machine Vision Conference 2015, 2015

Deep Q-learning for Active Recognition of GERMS: Baseline performance on a standardized dataset for active learning.

Proceedings of the British Machine Vision Conference 2015, 2015

2014

Classification and weakly supervised pain localization using multiple segment representation.

Abhinav Dhall

Image Vis. Comput., 2014

A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Facial Expression Analysis for Estimating Pain in Clinical Settings.

Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Emotion Recognition In The Wild Challenge 2014: Baseline, Data and Protocol.

Proceedings of the 16th International Conference on Multimodal Interaction, 2014

2013

Pseudo vs. True Defect Classification in Printed Circuits Boards using Wavelet Features.

CoRR, 2013

Multiple kernel learning for emotion recognition in the wild.

Karmen Dykstra

Suchitra Sathyanarayana

Gwen Littlewort

Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Weakly supervised pain localization using multiple instance learning.

Abhinav Dhall

Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012

Exploring Bag of Words Architectures in the Facial Expression Domain.

Tingfan Wu

Joshua Susskind