Karan Sikka

Orcid: 0000-0002-0187-5322

According to our database1, Karan Sikka authored at least 46 papers between 2012 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
A Video is Worth 10, 000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval.
CoRR, 2023

DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback.
CoRR, 2023

Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning.
CoRR, 2023

Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models.
CoRR, 2023

SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments.
CoRR, 2023

Predicting Information Pathways Across Online Communities.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Detecting Trojaned DNNs Using Counterfactual Attributions.
Proceedings of the IEEE International Conference on Assured Autonomy, 2023

Multilingual Content Moderation: A Case Study on Reddit.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
MUWS'22: 1st International Workshop on Multimodal Understanding for the Web and Social Media.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Dual-Key Multimodal Backdoors for Visual Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Towards Solving Multimodal Comprehension.
CoRR, 2021

Online Defense of Trojaned Models using Misattributions.
CoRR, 2021

MISA: Online Defense of Trojaned Models using Misattributions.
Proceedings of the ACSAC '21: Annual Computer Security Applications Conference, Virtual Event, USA, December 6, 2021

2020
Zero-Shot Learning with Knowledge Enhanced Visual Semantic Embeddings.
CoRR, 2020

Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks.
CoRR, 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
FoodX-251: A Dataset for Fine-grained Food Classification.
CoRR, 2019

Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks.
CoRR, 2019

Learning User Preferences from Social Multimedia Analysis and Overview of the iFood2019Challenge.
Proceedings of the 5th International Workshop on Multimedia Assisted Dietary Management, 2019

Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Semantically-Aware Attentive Neural Embeddings for 2D Long-Term Visual Localization.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Discriminatively Trained Latent Ordinal Model for Video Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Semantically-Aware Attentive Neural Embeddings for Image-based Visual Localization.
CoRR, 2018

Understanding Visual Ads by Aligning Symbols and Objects using Co-Attention.
CoRR, 2018

Zero-Shot Object Detection.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Deep active object recognition by joint label and action prediction.
Comput. Vis. Image Underst., 2017

Combining Weakly and Webly Supervised Learning for Classifying Food Images.
CoRR, 2017

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Latent Dynamic Space-Time Volumes for Predicting Human Facial Behavior in Videos.
PhD thesis, 2016

LOMo: Latent Ordinal Model for Facial Analysis in Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
The more the merrier: Analysing the affect of a group of people in images.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Exemplar Hidden Markov Models for classification of facial expressions in videos.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Joint Clustering and Classification for Multiple Instance Learning.
Proceedings of the British Machine Vision Conference 2015, 2015

Deep Q-learning for Active Recognition of GERMS: Baseline performance on a standardized dataset for active learning.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Classification and weakly supervised pain localization using multiple segment representation.
Image Vis. Comput., 2014

A discriminative parts based model approach for fiducial points free and shape constrained head pose normalisation in the wild.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Facial Expression Analysis for Estimating Pain in Clinical Settings.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Emotion Recognition In The Wild Challenge 2014: Baseline, Data and Protocol.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

2013
Pseudo vs. True Defect Classification in Printed Circuits Boards using Wavelet Features.
CoRR, 2013

Multiple kernel learning for emotion recognition in the wild.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Weakly supervised pain localization using multiple instance learning.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012
Exploring Bag of Words Architectures in the Facial Expression Domain.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012


  Loading...