Ajay Divakaran
Orcid: 0000-0003-0371-5346
According to our database1,
Ajay Divakaran
authored at least 127 papers
between 1995 and 2024.
Collaborative distances:
Collaborative distances:
Awards
IEEE Fellow
IEEE Fellow 2011, "For contributions to multimedia content analysis".
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification.
CoRR, 2024
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
A Video is Worth 10, 000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval.
CoRR, 2023
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback.
CoRR, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Assured Autonomy, 2023
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Class Prototypes based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2.
CoRR, 2022
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2.
Proceedings of the Conference on Lifelong Learning Agents, 2022
System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games.
Proceedings of the Second International Conference on AI-ML Systems, 2022
2021
Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness.
CoRR, 2021
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction.
Proceedings of the 2021 9th International Conference on Affective Computing and Intelligent Interaction, 2021
2020
Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition, and Selective Transfer.
CoRR, 2020
Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks.
CoRR, 2020
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020
2019
Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks.
CoRR, 2019
Lucid Explanations Help: Using a Human-AI Image-Guessing Game to Evaluate Machine Explanation Helpfulness.
CoRR, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval.
Proceedings of the Seventh AAAI Conference on Human Computation and Crowdsourcing, 2019
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
2018
CoRR, 2018
2017
CoRR, 2017
2016
Proceedings of the Sixth International Conference on Learning Analytics & Knowledge, 2016
2015
2nd Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication (HCMC2015).
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Exploiting Multimodal Affect and Semantics to Identify Politically Persuasive Web Videos.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
The Tower Game Dataset: A multimodal dataset for analyzing social interaction predicates.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013
Proceedings of the ACM Multimedia Conference, 2013
Affect analysis in natural human interaction using Joint Hidden Conditional Random Fields.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Leveraging a Generalized Tutoring Framework in Exploratory Simulations Of Ill-Defined Domains.
Proceedings of the Workshops at the 16th International Conference on Artificial Intelligence in Education AIED 2013, 2013
2012
On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks.
Int. J. Multim. Data Eng. Manag., 2012
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
How to put it into words - using random forests to extract symbol level descriptions from audio content for concept detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Evaluation of low-level features and their combinations for complex event detection in open source videos.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
2011
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011
On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011
2009
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009
2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Detection of music segment boundaries using audio-visual features for a personal video recorder.
IEEE Trans. Consumer Electron., 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
2006
An enhanced video summarization system using audio features for a personal video recorder.
IEEE Trans. Consumer Electron., 2006
A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from "Unscripted" Multimedia.
EURASIP J. Adv. Signal Process., 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
A highlight scene detection and video summarization system using audio feature for a personal video recorder.
IEEE Trans. Consumer Electron., 2005
Modeling sports highlights using a time-series clustering framework and model interpretation.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2005, 2005
Highlights extraction from sports video based on an audio-visual marker detection framework.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications].
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the Electronic Imaging: Image and Video Communications and Processing 2005, 2005
2004
Pattern Recognit. Lett., 2004
J. Vis. Commun. Image Represent., 2004
Proceedings of the Visual Communications and Image Processing 2004, 2004
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004
A time series clustering based framework for multimedia mining and summarization using audio features.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004
Effective and efficient sports highlights extraction using the minimum description length criterion in selecting GMM structures.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Time series analysis and segmentation using eigenvectors for mining semantic audio label sequences.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Adaptive fast playback-based video skimming using a compressed-domain visual complexity measure.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Discovering meaningful multimedia patterns with audio-visual concepts and associated text.
Proceedings of the 2004 International Conference on Image Processing, 2004
Proceedings of the 2004 International Conference on Image Processing, 2004
2003
J. Vis. Commun. Image Represent., 2003
Procedure for audio-assisted browsing of news video using generalized sound recognition.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003
Automatic extraction of soccer video highlights using a combination of motion and audio features.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003
Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
Generation of sports highlights using motion activity in combination with a common audio feature extraction framework.
Proceedings of the 2003 International Conference on Image Processing, 2003
Feature selection for unsupervised discovery of statistical temporal structures in video.
Proceedings of the 2003 International Conference on Image Processing, 2003
Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Comparing MFCC and MPEG-7 audio features for feature extraction, maximum likelihood HMM and entropic prior HMM for sports audio classification.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Rapid generation of sports video highlights using the MPEG-7 motion activity descriptor.
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002
Representation of motion activity in hierarchical levels for video indexing and filtering.
Proceedings of the 2002 International Conference on Image Processing, 2002
Proceedings of the 2002 International Conference on Image Processing, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
Video summarization using descriptors of motion activity: A motion activity based approach to key-frame extraction from video shots.
J. Electronic Imaging, 2001
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001
Proceedings of the Advances in Multimedia Information Processing, 2001
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
A Novel Pair-Wise Comparison Based Analytical Framework For Automatic Measurement Of Intensity Of Motion Activity Of Video Segments.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001
Proceedings of the 2001 International Conference on Image Processing, 2001
Proceedings of the Computer Analysis of Images and Patterns, 9th International Conference, 2001
2000
IEEE Trans. Consumer Electron., 2000
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000
A Region Based Descriptor for Spatial Distribution of Motion Activity for Compressed Video.
Proceedings of the 2000 International Conference on Image Processing, 2000
1999
Proceedings of the Storage and Retrieval for Image and Video Databases VII, 1999
1997
IEEE Trans. Circuits Syst. Video Technol., 1997
1995
IEEE Trans. Inf. Theory, 1995