Sushant Gautam
Orcid: 0000-0001-9232-2661
According to our database, Sushant Gautam authored at least 39 papers between 2022 and 2026.
Bibliography
2026
The Moltbook Observatory Archive: an incremental dataset of agent-only social network activity.
CoRR, May, 2026
When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels.
CoRR, May, 2026
Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation.
CoRR, March, 2026
VideoHEDGE: Entropy-Based Hallucination Detection for Video-VLMs via Semantic Clustering and Spatiotemporal Perturbations.
CoRR, January, 2026
Proceedings of the Advances in Information Retrieval, 2026
2025
Beyond Audio: Enhancing SoccerNet-Echoes with Multimodal Event Extraction Using LLMs.
Int. J. Semantic Comput., December, 2025
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models.
CoRR, November, 2025
CoRR, August, 2025
CoRR, May, 2025
CoRR, April, 2025
Accurate diabetic retinopathy segmentation and classification model using gated recurrent unit with residual attention network.
Biomed. Signal Process. Control., 2025
Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis.
Big Data Cogn. Comput., 2025
Proceedings of the 16th ACM Multimedia Systems Conference, 2025
Proceedings of the 16th ACM Multimedia Systems Conference, 2025
Proceedings of the 16th ACM Multimedia Systems Conference, 2025
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy.
Proceedings of the Data Engineering in Medical Imaging - Third MICCAI Workshop, 2025
ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications.
Proceedings of the Advances in Information Retrieval, 2025
Overview of ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025
Overview of ImageCLEFmedical 2025 - Visual Question Answering and Synthetic Image Generation for Gastrointestinal Tract.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025
Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models.
Proceedings of the 38th IEEE International Symposium on Computer-Based Medical Systems, 2025
Proceedings of the International Conference on Content-Based Multimedia Indexing, 2025
2024
The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch.
Dataset, February, 2024
CoRR, 2024
FactGenius: Combining Zero-Shot Prompting and Fuzzy Relation Mining to Improve Fact Verification with Knowledge Graphs.
CoRR, 2024
The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024
Proceedings of the 15th ACM Multimedia Systems Conference, 2024
Proceedings of the 15th ACM Multimedia Systems Conference, 2024
Proceedings of the 3rd Mile-High Video Conference, 2024
PlayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips.
Proceedings of the IEEE International Symposium on Multimedia, 2024
Proceedings of the IEEE International Symposium on Multimedia, 2024
Proceedings of the Advances on Graph-Based Approaches in Information Retrieval, 2024
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024
2023
Bridging Multimedia Modalities: Enhanced Multimodal AI Understanding and Intelligent Agents.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023
2022
Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos (NarSUM '22), 2022