Sushant Gautam

Orcid: 0000-0001-9232-2661

According to our database, Sushant Gautam authored at least 39 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
The Moltbook Observatory Archive: an incremental dataset of agent-only social network activity.
CoRR, May, 2026

When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels.
CoRR, May, 2026

Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation.
CoRR, March, 2026

VideoHEDGE: Entropy-Based Hallucination Detection for Video-VLMs via Semantic Clustering and Spatiotemporal Perturbations.
CoRR, January, 2026


2025
Beyond Audio: Enhancing SoccerNet-Echoes with Multimodal Event Extraction Using LLMs.
Int. J. Semantic Comput., December, 2025

HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models.
CoRR, November, 2025

Medico 2025: Visual Question Answering for Gastrointestinal Imaging.
CoRR, August, 2025

Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models.
CoRR, May, 2025

X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization.
CoRR, April, 2025

Accurate diabetic retinopathy segmentation and classification model using gated recurrent unit with residual attention network.
Biomed. Signal Process. Control., 2025

Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis.
Big Data Cogn. Comput., 2025

HockeyOrient: A Dataset for Ice Hockey Player Orientation Classification.
Proceedings of the 16th ACM Multimedia Systems Conference, 2025

HockeyRink: A Dataset for Precise Ice Hockey Rink Keypoint Mapping and Analytics.
Proceedings of the 16th ACM Multimedia Systems Conference, 2025

HockeyAI: A Multi-Class Ice Hockey Dataset for Object Detection.
Proceedings of the 16th ACM Multimedia Systems Conference, 2025

Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy.
Proceedings of the Data Engineering in Medical Imaging - Third MICCAI Workshop, 2025



Overview of ImageCLEFmedical 2025 - Visual Question Answering and Synthetic Image Generation for Gastrointestinal Tract.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models.
Proceedings of the 38th IEEE International Symposium on Computer-Based Medical Systems, 2025

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding.
Proceedings of the International Conference on Content-Based Multimedia Indexing, 2025

2024
The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch.
Dataset, February, 2024

TACDEC: Dataset of Tackle Events in Soccer Game Videos.
Dataset, February, 2024

Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study.
CoRR, 2024

Kvasir-VQA: A Text-Image Pair GI Tract Dataset.
CoRR, 2024

FactGenius: Combining Zero-Shot Prompting and Fuzzy Relation Mining to Improve Fact Verification with Knowledge Graphs.
CoRR, 2024

The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

TACDEC: Dataset of Tackle Events in Soccer Game Videos.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

AI-Based Sports Highlight Generation for Social Media.
Proceedings of the 3rd Mile-High Video Conference, 2024

PlayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips.
Proceedings of the IEEE International Symposium on Multimedia, 2024

SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset.
Proceedings of the IEEE International Symposium on Multimedia, 2024

Soccer-GraphRAG: Applications of GraphRAG in Soccer.
Proceedings of the Advances on Graph-Based Approaches in Information Retrieval, 2024

Demo: Soccer Information Retrieval Via Natural Queries using SoccerRAG.
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024

SoccerRAG: Multimodal Soccer Information Retrieval via Natural Queries.
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024

Demo: Creating Player-Specific Soccer Highlight Clips with PlayerTV.
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024

2023
Soccer on Social Media.
CoRR, 2023

Bridging Multimedia Modalities: Enhanced Multimodal AI Understanding and Intelligent Agents.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

2022
Soccer Game Summarization using Audio Commentary, Metadata, and Captions.
Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos (NarSUM '22), 2022

