Sushant Gautam

Orcid: 0000-0001-9232-2661

According to our database, Sushant Gautam authored at least 31 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Medico 2025: Visual Question Answering for Gastrointestinal Imaging.
CoRR, August, 2025

Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy.
CoRR, June, 2025

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding.
CoRR, May, 2025

Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models.
CoRR, May, 2025

X-DECODE: EXtreme Deblurring with Curriculum Optimization and Domain Equalization.
CoRR, April, 2025

Accurate diabetic retinopathy segmentation and classification model using gated recurrent unit with residual attention network.
Biomed. Signal Process. Control., 2025

HockeyOrient: A Dataset for Ice Hockey Player Orientation Classification.
Proceedings of the 16th ACM Multimedia Systems Conference, 2025

HockeyRink: A Dataset for Precise Ice Hockey Rink Keypoint Mapping and Analytics.
Proceedings of the 16th ACM Multimedia Systems Conference, 2025

HockeyAI: A Multi-Class Ice Hockey Dataset for Object Detection.
Proceedings of the 16th ACM Multimedia Systems Conference, 2025

Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models.
Proceedings of the 38th IEEE International Symposium on Computer-Based Medical Systems, 2025

2024
The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch.
Dataset, February, 2024

TACDEC: Dataset of Tackle Events in Soccer Game Videos.
Dataset, February, 2024

Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis.
CoRR, 2024

Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study.
CoRR, 2024

Kvasir-VQA: A Text-Image Pair GI Tract Dataset.
CoRR, 2024

FactGenius: Combining Zero-Shot Prompting and Fuzzy Relation Mining to Improve Fact Verification with Knowledge Graphs.
CoRR, 2024

The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

TACDEC: Dataset of Tackle Events in Soccer Game Videos.
Proceedings of the 15th ACM Multimedia Systems Conference, 2024

AI-Based Sports Highlight Generation for Social Media.
Proceedings of the 3rd Mile-High Video Conference, 2024

PlayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips.
Proceedings of the IEEE International Symposium on Multimedia, 2024

SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset.
Proceedings of the IEEE International Symposium on Multimedia, 2024

Soccer-GraphRAG: Applications of GraphRAG in Soccer.
Proceedings of the Advances on Graph-Based Approaches in Information Retrieval, 2024

Demo: Soccer Information Retrieval Via Natural Queries using SoccerRAG.
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024

SoccerRAG: Multimodal Soccer Information Retrieval via Natural Queries.
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024

Demo: Creating Player-Specific Soccer Highlight Clips with PlayerTV.
Proceedings of the 21st International Conference on Content-Based Multimedia Indexing, 2024

2023
Soccer on Social Media.
CoRR, 2023

Bridging Multimedia Modalities: Enhanced Multimodal AI Understanding and Intelligent Agents.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

2022
Soccer Game Summarization using Audio Commentary, Metadata, and Captions.
Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos (NarSUM '22), 2022

