Kun Yuan

Orcid: 0000-0002-6030-8862

Affiliations:

Technical University of Munich, Center for Machine Learning, Munich, Germany
University of Strasbourg, CNRS, INSERM, ICube, UMR7357, Strasbourg, France

According to our database¹, Kun Yuan authored at least 23 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2026

EndoChat: Grounded multimodal large language model for endoscopic surgery.

[BibT_eX]

[DOI]

Medical Image Anal., 2026

2025

SurgLLM: A Versatile Large Multimodal Model with Spatial Focus and Temporal Awareness for Surgical Video Understanding.

[BibT_eX]

[DOI]

CoRR, September, 2025

SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model.

[BibT_eX]

[DOI]

CoRR, June, 2025

Text-driven adaptation of foundation models for few-shot surgical workflow analysis.

[BibT_eX]

[DOI]

Int. J. Comput. Assist. Radiol. Surg., June, 2025

EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy.

[BibT_eX]

[DOI]

CoRR, May, 2025

ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling.

[BibT_eX]

[DOI]

CoRR, May, 2025

Rethinking data imbalance in class incremental surgical instrument segmentation.

[BibT_eX]

[DOI]

Medical Image Anal., 2025

Learning multi-modal representations by watching hundreds of surgical video lectures.

[BibT_eX]

[DOI]

Medical Image Anal., 2025

Recognizing Surgical Phases Anywhere: Few-Shot Test-Time Adaptation and Task-Graph Guided Refinement.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery.

[BibT_eX]

[DOI]

Proceedings of the AI for Clinical Applications - First International Workshops, 2025

SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Multi-modal Representations for Fine-Grained Multi-Label Critical View of Safety Recognition.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Advancing surgical VQA with scene graph knowledge.

[BibT_eX]

[DOI]

Int. J. Comput. Assist. Radiol. Surg., July, 2024

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining.

[BibT_eX]

[DOI]

CoRR, 2024

Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

HecVL: Hierarchical Video-Language Pretraining for Zero-Shot Surgical Phase Recognition.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

2023

CholecTriplet2022: Show me a tool and tell me the triplet - An endoscopic vision challenge for surgical action triplet detection.

[BibT_eX]

[DOI]

Medical Image Anal., October, 2023

Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures.

[BibT_eX]

[DOI]

CoRR, 2023

2020

An Efficient Hybrid Model for Kidney Tumor Segmentation in CT Images.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Towards Content-Independent Multi-Reference Super-Resolution: Adaptive Pattern Matching and Feature Aggregation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Kun Yuan

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...