Long Bai

Orcid: 0000-0002-9762-6821

Affiliations:
  • Chinese University of Hong Kong, Department of Electrical Engineering, Hong Kong


According to our database1, Long Bai authored at least 62 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
EndoChat: Grounded multimodal large language model for endoscopic surgery.
Medical Image Anal., 2026

2025
Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge.
CoRR, July, 2025

Geo-RepNet: Geometry-Aware Representation Learning for Surgical Phase Recognition in Endoscopic Submucosal Dissection.
CoRR, July, 2025

SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting.
CoRR, June, 2025

Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting.
CoRR, June, 2025

Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement.
CoRR, June, 2025

SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model.
CoRR, June, 2025

CapsDT: Diffusion-Transformer for Capsule Robot Manipulation.
CoRR, June, 2025

TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast.
CoRR, June, 2025

EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery.
CoRR, June, 2025

EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy.
CoRR, May, 2025

Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery.
CoRR, March, 2025

Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras.
CoRR, March, 2025

Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping.
CoRR, January, 2025

V²-SfMLearner: Learning Monocular Depth and Ego-Motion for Multimodal Wireless Capsule Endoscopy.
IEEE Trans Autom. Sci. Eng., 2025

Rethinking data imbalance in class incremental surgical instrument segmentation.
Medical Image Anal., 2025

Surgical-VQLA++: Adversarial contrastive learning for calibrated robust visual question-localized answering in robotic surgery.
Inf. Fusion, 2025

Multimodal graph representation learning for robust surgical workflow recognition with adversarial feature disentanglement.
Inf. Fusion, 2025

PedSemiSeg: Pedagogy-inspired semi-supervised polyp segmentation.
Comput. Medical Imaging Graph., 2025

PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery.
IEEE Trans. Medical Imaging, June, 2024

Surgical-DINO: adapter learning of foundation models for depth estimation in endoscopic surgery.
Int. J. Comput. Assist. Radiol. Surg., June, 2024

V<sup>2</sup>-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy.
CoRR, 2024

SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation.
CoRR, 2024

ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection.
CoRR, 2024

Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis.
CoRR, 2024

CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection.
CoRR, 2024

SurgPLAN++: Universal Surgical Phase Localization Network for Online and Offline Inference.
CoRR, 2024

SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation.
CoRR, 2024

Learning to Adapt Foundation Model DINOv2 for Capsule Endoscopy Diagnosis.
CoRR, 2024

Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery.
CoRR, 2024

Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting.
CoRR, 2024

SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge.
CoRR, 2024

Registering Neural 4D Gaussians for Endoscopic Surgery.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024

Web-Based Augmented Reality with Auto-Scaling and Real-Time Head Tracking Towards Markerless Neurointerventional Preoperative Planning and Training of Head-Mounted Robotic Needle Insertion.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024

A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 Workshops, 2024

Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Endoood: Uncertainty-Aware Out-of-Distribution Detection in Capsule Endoscopy Diagnosis.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

OSSAR: Towards Open-Set Surgical Activity Recognition in Robot-assisted Surgery.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Affecting Audience Valence and Arousal in 360 Immersive Environments: How Powerful Neural Style Transfer Is?
Proceedings of the Virtual, Augmented and Mixed Reality, 2024

2023
Transformer-based 3D U-Net for pulmonary vessel segmentation and artery-vein separation from CT images.
Medical Biol. Eng. Comput., October, 2023

Domain adaptive Sim-to-Real segmentation of oropharyngeal organs.
Medical Biol. Eng. Comput., October, 2023

Rethinking exemplars for continual semantic segmentation in endoscopy scenes: Entropy-based mini-batch pseudo-replay.
Comput. Biol. Medicine, October, 2023

Two-stage contextual transformer-based convolutional neural network for airway extraction from CT images.
Artif. Intell. Medicine, September, 2023

An RNN-LSTM Enhanced Compact and Affordable Micro Force Sensing System for Interventional Continuum Robots With Interchangeable End-Effector Instruments.
IEEE Trans. Instrum. Meas., 2023

Semi-supervised Learning for Segmentation of Bleeding Regions in Video Capsule Endoscopy.
CoRR, 2023

Landmark Detection using Transformer Toward Robot-assisted Nasal Airway Intubation.
CoRR, 2023

Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery.
CoRR, 2023

Sim-to-Real Segmentation in Robot-assisted Transoral Tracheal Intubation.
CoRR, 2023

Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge.
CoRR, 2023

The Exploration and Evaluation of Generating Affective 360° Panoramic VR Environments Through Neural Style Transfer.
CoRR, 2023

The Exploration and Evaluation of Generating Affective $360^{\circ}$ Panoramic VR Environments Through Neural Style Transfer.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

CAT-ViL: Co-attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Surgical-VQLA:Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world Corruptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2022


  Loading...