Long Bai
Orcid: 0000-0002-9762-6821Affiliations:
- Chinese University of Hong Kong, Department of Electrical Engineering, Hong Kong
According to our database1,
Long Bai
authored at least 63 papers
between 2022 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
Medical Image Anal., 2026
2025
Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge.
CoRR, July, 2025
Geo-RepNet: Geometry-Aware Representation Learning for Surgical Phase Recognition in Endoscopic Submucosal Dissection.
CoRR, July, 2025
SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model.
CoRR, June, 2025
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast.
CoRR, June, 2025
EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery.
CoRR, June, 2025
EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy.
CoRR, May, 2025
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras.
CoRR, March, 2025
V²-SfMLearner: Learning Monocular Depth and Ego-Motion for Multimodal Wireless Capsule Endoscopy.
IEEE Trans Autom. Sci. Eng., 2025
Medical Image Anal., 2025
Surgical-VQLA++: Adversarial contrastive learning for calibrated robust visual question-localized answering in robotic surgery.
Inf. Fusion, 2025
Multimodal graph representation learning for robust surgical workflow recognition with adversarial feature disentanglement.
Inf. Fusion, 2025
Comput. Medical Imaging Graph., 2025
Recognizing Surgical Phases Anywhere: Few-Shot Test-Time Adaptation and Task-Graph Guided Refinement.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery.
Proceedings of the AI for Clinical Applications - First International Workshops, 2025
SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025
Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025
ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-Assisted Endoscopic Submucosal Dissection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025
Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-Driven Surface Normal-Aware Tracking and Mapping.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025
SurgPLAN++: Universal Surgical Phase Localization Network for Online and Offline Inference.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025
PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
IEEE Trans. Medical Imaging, June, 2024
Surgical-DINO: adapter learning of foundation models for depth estimation in endoscopic surgery.
Int. J. Comput. Assist. Radiol. Surg., June, 2024
V<sup>2</sup>-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy.
CoRR, 2024
SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation.
CoRR, 2024
Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis.
CoRR, 2024
CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection.
CoRR, 2024
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation.
CoRR, 2024
CoRR, 2024
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery.
CoRR, 2024
CoRR, 2024
SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge.
CoRR, 2024
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024
Web-Based Augmented Reality with Auto-Scaling and Real-Time Head Tracking Towards Markerless Neurointerventional Preoperative Planning and Training of Head-Mounted Robotic Needle Insertion.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2024
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 Workshops, 2024
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
Endoood: Uncertainty-Aware Out-of-Distribution Detection in Capsule Endoscopy Diagnosis.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Affecting Audience Valence and Arousal in 360 Immersive Environments: How Powerful Neural Style Transfer Is?
Proceedings of the Virtual, Augmented and Mixed Reality, 2024
2023
Transformer-based 3D U-Net for pulmonary vessel segmentation and artery-vein separation from CT images.
Medical Biol. Eng. Comput., October, 2023
Medical Biol. Eng. Comput., October, 2023
Rethinking exemplars for continual semantic segmentation in endoscopy scenes: Entropy-based mini-batch pseudo-replay.
Comput. Biol. Medicine, October, 2023
Two-stage contextual transformer-based convolutional neural network for airway extraction from CT images.
Artif. Intell. Medicine, September, 2023
An RNN-LSTM Enhanced Compact and Affordable Micro Force Sensing System for Interventional Continuum Robots With Interchangeable End-Effector Instruments.
IEEE Trans. Instrum. Meas., 2023
Semi-supervised Learning for Segmentation of Bleeding Regions in Video Capsule Endoscopy.
CoRR, 2023
CoRR, 2023
Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery.
CoRR, 2023
Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge.
CoRR, 2023
The Exploration and Evaluation of Generating Affective 360° Panoramic VR Environments Through Neural Style Transfer.
CoRR, 2023
The Exploration and Evaluation of Generating Affective $360^{\circ}$ Panoramic VR Environments Through Neural Style Transfer.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023
CAT-ViL: Co-attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023
Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023
LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023
Surgical-VQLA:Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world Corruptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2022