Xuelian Cheng

Orcid: 0009-0002-7014-5304

According to our database1, Xuelian Cheng authored at least 21 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences.
CoRR, March, 2026

BVD: A Two-Stage Network for Identifying Bronchial Variation Types from CT Images.
Proceedings of the 23rd IEEE International Symposium on Biomedical Imaging, 2026

DiffNR: Diffusion-Enhanced Neural Representation Optimization for Sparse-View 3D Tomographic Reconstruction.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
ColonAdapter: Geometry Estimation Through Foundation Model Adaptation for Colonoscopy.
IEEE Robotics Autom. Lett., December, 2025

RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation.
CoRR, November, 2025

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning.
CoRR, October, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs.
CoRR, June, 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding.
CoRR, May, 2025

Feasibility Study on Optimising the Efficacy of a Population Age Estimation Model for South China by Combined Machine Learning for the Second and Third Molars.
J. Imaging Inform. Medicine, 2025

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
EndoSurf: Neural Surface Reconstruction of Deformable Tissues with Stereo Endoscope Videos.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

2022
Deep Laparoscopic Stereo Matching with Transformers.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Implicit Motion Handling for Video Camouflaged Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Network Architecture for Open-Set Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020
Hierarchical Neural Architecture Search for Deep Stereo Matching.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Noise-Aware Unsupervised Deep Lidar-Stereo Fusion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
3D skeleton based action recognition by video-domain translation-scale invariant mapping and multi-scale dilated CNN.
Multim. Tools Appl., 2018

2017
Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep CNN.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017


  Loading...