Le Xue

Orcid: 0000-0002-9627-0763

According to our database¹, Le Xue authored at least 39 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

PETWB-REP: A Multi-Cancer Whole-Body FDG PET/CT and Radiology Report Dataset for Medical Imaging Research.

CoRR, November, 2025

BLIP3o-NEXT: Next Frontier of Native Image Generation.

CoRR, October, 2025

VisCoP: Visual Probing for Video Domain Adaptation of Vision Language Models.

CoRR, October, 2025

PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography.

CoRR, August, 2025

Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D.

CoRR, June, 2025

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset.

CoRR, May, 2025

SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images.

CoRR, February, 2025

SemiSAM+: Rethinking semi-supervised medical image segmentation in the era of foundation models.

Medical Image Anal., 2025

Towards Multi-scenario Generalization: Text-Guided Unified Framework for Low-Dose CT and Total-Body PET Reconstruction.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living.

Dominick Reilly

Rajatsubhra Chakraborty

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Diagnostic performance of artificial intelligence-assisted PET imaging for Parkinson's disease: a systematic review and meta-analysis.

npj Digit. Medicine, 2024

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models.

CoRR, 2024

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions.

CoRR, 2024

xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs.

CoRR, 2024

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models.

CoRR, 2024

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.

CoRR, 2024

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Hierarchical Point Attention for Indoor 3D Object Detection.

Manli Shu

Le Xue

Ning Yu

Roberto Martín-Martín

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations.

Can Qin

Congying Xia

Krithika Ramakrishnan

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning.

Proceedings of the Computer Vision - ECCV 2024, 2024

ULIP-2: Towards Scalable Multimodal Pre-Training for 3D Understanding.

Roberto Martín-Martín

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

OTFPF: Optimal transport based feature pyramid fusion network for brain age estimation.

Inf. Fusion, December, 2023

AIGAN: Attention-encoding Integrated Generative Adversarial Network for the reconstruction of low-dose CT and low-dose PET images.

Medical Image Anal., May, 2023

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning.

CoRR, 2023

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents.

CoRR, 2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.

CoRR, 2023

REX: Rapid Exploration and eXploitation for AI Agents.

CoRR, 2023

ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding.

Roberto Martín-Martín

CoRR, 2023

Model-Agnostic Hierarchical Attention for 3D Object Detection.

Manli Shu

Le Xue

Ning Yu

Roberto Martín-Martín

Juan Carlos Niebles

Caiming Xiong

Ran Xu

CoRR, 2023

Robustness Evaluation of Transformer-Based Form Field Extractors via Form Attacks.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding.

Le Xue

Mingfei Gao

Chen Xing

Roberto Martín-Martín

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding.

Le Xue

Mingfei Gao

Chen Xing

Roberto Martín-Martín

CoRR, 2022

OTFPF: Optimal Transport-Based Feature Pyramid Fusion Network for Brain Age Estimation with 3D Overlapped ConvNeXt.

CoRR, 2022

Active Index: An Integrated Index to Reveal Disrupted Brain Network Organizations of Major Depressive Disorder Patients.

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

A Resource-Efficient Deep Learning Framework for Low-Dose Brain PET Image Reconstruction and Analysis.

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Value Retrieval with Arbitrary Queries for Form-like Documents.

CoRR, 2021

Cross-Modality Generation of Amyloid PET from FDG PET for Alzheimer's Disease Diagnosis.

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021
