Le Xue
Orcid: 0000-0003-2810-770X
According to our database1,
Le Xue
authored at least 35 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D.
CoRR, June, 2025
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset.
CoRR, May, 2025
SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models.
CoRR, February, 2025
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images.
CoRR, February, 2025
SemiSAM+: Rethinking semi-supervised medical image segmentation in the era of foundation models.
Medical Image Anal., 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Diagnostic performance of artificial intelligence-assisted PET imaging for Parkinson's disease: a systematic review and meta-analysis.
npj Digit. Medicine, 2024
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models.
CoRR, 2024
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs.
CoRR, 2024
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.
CoRR, 2024
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
xGen-VideoSyn-1: High-Fidelity Text-to-Video Synthesis with Compressed Representations.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
OTFPF: Optimal transport based feature pyramid fusion network for brain age estimation.
Inf. Fusion, December, 2023
AIGAN: Attention-encoding Integrated Generative Adversarial Network for the reconstruction of low-dose CT and low-dose PET images.
Medical Image Anal., May, 2023
X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning.
CoRR, 2023
CoRR, 2023
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding.
CoRR, 2022
OTFPF: Optimal Transport-Based Feature Pyramid Fusion Network for Brain Age Estimation with 3D Overlapped ConvNeXt.
CoRR, 2022
Active Index: An Integrated Index to Reveal Disrupted Brain Network Organizations of Major Depressive Disorder Patients.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022
A Resource-Efficient Deep Learning Framework for Low-Dose Brain Pet Image Reconstruction and Analysis.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
Cross-Modality Generation of Amyloid PET from FDG PET for Alzheimer's Disease Diagnosis.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021