Xianhang Li

Orcid: 0009-0001-9536-1161

According to our database¹, Xianhang Li authored at least 28 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers.

[BibT_eX]

[DOI]

CoRR, September, 2025

A New Benchmark for Evaluating Code Translation with Third-Party Libraries.

[BibT_eX]

[DOI]

CoRR, September, 2025

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

What If We Recaption Billions of Web Images with LLaMA-3?

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Autoregressive Pretraining with Mamba in Vision.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient VideoMAE via Temporal Progressive Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024

Unleashing the Power of Visual Prompting At the Pixel Level.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers.

[BibT_eX]

[DOI]

Medical Image Anal., 2024

CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions.

[BibT_eX]

[DOI]

CoRR, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context.

[BibT_eX]

[DOI]

CoRR, 2024

Scaling White-Box Transformers for Vision.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Brain Tumor Segmentation Through Supervoxel Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

L2B: Learning to Bootstrap Robust Models for Combating Label Noise.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Masked Autoencoders are Secretly Efficient Learners.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Adversarial Training at Scale.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a $10, 000 Budget; An Extra $4, 000 Unlocks 81.8% Accuracy.

[BibT_eX]

[DOI]

Xianhang Li

Zeyu Wang

Cihang Xie

CoRR, 2023

An Inverse Scaling Law for CLIP Training.

[BibT_eX]

[DOI]

Xianhang Li

Zeyu Wang

Cihang Xie

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge.

[BibT_eX]

[DOI]

Proceedings of the Brain Tumor Segmentation, and Cross-Modality Domain Adaptation for Medical Image Segmentation, 2023

Consistency-Guided Meta-learning for Bootstrapping Semi-supervised Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

2022

Learning to Bootstrap for Combating Label Noise.

[BibT_eX]

[DOI]

CoRR, 2022

Pose-guided Generative Adversarial Net for Novel View Action Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Fast AdvProp.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

In Defense of Image Pre-Training for Spatiotemporal Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

CT-Net: Channel Tensorization Network for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

SmallBigNet: Integrating Core and Contextual Views for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Xianhang Li

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...