Xianhang Li

Orcid: 0009-0001-9536-1161

According to our database1, Xianhang Li authored at least 25 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.
CoRR, May, 2025

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Autoregressive Pretraining with Mamba in Vision.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient VideoMAE via Temporal Progressive Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
Unleashing the Power of Visual Prompting At the Pixel Level.
Trans. Mach. Learn. Res., 2024

TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers.
Medical Image Anal., 2024

CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions.
CoRR, 2024

What If We Recaption Billions of Web Images with LLaMA-3?
CoRR, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context.
CoRR, 2024

Scaling White-Box Transformers for Vision.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Brain Tumor Segmentation Through Supervoxel Transformer.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

L2B: Learning to Bootstrap Robust Models for Combating Label Noise.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Masked Autoencoders are Secretly Efficient Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Adversarial Training at Scale.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers.
CoRR, 2023

CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a $10, 000 Budget; An Extra $4, 000 Unlocks 81.8% Accuracy.
CoRR, 2023

An Inverse Scaling Law for CLIP Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge.
Proceedings of the Brain Tumor Segmentation, and Cross-Modality Domain Adaptation for Medical Image Segmentation, 2023

Consistency-Guided Meta-learning for Bootstrapping Semi-supervised Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

2022
Learning to Bootstrap for Combating Label Noise.
CoRR, 2022

Pose-guided Generative Adversarial Net for Novel View Action Synthesis.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Fast AdvProp.
Proceedings of the Tenth International Conference on Learning Representations, 2022

In Defense of Image Pre-Training for Spatiotemporal Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
CT-Net: Channel Tensorization Network for Video Classification.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
SmallBigNet: Integrating Core and Contextual Views for Video Classification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...