Yuxuan Sun

ORCID: 0000-0002-1277-4316

Affiliations:

Zhejiang University, College of Computer Science and Technology, Hangzhou, China
Westlake University, School of Engineering, Hangzhou, China

According to our database¹, Yuxuan Sun authored at least 37 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

PathBench: Advancing the Benchmark of Large Multimodal Models for Pathology Image Understanding at Patch and Whole Slide Level.

IEEE Trans. Medical Imaging, October, 2025

ToPoFM: Topology-Guided Pathology Foundation Model for High-Resolution Pathology Image Synthesis With Cellular-Level Control.

IEEE Trans. Medical Imaging, October, 2025

Agent Learning via Early Experience.

CoRR, October, 2025

CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic.

CoRR, May, 2025

Towards Effective and Efficient Context-aware Nucleus Detection in Histopathology Whole Slide Images.

CoRR, March, 2025

Benchmarking PathCLIP for Pathology Image Analysis.

J. Imaging Inform. Medicine, 2025

AEM: Attention Entropy Maximization for Multiple Instance Learning Based Whole Slide Image Classification.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

AAAR-1.0: Assessing AI's Potential to Assist Research.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Stable Test-Time Training for Semantic Segmentation with Output Contrastive Loss.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Masked Conditional Variational Autoencoders for Chromosome Straightening.

Peter M. A. van Ooijen, Kang Li, Lin Yang

IEEE Trans. Medical Imaging, January, 2024

Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis.

CoRR, 2024

PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration.

CoRR, 2024

UWB Radar Signal Kick Detection for Tailgate Unlocking Based on Spatio-Temporal Network.

Proceedings of the IEEE International Conference on Mobility, 2024

PathUp: Patch-wise Timestep Tracking for Multi-class Large Pathology Image Synthesising Diffusion Model.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Context-Aware Text-Assisted Multimodal Framework for Cervical Cytology Cell Diagnosis and Chatting.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction Following.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology.

Proceedings of the Computer Vision - ECCV 2024, 2024

Unleashing the Power of Prompt-Driven Nucleus Instance Segmentation.

Proceedings of the Computer Vision - ECCV 2024, 2024

MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation.

CoRR, 2023

Test-Time Training for Semantic Segmentation with Output Contrastive Loss.

CoRR, 2023

Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification.

CoRR, 2023

Multimodal Question Answering for Unified Information Extraction.

Yuxuan Sun, Kai Zhang, Yu Su

CoRR, 2023

PathAsst: Redefining Pathology through Generative Foundation AI Assistant for Pathology.

CoRR, 2023

Assessing the Robustness of Deep Learning-Assisted Pathological Image Analysis Under Practical Variables of Imaging System.

Proceedings of the IEEE International Conference on Acoustics, 2023

Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Weakly Supervised Classification using Multi-Level Instance-Aware Optimization on Cervical Cytologic Image.

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

Category Separation For Weakly Supervised Multi-Class Cell Counting.

Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

2020

Joint Learning of Token Context and Span Feature for Span-Based Nested NER.

IEEE ACM Trans. Audio Speech Lang. Process., 2020

ETIP: a lengthy nested NER problem for Chinese insurance policy analysis.

Pattern Anal. Appl., 2020

Attention-based Deep Learning Model for Text Readability Evaluation.

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

RIVA: A Pre-trained Tweet Multimodal Model Based on Text-image Relation for Multimodal NER.

Proceedings of the 28th International Conference on Computational Linguistics, 2020
