Yuxuan Sun
Orcid: 0000-0002-1277-4316Affiliations:
- Zhejiang University, College of Computer Science and Technology, Hangzhou, China
- Westlake University, School of Engineering, Hangzhou, China
According to our database1,
Yuxuan Sun
authored at least 33 papers
between 2020 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic.
CoRR, May, 2025
Towards Effective and Efficient Context-aware Nucleus Detection in Histopathology Whole Slide Images.
CoRR, March, 2025
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
IEEE Trans. Medical Imaging, January, 2024
Large-scale cervical precancerous screening via AI-assisted cytology whole slide image analysis.
CoRR, 2024
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration.
CoRR, 2024
UWB Radar Signal Kick Detection for Tailgate Unlocking Based on Spatio-Temporal Network.
Proceedings of the IEEE International Conference on Mobility, 2024
PathUp: Patch-wise Timestep Tracking for Multi-class Large Pathology Image Synthesising Diffusion Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Context-Aware Text-Assisted Multimodal Framework for Cervical Cytology Cell Diagnosis and Chatting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
PathAsst: A Generative Foundation AI Assistant towards Artificial General Intelligence of Pathology.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
CoRR, 2023
Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification.
CoRR, 2023
PathAsst: Redefining Pathology through Generative Foundation AI Assistant for Pathology.
CoRR, 2023
Assessing the Robustness of Deep Learning-Assisted Pathological Image Analysis Under Practical Variables of Imaging System.
Proceedings of the IEEE International Conference on Acoustics, 2023
Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022
Weakly Supervised Classification using Multi-Level Instance-Aware Optimization on Cervical Cytologic Image.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Pattern Anal. Appl., 2020
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020
RIVA: A Pre-trained Tweet Multimodal Model Based on Text-image Relation for Multimodal NER.
Proceedings of the 28th International Conference on Computational Linguistics, 2020