Shitian Zhao
According to our database1,
Shitian Zhao authored at least 22 papers
between 2024 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models.
IEEE Trans. Medical Imaging, June, 2026
Int. J. Comput. Vis., April, 2026
2025
IEEE Trans. Medical Imaging, December, 2025
CoRR, November, 2025
CoRR, March, 2025
Think or Not Think: A Study of Explicit Thinking inRule-Based Visual Reinforcement Fine-Tuning.
CoRR, March, 2025
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models.
CoRR, March, 2025
IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models.
CoRR, January, 2025
To Think or Not To Think: A Study of Thinking in Rule-Based Visual Reinforcement Fine-Tuning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Fontanimate: High Quality Few-Shot Font Generation Via Animating Font Transfer Process.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions.
CoRR, 2024
Boosting Open-Domain Continual Learning via Leveraging Intra-domain Category-aware Prototype.
CoRR, 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining.
CoRR, 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models.
CoRR, 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-Modal Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024