Qingyu Chen

CoRR, November, 2025

Rethinking Retrieval-Augmented Generation for Medicine: A Large-Scale, Systematic Expert Evaluation and Practical Insights.

CoRR, November, 2025

LMOD+: A Comprehensive Multimodal Dataset and Benchmark for Developing and Evaluating Multimodal Large Language Models in Ophthalmology.

CoRR, September, 2025

Memorization in Large Language Models in Medicine: Prevalence, Characteristics, and Implications.

CoRR, September, 2025

Performance of GPT-5 Frontier Models in Ophthalmology Question Answering.

CoRR, August, 2025

BEnchmarking LLMs for Ophthalmology (BELO) for Ophthalmological Knowledge and Reasoning.

CoRR, July, 2025

Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards.

CoRR, June, 2025

Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items.

Minjie Zou

Sahana Srinivasan

Thaddaeus Wai Soon Lo

CoRR, April, 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.

CoRR, March, 2025

GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking.

CoRR, February, 2025

Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases?

CoRR, February, 2025

Dialogue is Better Than Monologue: Instructing Medical LLMs via Strategical Conversations.

CoRR, January, 2025

Can OpenAI o1 Reason Well in Ophthalmology? A 6,990-Question Head-to-Head Evaluation Study.

Thaddaeus Wai Soon Lo

CoRR, January, 2025

Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives.

CoRR, January, 2025

Medical foundation large language models for comprehensive text analysis and beyond.

npj Digit. Medicine, 2025

Author Correction: Small language models learn enhanced reasoning skills from medical textbooks.

npj Digit. Medicine, 2025

Small language models learn enhanced reasoning skills from medical textbooks.

npj Digit. Medicine, 2025

Social determinants of health extraction from clinical notes across institutions using large language models.

npj Digit. Medicine, 2025

Word-Sequence Entropy: Towards uncertainty estimation in free-form medical question answering applications and beyond.

Eng. Appl. Artif. Intell., 2025

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

AMD-Mamba: A Phenotype-Aware Multi-modal Framework for Robust AMD Prognosis.

Proceedings of the Machine Learning in Medical Imaging - 16th International Workshop, 2025

Clinical Trial Eligibility Criteria Decomposition and Parsing with Large Language Models.

Proceedings of the MEDINFO 2025 - Healthcare Smart × Medicine Deep, 2025

Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Archive of PubTator 3.0 source code and trained models.

Dataset, March, 2024

GeneGPT: augmenting large language models with domain tools for improved access to biomedical information.

Bioinform., February, 2024

Opportunities and challenges for ChatGPT and large language models in biomedicine and health.

Briefings Bioinform., January, 2024

Augmenting biomedical named entity recognition with general-domain resources.

J. Biomed. Informatics, 2024

Improving large language models for clinical named entity recognition via prompt engineering.

J. Am. Medical Informatics Assoc., 2024

Ophthalmic care may not align with patient need: An analysis on state-wide patient needs and provider density between 2008 and 2022.

Aidan Gilson

Ron A. Adelman

Int. J. Medical Informatics, 2024

Information Extraction from Clinical Notes: Are We Ready to Switch to Large Language Models?

CoRR, 2024

Humans Continue to Outperform Large Language Models in Complex Clinical Decision-Making: A Study with Medical Calculators.

CoRR, 2024

Demystifying Large Language Models for Medicine: A Primer.

CoRR, 2024

Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model.

Luciano V. Del Priore

CoRR, 2024

Towards Accountable AI-Assisted Eye Disease Diagnosis: Workflow Design, External Validation, and Continual Learning.

CoRR, 2024

Enhancing Large Language Models with Domain-specific Retrieval Augment Generation: A Case Study on Long-form Consumer Health Question Answering in Ophthalmology.

CoRR, 2024

Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond.

CoRR, 2024

AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning.

CoRR, 2024

Me LLaMA: Foundation Large Language Models for Medical Applications.

CoRR, 2024

PubTator 3.0: an AI-powered Literature Resource for Unlocking Biomedical Knowledge.

CoRR, 2024

PubMed Computed Authors in 2024: an open resource of disambiguated author names in biomedical literature.

Bioinform., 2024

Advancing entity recognition in biomedicine via instruction tuning of large language models.

Bioinform., 2024

MedCalc-Bench: Evaluating Large Language Models for Medical Calculations.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MedINST: Meta Dataset of Biomedical Instructions.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques.

Rui Yang

Haoran Liu

Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

Yale at "Discharge Me!": Evaluating Constrained Generation of Discharge Summaries with Unstructured and Structured Information.

Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

2023

BioREx: Improving biomedical relation extraction by leveraging heterogeneous datasets.

J. Biomed. Informatics, October, 2023

MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval.

Bioinform., October, 2023

AIONER: all-in-one scheme-based biomedical named entity recognition using deep learning.

Bioinform., May, 2023

Comprehensively identifying Long Covid articles with human-in-the-loop machine learning.

Patterns, January, 2023

LitCovid in 2022: an information resource for the COVID-19 literature.

Nucleic Acids Res., January, 2023

MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing.

CoRR, 2023

Integrating UMLS Knowledge into Large Language Models for Medical Question Answering.

Rui Yang

CoRR, 2023

BioCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval.

CoRR, 2023

Large language models in biomedical natural language processing: benchmarks, baselines, and recommendations.

Jingcheng Du

Yan Hu

CoRR, 2023

Interpretable Medical Image Visual Question Answering via Multi-Modal Relationship Graph Learning.

CoRR, 2023

Bioformer: an efficient transformer language model for biomedical text mining.

CoRR, 2023

Attention-based 3D convolutional networks for detection of geographic atrophy from optical coherence tomography scans.

Proceedings of the Medical Imaging 2023: Image Processing, 2023

2022

LitMC-BERT: Transformer-Based Multi-Label Classification of Biomedical Literature With An Application on COVID-19 Literature Curation.

IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Robust convolutional neural networks against adversarial attacks on medical images.

Pattern Recognit., 2022

Predicting myocardial infarction through retinal scans and minimal personal information.

Erica Dall'Armellina

Nat. Mach. Intell., 2022

Comprehensive identification of Long Covid articles with human-in-the-loop machine learning.

Robert Leaman

Rezarta Islamaj Dogan

CoRR, 2022

Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations.

CoRR, 2022

A Privacy-Preserving Unsupervised Domain Adaptation Framework for Clinical Text Analysis.

CoRR, 2022

Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations.

Database J. Biol. Databases Curation, 2022

Assigning species information to corresponding genes by a sequence labeling framework.

Database J. Biol. Databases Curation, 2022

Predicting Age-related Macular Degeneration Progression with Longitudinal Fundus Images Using Deep Learning.

Benjamin S. Glicksberg

Proceedings of the Machine Learning in Medical Imaging - 13th International Workshop, 2022

Deep learning automated diagnosis and quantitative classification of cataract type and severity: quantifying the effectiveness and usability of deep learning-assisted disease diagnosis models with 14 ophthalmologists and multi-center validations.

Proceedings of the AMIA 2022, 2022

Automated and Accessible Diagnosis of Age-related Macular Degeneration: a Comparative Analysis of the impact of machine learning models in clinical diagnostic Workflows.

Proceedings of the AMIA 2022, 2022

2021

LitSuggest: a web-based system for literature recommendation and curation using machine learning.

Nucleic Acids Res., 2021

LitCovid: an open database of COVID-19 literature.

Chantal Cousineau-Krieger

Alexis Allot

Zhiyong Lu

Nucleic Acids Res., 2021

Multimodal, multitask, multiattention (M3) deep learning detection of reticular pseudodrusen: Toward automated and accessible classification of age-related macular degeneration.

Caroline C. W. Klaver

Daniel T. Luttikhuizen

J. Am. Medical Informatics Assoc., 2021

Learning Structure from Visual Semantic Features and Radiology Ontology for Lymph Node Classification on MRI.

Proceedings of the Machine Learning in Medical Imaging - 12th International Workshop, 2021

Long Covid: A Comprehensive Collection of Articles Regarding Long-Haul Symptoms in COVID-19 Survivors.

Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Multi-task deep learning-based survival analysis on the prognosis of late AMD using the longitudinal data in AREDS.

Gregory C. Ghahramani

Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Deep learning detection of reticular pseudodrusen using multi-modal, multi-task, and multi-attention mechanisms: towards automated and accessible classification of age-related macular degeneration.

Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

AM2BERT: attention guided and regularized transformer-based multi-label classification model for COVID-19 literature curation.

Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

2020

Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records.

BMC Medical Informatics Decis. Mak., April, 2020

BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale.

PLoS Comput. Biol., 2020

Predicting risk of late age-related macular degeneration using deep learning.

npj Digit. Medicine, 2020

Better synonyms for enriching biomedical search.

J. Am. Medical Informatics Assoc., 2020

Privacy concerns of the Australian My Health Record: Implications for other large-scale opt-out personal health records.

Patrick Cheong-Iao Pang

Inf. Process. Manag., 2020

Quality Matters: Biocuration Experts on the Impact of Duplication and Other Data Quality Issues in Biological Databases.

Marc Robinson-Rechavi

Jana Sponarova

Justin Zobel

Karin Verspoor

Genom. Proteom. Bioinform., 2020

Multi-modal, multi-task, multi-attention (M3) deep learning detection of reticular pseudodrusen: towards automated and accessible classification of age-related macular degeneration.

Caroline C. W. Klaver

Daniel T. Luttikhuizen

Chantal Cousineau-Krieger

CoRR, 2020

Artificial Intelligence (AI) in Action: Addressing the COVID-19 Pandemic with Natural Language Processing (NLP).

CoRR, 2020

Navigating the landscape of COVID-19 research through literature analysis: A bird's eye view.

Lana Yeganova

Rezarta Islamaj Dogan

CoRR, 2020

An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining.

Yifan Peng

Zhiyong Lu

Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Detection of reticular pseudodrusen using deep learning.

Proceedings of the AMIA 2020, 2020

Automatic recognition of abdominal lymph nodes from clinical text.

Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020

2019

LitSense: making sense of biomedical literature at sentence level.

Nucleic Acids Res., 2019

Search Effectiveness in Nonredundant Sequence Databases: Assessments and Solutions.

J. Comput. Biol., 2019

ML-Net: multi-label classification of biomedical texts with deep neural networks.

J. Am. Medical Informatics Assoc., 2019

A deep learning approach for automated detection of geographic atrophy from color fundus photographs.

CoRR, 2019

Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.

Database J. Biol. Databases Curation, 2019

BioSentVec: creating sentence embeddings for biomedical texts.

Yifan Peng

Zhiyong Lu

Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Evaluation of Five Sentence Similarity Models on Electronic Medical Records.

Proceedings of the 10th ACM International Conference on Bioinformatics, 2019

A deep learning-based survival model for prediction of progression in late Age-related Macular Degeneration (AMD) from color fundus photographs.

Proceedings of the AMIA 2019, 2019

2018

Comparative Analysis of Sequence Clustering Methods for Deduplication of Biological Databases.

ACM J. Data Inf. Qual., 2018

A multi-task deep learning model for the classification of Age-related Macular Degeneration.

CoRR, 2018

DeepSeeNet: A deep learning model for automated classification of patient-based age-related macular degeneration severity from color fundus photographs.

CoRR, 2018

BioCreative VI Precision Medicine Track system performance is constrained by entity recognition and variations in corpus characteristics.

Database J. Biol. Databases Curation, 2018

Sentence Similarity Measures Revisited: Ranking Sentences in PubMed Documents.

Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

2017

Duplication in biological databases: definitions, impacts and methods.

PhD thesis, 2017

Duplicates, redundancies and inconsistencies in the primary nucleotide databases: a descriptive study.
