Yonghui Wu

Orcid: 0000-0002-6780-6135

Affiliations:
  • University of Florida, Gainesville, FL, USA


According to our database, Yonghui Wu authored at least 114 papers between 2008 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
A Study of Large Language Models for Patient Information Extraction: Model Architecture, Fine-Tuning Strategy, and Multi-task Instruction Tuning.
CoRR, September, 2025

A Topic Modeling Analysis of Stigma Dimensions, Social, and Related Behavioral Circumstances in Clinical Notes Among Patients with HIV.
CoRR, June, 2025

Medical foundation large language models for comprehensive text analysis and beyond.
npj Digit. Medicine, 2025

Leveraging undecided cases in chart-reviewed phenotypes to enhance EHR-based association studies.
J. Biomed. Informatics, 2025

Improving Medical Visual Instruction Tuning with Labeled Datasets.
Proceedings of the Foundation Models for General Medical AI - Third International Workshop, 2025

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Comparative Evaluation of Clinical Large Language Models and Machine Learning to Predict Antimicrobial Resistance in Hospital-Onset Sepsis.
Proceedings of the Artificial Intelligence in Medicine - 23rd International Conference, 2025

2024
Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports of Lung Cancer Screening Patients Using Transformer Models.
J. Heal. Informatics Res., September, 2024

Deep learning for identifying personal and family history of suicidal thoughts and behaviors from EHRs.
npj Digit. Medicine, 2024

Identifying social determinants of health from clinical narratives: A study of performance, documentation ratio, and potential bias.
J. Biomed. Informatics, 2024

Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction.
J. Biomed. Informatics, 2024

Generative large language models are all-purpose text analytics engines: text-to-text learning is all your need.
J. Am. Medical Informatics Assoc., 2024

Use of natural language processing to extract and classify papillary thyroid cancer features from surgical pathology reports.
CoRR, 2024

Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning.
CoRR, 2024

Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure.
CoRR, 2024

Feasibility of Identifying Factors Related to Alzheimer's Disease and Related Dementia in Real-World Data.
CoRR, 2024

Me LLaMA: Foundation Large Language Models for Medical Applications.
CoRR, 2024

Unveiling Fall Risk Factors: Nurse-Driven Corpus Development for Natural Language Processing.
Proceedings of the Innovation in Applied Nursing Informatics, 2024

Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing.
Proceedings of the 12th IEEE International Conference on Healthcare Informatics, 2024

Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

UF-HOBI at "Discharge Me!": A Hybrid Solution for Discharge Summary Generation Through Prompt-based Tuning of GatorTronGPT Models.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

2023
The role of health system penetration rate in estimating the prevalence of type 1 diabetes in children and adolescents using electronic health records.
J. Am. Medical Informatics Assoc., December, 2023

Leveraging natural language processing to identify eligible lung cancer screening patients with the electronic health record.
Int. J. Medical Informatics, September, 2023

Clinical concept and relation extraction using prompt-based machine reading comprehension.
J. Am. Medical Informatics Assoc., August, 2023

Contextualized medication information extraction using Transformer-based deep learning architectures.
J. Biomed. Informatics, June, 2023

A study of generative large language model for medical research and healthcare.
npj Digit. Medicine, 2023

Assess the documentation of cognitive tests and biomarkers in electronic health records via natural language processing for Alzheimer's disease and related dementias.
Int. J. Medical Informatics, 2023

On the Impact of Cross-Domain Data on German Language Models.
CoRR, 2023

Extracting Thyroid Nodules Characteristics from Ultrasound Reports Using Transformer-based Natural Language Processing Methods.
CoRR, 2023

Real-World Effectiveness of Lung Cancer Screening Using Deep Learning-Based Counterfactual Prediction.
Proceedings of the MEDINFO 2023 - The Future Is Accessible, 2023

Exploring the Effect of Eligibility Criteria on AD Severity and Severe Adverse Event in Eligible Patients.
Proceedings of the 11th IEEE International Conference on Healthcare Informatics, 2023

On the Impact of Cross-Domain Data on German Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
A large language model for electronic health records.
npj Digit. Medicine, 2022

Identify diabetic retinopathy-related clinical concepts and their attributes using transformer-based natural language processing methods.
BMC Medical Informatics Decis. Mak., 2022

Biases in using social media data for public health surveillance: A scoping review.
Int. J. Medical Informatics, 2022

SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies.
CoRR, 2022

GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records.
CoRR, 2022

Identify Cancer Patients at Risk for Heart Failure using Electronic Health Record and Genetic Data.
Proceedings of the 10th IEEE International Conference on Healthcare Informatics, 2022

A Preliminary Study of Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports Using Natural Language Processing.
Proceedings of the 10th IEEE International Conference on Healthcare Informatics, 2022

2021
Extracting social determinants of health from electronic health records using natural language processing: a systematic review.
J. Am. Medical Informatics Assoc., 2021

Clinical Relation Extraction Using Transformer-based Models.
CoRR, 2021

Identify Diabetic Retinopathy-related Clinical Concepts Using Transformer-based Natural Language Processing Methods.
Proceedings of the 9th IEEE International Conference on Healthcare Informatics, 2021

Transformer-based named entity recognition for parsing clinical trial eligibility criteria.
Proceedings of the BCB '21: 12th ACM International Conference on Bioinformatics, 2021

Data and Model Biases in Social Media Analyses: A Case Study of COVID-19 Tweets.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Developing an Ontology for Social and Behavioral Determinants of Health.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

2020
Clinical concept extraction using transformers.
J. Am. Medical Informatics Assoc., 2020

Identifying relations of medications with adverse drug events using recurrent convolutional neural networks and gradient boosting.
J. Am. Medical Informatics Assoc., 2020

Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data.
J. Am. Medical Informatics Assoc., 2020

Identification of important factors in an inpatient fall risk prediction model to improve the quality of care using EHR and electronic administrative data: A machine-learning approach.
Int. J. Medical Informatics, 2020

Assessing mental health signals among sexual and gender minorities using Twitter data.
Health Informatics J., 2020

A Natural Language Processing Tool to Extract Quantitative Smoking Status from Clinical Narratives.
Proceedings of the 8th IEEE International Conference on Healthcare Informatics, 2020

Developing and Validating a Computable Phenotype for the Identification of Transgender and Gender Nonconforming Individuals and Subgroups.
Proceedings of the AMIA 2020, 2020

2019
A study of deep learning methods for de-identification of clinical notes in cross-institute settings.
BMC Medical Informatics Decis. Mak., 2019

Applying a deep learning-based sequence labeling approach to detect attributes of medical concepts in clinical text.
BMC Medical Informatics Decis. Mak., 2019

Time-sensitive clinical concept embeddings learned from large electronic health records.
BMC Medical Informatics Decis. Mak., 2019

A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Detect Attributes of Medical Concepts via Sequence Labeling.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods.
Proceedings of the AMIA 2019, 2019

2018
A study of generalizability of recurrent neural network-based predictive models for heart failure onset risk using a large and heterogeneous EHR data set.
J. Biomed. Informatics, 2018

CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines.
J. Am. Medical Informatics Assoc., 2018

PIE: A prior knowledge guided integrated likelihood estimation method for bias reduction in association studies using electronic health records data.
J. Am. Medical Informatics Assoc., 2018

Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches.
Int. J. Medical Informatics, 2018

Detecting Medications and Adverse Drug Events in Clinical Notes Using Recurrent Neural Networks.
Proceedings of the 1st International Workshop on Medication and Adverse Drug Event Detection, 2018

Assessing Mental Health Signals Among Sexual and Gender Minorities using Twitter Data.
Proceedings of the IEEE International Conference on Healthcare Informatics Workshops, 2018

Computable Eligibility Criteria through Ontology-driven Data Access: A Case Study of Hepatitis C Virus Trials.
Proceedings of the AMIA 2018, 2018

Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.
Proceedings of the AMIA 2018, 2018

2017
A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).
J. Am. Medical Informatics Assoc., 2017

Comparing Cancer Information Needs for Consumers in the US and China.
Proceedings of the MEDINFO 2017: Precision Healthcare through Informatics, 2017

A comparative study of different methods for automatic identification of clopidogrel-induced bleedings in electronic health records.
Proceedings of the Summit on Clinical Research Informatics, 2017

Evaluating Word Embeddings from Multiple Domains for Symptom Recognition in Psychiatric Notes.
Proceedings of the AMIA 2017, 2017

Detecting Body Location Modifiers of Disorders in Clinical Texts via Sequence Labeling.
Proceedings of the AMIA 2017, 2017

Detecting Contradictory and Consistent Citations in Biomedical Literature.
Proceedings of the AMIA 2017, 2017

CLAMP - A User-Centric Clinical Natural Language Processing Toolkit.
Proceedings of the AMIA 2017, 2017

Clinical Named Entity Recognition Using Deep Learning Models.
Proceedings of the AMIA 2017, 2017

2016
Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov.
J. Am. Medical Informatics Assoc., 2016

Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning.
Database J. Biol. Databases Curation, 2016

CD-REST: a system for extracting chemical-induced disease relation in literature.
Database J. Biol. Databases Curation, 2016

UTHealth at SemEval-2016 Task 12: an End-to-End System for Temporal Information Extraction from Clinical Notes.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

What Can Neural Networks Learn from Unlabeled Clinical Narratives?
Proceedings of the AMIA 2016, 2016

An Empirical Study for Impacts of Measurement Errors on EHR based Association Studies.
Proceedings of the AMIA 2016, 2016

2015
Deciphering Signaling Pathway Networks to Understand the Molecular Mechanisms of Metformin Action.
PLoS Comput. Biol., 2015

A comparison of conditional random fields and structured support vector machines for chemical entity recognition in biomedical literature.
J. Cheminformatics, 2015

UTH-CCB: The Participation of the SemEval 2015 Challenge - Task 14.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.
Proceedings of the MEDINFO 2015: eHealth-enabled Health, 2015

Clinical Abbreviation Disambiguation Using Neural Word Embeddings.
Proceedings of the Workshop on Biomedical Natural Language Processing, BioNLP@IJCNLP 2015, 2015

A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.
Proceedings of the AMIA 2015, 2015

Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods.
Proceedings of the AMIA 2015, 2015

Clinical Language Annotation, Modeling, and Processing Toolkit (CLAMP) - a user-centric NLP system.
Proceedings of the AMIA 2015, 2015

Citation Sentiment Analysis in Clinical Trial Papers.
Proceedings of the AMIA 2015, 2015

2014
UTH_CCB: A report for SemEval 2014 - Task 7 Analysis of Clinical Text.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Domain Adaptation for Semantic Role Labeling of Clinical Text.
Proceedings of the AMIA 2014, 2014

Development of a Unified Computable Problem-Medication Knowledge base.
Proceedings of the AMIA 2014, 2014

2013
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.
BMC Medical Informatics Decis. Mak., 2013

A hybrid system for temporal information extraction from clinical text.
J. Am. Medical Informatics Assoc., 2013

Analyzing Differences between Chinese and English Clinical Text: A Cross-Institution Comparison of Discharge Summaries in Two Languages.
Proceedings of the MEDINFO 2013, 2013

Clinical Acronym/Abbreviation Normalization using a Hybrid Approach.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013

Recognizing and Encoding Disorder Concepts in Clinical Text using Machine Learning and Vector Space Model.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013

A prototype application for real-time recognition and disambiguation of clinical abbreviations.
Proceedings of the 7th International Workshop on Data and Text Mining in Bioinformatics, 2013

Building a Large Clinical Abbreviation Sense Inventory from Discharge Summaries.
Proceedings of the AMIA 2013, 2013

2012
A new clustering method for detecting rare senses of abbreviations in clinical notes.
J. Biomed. Informatics, 2012

Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs.
J. Am. Medical Informatics Assoc., 2012

DTome: a web-based tool for drug-target interactome construction.
BMC Bioinform., 2012

Ranking Gene-Drug Relationships in Biomedical Literature Using Latent Dirichlet Allocation.
Proceedings of the Biocomputing 2012: Proceedings of the Pacific Symposium, 2012

Detecting Adverse Drug Reactions Using Inpatient Medication Orders and Laboratory Tests Data.
Proceedings of the 2012 IEEE Second International Conference on Healthcare Informatics, 2012

Clinical entity recognition using structural support vector machines with rich features.
Proceedings of the ACM sixth international workshop on Data and text mining in biomedical informatics, 2012

A comparative study of current clinical natural language processing systems on handling abbreviations in discharge summaries.
Proceedings of the AMIA 2012, 2012

Clinical Entity Recognition Using Structural Support Vector Machines.
Proceedings of the AMIA 2012, 2012

MedEx-UIMA - An Open-Source System for Medication Information Extraction from Clinical Text.
Proceedings of the AMIA 2012, 2012

2010
On-line Hot Topic Recommendation Using Tolerance Rough Set Based Topic Clustering.
J. Comput., 2010

Topic Detection by Topic Model Induced Distance Using Biased Initiation.
Proceedings of the Advances in Computer Science and Information Technology, 2010

Topic based automatic news recommendation using topic model and affinity propagation.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

2009
STRank: A SiteRank Algorithm Using Semantic Relevance and Time Frequency.
Proceedings of the IEEE International Conference on Systems, 2009

2008
Genre identification of Chinese finance text using machine learning method.
Proceedings of the IEEE International Conference on Systems, 2008

