Sheng Yu

Orcid: 0000-0002-6347-0507

Affiliations:
  • Tsinghua University, Center for Statistical Science / Department of Industrial Engineering, Beijing, China
  • Harvard Medical School, Boston, MA, USA (2013 - 2015)
  • Brigham and Women's Hospital, Boston, MA, USA (2012 - 2015)
  • George Washington University, DC, USA (PhD 2012)


According to our database1, Sheng Yu authored at least 43 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction.
CoRR, 2024

2023
Building a trustworthy AI differential diagnosis application for Crohn's disease and intestinal tuberculosis.
BMC Medical Informatics Decis. Mak., December, 2023

A machine learning method for improving liver cancer staging.
J. Biomed. Informatics, January, 2023

Multimodal learning on graphs for disease relation extraction.
J. Biomed. Informatics, 2023

Biomedical Question Answering: A Survey of Approaches and Challenges.
ACM Comput. Surv., 2023

High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models.
CoRR, 2023

CoRTEx: Contrastive Learning for Representing Terms via Explanations with Applications on Constructing Biomedical Knowledge Graphs.
CoRR, 2023

EHRDiff: Exploring Realistic EHR Synthesis with Diffusion Models.
CoRR, 2023

2022
CODER: Knowledge-infused cross-lingual medical term embedding for term normalization.
J. Biomed. Informatics, 2022

BIOS: An Algorithmically Generated Biomedical Knowledge Graph.
CoRR, 2022

Semi-constraint Optimal Transport for Entity Alignment with Dangling Cases.
CoRR, 2022

PMC-Patients: A Large-scale Dataset of Patient Notes and Relations Extracted from Case Reports in PubMed Central.
CoRR, 2022

Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Automatic Biomedical Term Clustering by Learning Fine-grained Term Representations.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

An Accurate Unsupervised Method for Joint Entity Alignment and Dangling Entity Detection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
A Method for Generating Synthetic Electronic Medical Record Text.
IEEE ACM Trans. Comput. Biol. Bioinform., 2021

Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification.
CoRR, 2021

Sentence Alignment with Parallel Documents Helps Biomedical Machine Translation.
CoRR, 2021

Biomedical Question Answering: A Comprehensive Review.
CoRR, 2021

2020
Can natural language processing help differentiate inflammatory intestinal diseases in China? Models applying random forest and convolutional neural network approaches.
BMC Medical Informatics Decis. Mak., 2020

Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition.
J. Biomed. Informatics, 2020

Long-distance disorder-disorder relation extraction with bootstrapped noisy data.
J. Biomed. Informatics, 2020

Developing an automated mechanism to identify medical articles from wikipedia for knowledge extraction.
Int. J. Medical Informatics, 2020

Automated ICD coding via unsupervised knowledge integration (UNITE).
Int. J. Medical Informatics, 2020

CODER: Knowledge infused cross-lingual medical term embedding for term normalization.
CoRR, 2020

High-throughput relation extraction algorithm development associating knowledge articles and electronic health records.
CoRR, 2020

2019
Feature extraction for phenotyping from semantic and knowledge resources.
J. Biomed. Informatics, 2019

High-throughput multimodal automated phenotyping (MAP) with application to PheWAS.
J. Am. Medical Informatics Assoc., 2019

Long distance entity relation extraction with article structure embedding and applied to mining medical knowledge.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

2018
Enabling phenotypic big data with PheNorm.
J. Am. Medical Informatics Assoc., 2018

PheProb: probabilistic phenotyping using diagnosis codes to improve power for genetic association studies.
J. Am. Medical Informatics Assoc., 2018

Word Segmentation as Graph Partition.
CoRR, 2018

Generation of Synthetic Electronic Medical Record Text.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

High-Throughput Multimodal Automated Phenotyping (MAP) Incorporating Natural Language Processing with Application to PheWAS.
Proceedings of the AMIA 2018, 2018

2017
Surrogate-assisted feature extraction for high-throughput phenotyping.
J. Am. Medical Informatics Assoc., 2017

High-throughput Phenotyping via Denoised Normal Mixture Transformation.
Proceedings of the AMIA 2017, 2017

2015
Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources.
J. Am. Medical Informatics Assoc., 2015

Computable Phenotypes enabled by the i2b2 Validation Platform.
Proceedings of the AMIA 2015, 2015

Demonstrating the Advantages of Applying Data Mining Techniques on Time-Dependent Electronic Medical Records.
Proceedings of the AMIA 2015, 2015

2014
Classification of CT pulmonary angiography reports by presence, chronicity, and location of pulmonary embolism with natural language processing.
J. Biomed. Informatics, 2014

2013
A Short Introduction to MiniNLP.
CoRR, 2013


  Loading...