Shan Chen

Orcid: 0000-0001-7999-7410

Affiliations:
  • Harvard Medical School, Mass General Brigham, Artificial Intelligence in Medicine (AIM) Program, Boston, MA, USA
  • Boston Children's Hospital, Computational Health Informatics Program, Boston, MA, USA


According to our database1, Shan Chen authored at least 25 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
KScope: A Framework for Characterizing the Knowledge Status of Language Models.
CoRR, June, 2025

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use.
CoRR, May, 2025

Medical Hallucinations in Foundation Models and Their Impact on Healthcare.
CoRR, March, 2025

Sparse Autoencoder Features for Classifications and Transferability.
CoRR, February, 2025

LCD benchmark: long clinical document benchmark on mortality prediction for language models.
J. Am. Medical Informatics Assoc., 2025

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
Large language models to identify social determinants of health in electronic health records.
npj Digit. Medicine, 2024

Evaluating the ChatGPT family of models for biomedical reasoning and classification.
J. Am. Medical Informatics Assoc., 2024

The use of large language models to enhance cancer clinical trial educational materials.
CoRR, 2024

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?
CoRR, 2024

Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability.
CoRR, 2024

Mapping Bias in Vision Language Models: Signposts, Pitfalls, and the Road Ahead.
CoRR, 2024

Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation.
CoRR, 2024

AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow.
CoRR, 2024

Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data.
CoRR, 2024

Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
The impact of using an AI chatbot to respond to patient messages.
CoRR, 2023

Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly.
CoRR, 2023

Large Language Models to Identify Social Determinants of Health in Electronic Health Records.
CoRR, 2023

Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification.
CoRR, 2023

Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy.
CoRR, 2023

Measuring Pointwise \mathcalV-Usable Information In-Context-ly.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2021
BCH-NLP at BioCreative VII Track 3: medications detection in tweets using transformer networks and multi-task learning.
CoRR, 2021


  Loading...