Danielle S. Bitterman

According to our database1, Danielle S. Bitterman authored at least 39 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Foundation Artificial Intelligence Models for Health Recognition Using Face Photographs (FAHR-Face).
CoRR, June, 2025

KScope: A Framework for Characterizing the Knowledge Status of Language Models.
CoRR, June, 2025

When Models Reason in Your Language: Controlling Thinking Trace Language Comes at the Cost of Accuracy.
CoRR, May, 2025

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use.
CoRR, May, 2025

Sparse Autoencoder Features for Classifications and Transferability.
CoRR, February, 2025

Regulatory Science Innovation for Generative AI and Large Language Models in Health and Medicine: A Global Call for Action.
CoRR, February, 2025

Preventing unrestricted and unmonitored AI experimentation in healthcare through transparency and accountability.
npj Digit. Medicine, 2025

LCD benchmark: long clinical document benchmark on mortality prediction for language models.
J. Am. Medical Informatics Assoc., 2025

Collaborative large language models for automated data extraction in living systematic reviews.
J. Am. Medical Informatics Assoc., 2025

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Do They Really Know? Evaluating Large Language Models' Ability to Reference and Cite Oncology Guidelines.
Proceedings of the Artificial Intelligence in Medicine - 23rd International Conference, 2025

Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Large language models to identify social determinants of health in electronic health records.
npj Digit. Medicine, 2024

Ethical debates amidst flawed healthcare artificial intelligence metrics.
npj Digit. Medicine, 2024

Evaluating the ChatGPT family of models for biomedical reasoning and classification.
J. Am. Medical Informatics Assoc., 2024

The use of large language models to enhance cancer clinical trial educational materials.
CoRR, 2024

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?
CoRR, 2024

Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability.
CoRR, 2024

Mapping Bias in Vision Language Models: Signposts, Pitfalls, and the Road Ahead.
CoRR, 2024

Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation.
CoRR, 2024

Safety challenges of AI in medicine.
CoRR, 2024

AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow.
CoRR, 2024

Retrieval-Augmented Generation for Generative Artificial Intelligence in Medicine.
CoRR, 2024

Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources.
CoRR, 2024

Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data.
CoRR, 2024

Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Seeing Beyond Borders: Evaluating LLMs in Multilingual Ophthalmological Question Answering.
Proceedings of the 12th IEEE International Conference on Healthcare Informatics, 2024

When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
The impact of using an AI chatbot to respond to patient messages.
CoRR, 2023

Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly.
CoRR, 2023

Considerations for health care institutions training large language models on electronic health records.
CoRR, 2023

Large Language Models to Identify Social Determinants of Health in Electronic Health Records.
CoRR, 2023

Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification.
CoRR, 2023

Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy.
CoRR, 2023

Measuring Pointwise \mathcalV-Usable Information In-Context-ly.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Classifying unstructured electronic consult messages to understand primary care physician specialty information needs.
J. Am. Medical Informatics Assoc., 2022

2021
Deep-learning system to improve the quality and efficiency of volumetric heart segmentation for breast cancer.
npj Digit. Medicine, 2021

2020
Extracting Relations between Radiotherapy Treatment Details.
Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020


  Loading...