Nigam H. Shah

Orcid: 0000-0001-9385-7158

Affiliations:
  • Stanford University, CA, USA
  • Pennsylvania State University, University Park, PA, USA (former)


According to our database1, Nigam H. Shah authored at least 193 papers between 1996 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Characterizing the limitations of using diagnosis codes in the context of machine learning for healthcare.
BMC Medical Informatics Decis. Mak., December, 2024

Standing on FURM ground - A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems.
CoRR, 2024

Zero-Shot Clinical Trial Patient Matching with LLMs.
CoRR, 2024


2023
Self-supervised machine learning using adult inpatient data produces effective models for pediatric clinical prediction tasks.
J. Am. Medical Informatics Assoc., November, 2023

DEPLOYR: a technical framework for deploying custom real-time machine learning models into the electronic medical record.
J. Am. Medical Informatics Assoc., August, 2023

Clinical utility gains from incorporating comorbidity and geographic location information into risk estimation equations for atherosclerotic cardiovascular disease.
J. Am. Medical Informatics Assoc., April, 2023

A framework to identify ethical concerns with ML-guided care workflows: a case study of mortality prediction to guide advance care planning.
J. Am. Medical Informatics Assoc., April, 2023

APLUS: A Python library for usefulness simulations of machine learning models in healthcare.
J. Biomed. Informatics, March, 2023

Assessing the net benefit of machine learning models in the presence of resource constraints.
J. Am. Medical Informatics Assoc., March, 2023

The shaky foundations of large language models and foundation models for electronic health records.
npj Digit. Medicine, 2023

Principled estimation and evaluation of treatment effect heterogeneity: A case study application to dabigatran for patients with atrial fibrillation.
J. Biomed. Informatics, 2023

A Multi-Center Study on the Adaptability of a Shared Foundation Model for Electronic Health Records.
CoRR, 2023

INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis.
CoRR, 2023

Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature.
CoRR, 2023

All models are local: time to replace external validation with recurrent local validation.
CoRR, 2023

Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery.
CoRR, 2023

The Shaky Foundations of Clinical Foundation Models: A Survey of Large Language Models and Foundation Models for EMRs.
CoRR, 2023

Self-Supervised Time-to-Event Modeling with Structured Medical Records.
CoRR, 2023

EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

INSPECT: A Multimodal Dataset for Patient Outcome Prediction of Pulmonary Embolisms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Diagnosis Assignment Using Unstructured Clinical Notes.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Considerations in the reliability and fairness audits of predictive models for advance care planning.
Frontiers Digit. Health, 2022

Instability in clinical risk stratification models using deep learning.
Proceedings of the Machine Learning for Health, 2022

Net benefit, calibration, threshold selection, and training objectives for algorithmic fairness in healthcare.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

Predicting patients who are likely to develop Lupus Nephritis of those newly diagnosed with Systemic Lupus Erythematosus.
Proceedings of the AMIA 2022, 2022

2021
Summarizing Patients Like Mine via an On-demand Consultation Service.
Proc. VLDB Endow., 2021

An informatics consult approach for generating clinical evidence for treatment decisions.
BMC Medical Informatics Decis. Mak., 2021

Language models are an effective representation learning technique for electronic health record data.
J. Biomed. Informatics, 2021

An empirical characterization of fair machine learning for clinical risk prediction.
J. Biomed. Informatics, 2021

Improving hospital readmission prediction using individualized utility analysis.
J. Biomed. Informatics, 2021

Learning decision thresholds for risk stratification models from aggregate clinician behavior.
J. Am. Medical Informatics Assoc., 2021

A survey of extant organizational and computational setups for deploying predictive models in health systems.
J. Am. Medical Informatics Assoc., 2021

A framework for making predictive models useful in practice.
J. Am. Medical Informatics Assoc., 2021

Corrigendum: Conflicting information from the Food and Drug Administration: Missed opportunity to lead standards for safe and effective medical artificial intelligence solutions.
J. Am. Medical Informatics Assoc., 2021

Conflicting information from the Food and Drug Administration: Missed opportunity to lead standards for safe and effective medical artificial intelligence solutions.
J. Am. Medical Informatics Assoc., 2021

Automated model versus treating physician for predicting survival time of patients with metastatic cancer.
J. Am. Medical Informatics Assoc., 2021

ACE: the Advanced Cohort Engine for searching longitudinal patient records.
J. Am. Medical Informatics Assoc., 2021

Computational drug repositioning of atorvastatin for ulcerative colitis.
J. Am. Medical Informatics Assoc., 2021

RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR.
CoRR, 2021

A comparison of approaches to improve worst-case predictive model performance over patient subpopulations.
CoRR, 2021

Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine.
Appl. Clin. Inform., 2021

Multi-Modal Data Science for Healthcare: State of the Art, Challenges, and Opportunities.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

2020
Assessing the accuracy of automatic speech recognition for psychotherapy.
npj Digit. Medicine, 2020

Developing a delivery science for artificial intelligence in healthcare.
npj Digit. Medicine, 2020

Estimating the efficacy of symptom-based screening for COVID-19.
npj Digit. Medicine, 2020

Deep phenotyping: Embracing complexity and temporality - Towards scalability, portability, and interoperability.
J. Biomed. Informatics, 2020

Development and validation of phenotype classifiers across multiple sites in the observational health data sciences and informatics network.
J. Am. Medical Informatics Assoc., 2020

Measure what matters: Counts of hospitalized patients are a better metric for health system capacity planning for a reopening.
J. Am. Medical Informatics Assoc., 2020

MINIMAR (MINimum Information for Medical AI Reporting): Developing reporting standards for artificial intelligence in health care.
J. Am. Medical Informatics Assoc., 2020

Trove: Ontology-driven weak supervision for medical entity classification.
CoRR, 2020

A new paradigm for accelerating clinical data science at Stanford Medicine.
CoRR, 2020

Language Models Are An Effective Patient Representation Learning Technique For Electronic Health Record Data.
CoRR, 2020

Normalizing Clinical Document Titles to LOINC Document Ontology: an Initial Study.
Proceedings of the AMIA 2020, 2020

Data Quality Assessment of Laboratory Data.
Proceedings of the AMIA 2020, 2020

2019
It is time to learn from patients like mine.
npj Digit. Medicine, 2019

Medical device surveillance with electronic health records.
npj Digit. Medicine, 2019

Finding missed cases of familial hypercholesterolemia in health systems using machine learning.
npj Digit. Medicine, 2019

Predicting need for advanced illness or palliative care in a primary care population using electronic health record data.
J. Biomed. Informatics, 2019

The number needed to benefit: estimating the value of predictive analytics in healthcare.
J. Am. Medical Informatics Assoc., 2019

The accuracy vs. coverage trade-off in patient-facing diagnosis models.
CoRR, 2019

Missingness as Stability: Understanding the Structure of Missingness in Longitudinal EHR data and its Impact on Reinforcement Learning in Healthcare.
CoRR, 2019

A Semi-Supervised Machine Learning Approach to Detecting Recurrent Metastatic Breast Cancer Cases Using Linked Cancer Registry and Electronic Medical Record Data.
CoRR, 2019

Countdown Regression: Sharp and Calibrated Survival Predictions.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data.
Proceedings of the Biocomputing 2019: Proceedings of the Pacific Symposium, 2019

Counterfactual Reasoning for Fair Clinical Risk Prediction.
Proceedings of the Machine Learning for Healthcare Conference, 2019

Creating Fair Models of Atherosclerotic Cardiovascular Disease Risk.
Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

2018
Biomedical Data/Content Acquisition, Curation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Scalable and accurate deep learning with electronic health records.
npj Digit. Medicine, 2018

Improving palliative care with deep learning.
BMC Medical Informatics Decis. Mak., 2018

Call for papers: Deep phenotyping for Precision Medicine.
J. Biomed. Informatics, 2018

An evaluation of clinical order patterns machine-learned from clinician cohorts stratified by patient mortality outcomes.
J. Biomed. Informatics, 2018

Predicting Inpatient Discharge Prioritization With Electronic Health Records.
CoRR, 2018

Countdown Regression: Sharp and Calibrated Survival Predictions.
CoRR, 2018

General-purpose validation and model selection when estimating individual treatment effects.
CoRR, 2018

Scalable and accurate deep learning for electronic health records.
CoRR, 2018

Addressing vital sign alarm fatigue using personalized alarm thresholds.
Proceedings of the Biocomputing 2018: Proceedings of the Pacific Symposium, 2018

Session introduction.
Proceedings of the Biocomputing 2018: Proceedings of the Pacific Symposium, 2018

Identifying cases of metastatic prostate cancer using machine learning on electronic health records.
Proceedings of the AMIA 2018, 2018

Transfer learning to adapt predictive models for pediatric patients in the EHR.
Proceedings of the AMIA 2018, 2018

Treatment Pathways in Patients with Cancer Using a Large-scale Observational Data Network.
Proceedings of the AMIA 2018, 2018

2017
Toward multimodal signal detection of adverse drug reactions.
J. Biomed. Informatics, 2017

Synergistic drug combinations from electronic health records and gene expression.
J. Am. Medical Informatics Assoc., 2017

Learning Effective Representations from Clinical Notes.
CoRR, 2017

Open Data for Discovery Science.
Proceedings of the Biocomputing 2017: Proceedings of the Pacific Symposium, 2017

Learning Attributes of Disease Progression from Trajectories of Sparse Lab Values.
Proceedings of the Biocomputing 2017: Proceedings of the Pacific Symposium, 2017

The Effectiveness of Transfer Learning in Electronic Health Records Data.
Proceedings of the 5th International Conference on Learning Representations, 2017

A novel propensity modeling approach to estimate the causal impact of acute organ dysfunction on long-term survival in sepsis.
Proceedings of the Summit on Clinical Research Informatics, 2017

Electronic phenotyping with APHRODITE and the Observational Health Sciences and Informatics (OHDSI) data network.
Proceedings of the Summit on Clinical Research Informatics, 2017

Quantifying the relative change in physical activity after Total Knee Arthroplasty using accelerometer based measurements.
Proceedings of the Summit on Clinical Research Informatics, 2017

Impact of Clinician Experience on Machine Learned Clinical Order Patterns.
Proceedings of the AMIA 2017, 2017

From Large-Scale Network Analytics to Clinical Solutions in OHDSI.
Proceedings of the AMIA 2017, 2017

2016
An unsupervised learning method to identify reference intervals from a clinical database.
J. Biomed. Informatics, 2016

Harnessing next-generation informatics for personalizing medicine: a report from AMIA's 2014 Health Policy Invitational Meeting.
J. Am. Medical Informatics Assoc., 2016

Learning statistical models of phenotypes using noisy labeled training data.
J. Am. Medical Informatics Assoc., 2016

Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature.
BMC Bioinform., 2016

Thematic issue of the Second combined Bio-ontologies and Phenotypes Workshop.
J. Biomed. Semant., 2016

RegenBase: a knowledge base of spinal cord injury biology for translational research.
Database J. Biol. Databases Curation, 2016

The digital revolution in phenotyping.
Briefings Bioinform., 2016

Discovering Patient Phenotypes Using Generalized Low Rank Models.
Proceedings of the Biocomputing 2016: Proceedings of the Pacific Symposium, 2016

Session Introduction.
Proceedings of the Biocomputing 2016: Proceedings of the Pacific Symposium, 2016

Predicting Emergency Department Visits.
Proceedings of the Summit on Clinical Research Informatics, 2016

A Pilot Study of the Integration of a Quantified-self Wearable Device with EMR data in the Acute Postoperative Setting.
Proceedings of the Summit on Clinical Research Informatics, 2016

Observational Health Data Sciences and Informatics (OHDSI): A Rapidly Growing International Network for Open Science and Data Analytics in Healthcare.
Proceedings of the Summit on Clinical Research Informatics, 2016

Predicting hospital visits from geo-tagged Internet search logs.
Proceedings of the Summit on Clinical Research Informatics, 2016

Learning Effective Treatment Pathways for Type-2 Diabetes from a clinical data warehouse.
Proceedings of the AMIA 2016, 2016

Big Data for Healthcare and Life Sciences: Learning Useful Insights from Imperfect Data.
Proceedings of the AMIA 2016, 2016

Ensuring Reproducibility in Observational Research: Building and Sharing Knowledge Resources in the OHDSI Network.
Proceedings of the AMIA 2016, 2016

2015
A formal concept analysis and semantic query expansion cooperation to refine health outcomes of interest.
BMC Medical Informatics Decis. Mak., 2015

Implications of non-stationarity on predictive modeling using EHRs.
J. Biomed. Informatics, 2015

A method for systematic discovery of adverse drug events from clinical notes.
J. Am. Medical Informatics Assoc., 2015

Functional evaluation of out-of-the-box text-mining tools for data-mining tasks.
J. Am. Medical Informatics Assoc., 2015

Special issue on bio-ontologies and phenotypes.
J. Biomed. Semant., 2015

Provenance-Centered Dataset of Drug-Drug Interactions.
Proceedings of the Semantic Web - ISWC 2015, 2015

Analyzing Search Behavior of Healthcare Professionals for Drug Safety Surveillance.
Proceedings of the Biocomputing 2015: Proceedings of the Pacific Symposium, 2015

Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.
Proceedings of the MEDINFO 2015: eHealth-enabled Health, 2015

The Value of an Open-Source Observational Research Collaboratory: Results from the OHDSI Initiative.
Proceedings of the AMIA 2015, 2015

Recent Advances in Computational Drug Repositioning.
Proceedings of the AMIA 2015, 2015

2014
Mining clinical text for signals of adverse drug-drug interactions.
J. Am. Medical Informatics Assoc., 2014

Toward personalizing treatment for depression: predicting diagnosis and severity.
J. Am. Medical Informatics Assoc., 2014

Selected papers from the 16th Annual Bio-Ontologies Special Interest Group Meeting.
J. Biomed. Semant., 2014

Finding progression stages in time-evolving event sequences.
Proceedings of the 23rd International World Wide Web Conference, 2014

Session introduction.
Proceedings of the Biocomputing 2014: Proceedings of the Pacific Symposium, 2014

Medicine in the age of electronic health records.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

2013
Web-scale pharmacovigilance: listening to signals from the crowd.
J. Am. Medical Informatics Assoc., 2013

Combing signals from spontaneous reports and electronic health records for detection of adverse drug reactions.
J. Am. Medical Informatics Assoc., 2013

STOP using just GO: a multi-ontology hypothesis generation tool for high throughput experimentation.
BMC Bioinform., 2013

Selected papers from the 15th Annual Bio-Ontologies Special Interest Group Meeting.
J. Biomed. Semant., 2013

Session introduction.
Proceedings of the Biocomputing 2013: Proceedings of the Pacific Symposium, 2013

Empirical bayes model to combine signals of adverse drug reactions.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Mining Biomedical Ontologies and Data Using RDF Hypergraphs.
Proceedings of the 12th International Conference on Machine Learning and Applications, 2013

Refining health outcomes of interest using formal concept analysis and semantic query expansion.
Proceedings of the Proceeding of the 7rd International Workshop on Data and Text Mining in Bioinformatics, 2013

Learning Practice-based Evidence from Unstructured Clinical Notes.
Proceedings of the AMIA 2013, 2013

Predictive Models in Mental Health: From Diagnosis to Treatment.
Proceedings of the AMIA 2013, 2013

2012
Changing computational research. The challenges ahead.
Source Code Biol. Medicine, 2012

Chapter 9: Analyses Using Disease Ontologies.
PLoS Comput. Biol., 2012

Unified Medical Language System term occurrences in clinical notes: a large-scale corpus analysis.
J. Am. Medical Informatics Assoc., 2012

The coming age of data-driven medicine: translational bioinformatics' next frontier.
J. Am. Medical Informatics Assoc., 2012

The National Center for Biomedical Ontology.
J. Am. Medical Informatics Assoc., 2012

Using ontology-based annotation to profile disease research.
J. Am. Medical Informatics Assoc., 2012

Selected papers from the 14th Annual Bio-Ontologies Special Interest Group Meeting.
J. Biomed. Semant., 2012

Annotation Analysis for Testing Drug Safety Signals using Unstructured Clinical Notes.
J. Biomed. Semant., 2012

Mining the pharmacogenomics literature - a survey of the state of the art.
Briefings Bioinform., 2012

Session introduction.
Proceedings of the Biocomputing 2012: Proceedings of the Pacific Symposium, 2012

Performance of Left Outer Join on Hadoop with Right Side within Single Node Memory Size.
Proceedings of the 26th International Conference on Advanced Information Networking and Applications Workshops, 2012

2011
NCBO Resource Index: Ontology-based search and mining of biomedical resources.
J. Web Semant., 2011

BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications.
Nucleic Acids Res., 2011

Enabling enrichment analysis with the Human Disease Ontology.
J. Biomed. Informatics, 2011

Computationally translating molecular discoveries into tools for medicine: translational bioinformatics articles now featured in <i>JAMIA</i>.
J. Am. Medical Informatics Assoc., 2011

Mapping between the OBO and OWL ontology languages.
J. Biomed. Semant., 2011

Selected papers from the 13th Annual Bio-Ontologies Special Interest Group Meeting.
J. Biomed. Semant., 2011

Integration and publication of heterogeneous text-mined relationships on the Semantic Web.
J. Biomed. Semant., 2011

HyQue: evaluating hypotheses using Semantic Web technologies.
J. Biomed. Semant., 2011

Workshop Introduction.
Proceedings of the Biocomputing 2011: Proceedings of the Pacific Symposium, 2011

Bioportal: Ontologies and Integrated Data Resources at the Click of a Mouse.
Proceedings of the 2nd International Conference on Biomedical Ontology, 2011

The NCBO Annotator: Ontology-Based Annotation as a Web Service.
Proceedings of the 2nd International Conference on Biomedical Ontology, 2011

The Age of Data-Driven Medicine: Mining the Electronic Health Record.
Proceedings of the 2nd International Conference on Biomedical Ontology, 2011

Proposed SKOS Extensions for BioPortal Terminology Services.
Proceedings of the Semantic Web - Joint International Semantic Technology Conference, 2011

2010
Using text to build semantic networks for pharmacogenomics.
J. Biomed. Informatics, 2010

Selected papers from the 12<sup>th</sup> annual Bio-Ontologies meeting.
J. Biomed. Semant., 2010

Building a biomedical ontology recommender web service.
J. Biomed. Semant., 2010

A UIMA wrapper for the NCBO annotator.
Bioinform., 2010

Optimize First, Buy Later: Analyzing Metrics to Ramp-Up Very Large Knowledge Bases.
Proceedings of the Semantic Web - ISWC 2010 - 9th International Semantic Web Conference, 2010

Extraction of Genotype-Phenotype-Drug Relationships from Text: From Entity Recognition to Bioinformatics Application.
Proceedings of the Biocomputing 2010: Proceedings of the Pacific Symposium, 2010

Indexation et intégration de ressources textuelles à l'aide d'ontologies : application au domaine biomédical.
Proceedings of the IC 2010 : 21es Journées Ingénierie des Connaissances 2010 (Proceedings of the 21st French Knowledge Engineering Conference), 2010

2009
Ontologies for Formal Representation of Biological Systems.
Proceedings of the Handbook on Ontologies, 2009

Biomedical Data/Content Acquisition, Curation.
Proceedings of the Encyclopedia of Database Systems, 2009

BioPortal: ontologies and integrated data resources at the click of a mouse.
Nucleic Acids Res., 2009

Ontology-driven indexing of public datasets for translational bioinformatics.
BMC Bioinform., 2009

Comparison of concept recognizers for building the Open Biomedical Annotator.
BMC Bioinform., 2009

OBO & OWL: Roundtrip Ontology Transformations.
Proceedings of the Workshop on Semantic Web Applications and Tools for Life Sciences, 2009

What Four Million Mappings Can Tell You about Two Hundred Ontologies.
Proceedings of the Semantic Web - ISWC 2009, 8th International Semantic Web Conference, 2009

2008
The Stanford Tissue Microarray Database.
Nucleic Acids Res., 2008

Biomedical ontologies: a functional perspective.
Briefings Bioinform., 2008

Pathway knowledge base: An integrated pathway resource using BioPAX.
Appl. Ontology, 2008

BioPortal: A Web Repository for Biomedical Ontologies and Data Resources.
Proceedings of the Poster and Demonstration Session at the 7th International Semantic Web Conference (ISWC2008), 2008

A System for Ontology-Based Annotation of Biomedical Data.
Proceedings of the Data Integration in the Life Sciences, 5th International Workshop, 2008

UMLS-Query: A Perl Module for Querying the UMLS.
Proceedings of the AMIA 2008, 2008

Comparison of Ontology-based Semantic-Similarity Measures.
Proceedings of the AMIA 2008, 2008

2007
Annotation and query of tissue microarray data using the NCI Thesaurus.
BMC Bioinform., 2007

Current progress in network research: toward reference networks for key model organisms.
Briefings Bioinform., 2007

Searching ontologies based on content: experiments in the biomedical domain.
Proceedings of the 4th International Conference on Knowledge Capture (K-CAP 2007), 2007

Using Annotations from Controlled Vocabularies to Find Meaningful Associations.
Proceedings of the Data Integration in the Life Sciences, 4th International Workshop, 2007

Interpretation Errors related to the GO Annotation File Format.
Proceedings of the AMIA 2007, 2007

2006
A case study in pathway knowledgebase verification.
BMC Bioinform., 2006

Ontology-based Annotation and Query of Tissue Microarray Data.
Proceedings of the AMIA 2006, 2006

2004
CLENCH: a program for calculating Cluster ENriCHment using the Gene Ontology.
Bioinform., 2004

HyBrow: a prototype system for computer-aided hypothesis evaluation.
Proceedings of the Proceedings Twelfth International Conference on Intelligent Systems for Molecular Biology/Third European Conference on Computational Biology 2004, 2004

A Finite Model Theory for Biological Hypotheses.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

2003
A tool-kit for cDNA microarray and promoter analysis.
Bioinform., 2003

Can We Identify Cellular Pathways Implicated in Cancer Using Gene Expression Data?
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

A Contradiction-Based Framework for Testing Gene Regulation Hypotheses.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

1996
PGEN: A Novel Approach to Sequential Circuit Test Generation.
VLSI Design, 1996


  Loading...