Clinical features of COVID-19 mortality: development and validation of a clinical prediction model
Author
Li, Yan-chak
Yadaw, Arjun S
Bose, Sonali
Iyengar, Ravi
Bunyavanich, Supinda
Pandey, Gaurav
Institutions
Abstract
Background The COVID-19 pandemic has affected millions of individuals and caused hundreds of thousands of
deaths worldwide. Predicting mortality among patients with COVID-19 who present with a spectrum of complications
is very difficult, hindering the prognostication and management of the disease. We aimed to develop an accurate
prediction model of COVID-19 mortality using unbiased computational methods, and identify the clinical features
most predictive of this outcome.
MethodsIn this prediction model development and validation study, we applied machine learning techniques to clinical
data from a large cohort of patients with COVID-19 treated at the Mount Sinai Health System in New York City, NY, USA,
to predict mortality. We analysed patient-level data captured in the Mount Sinai Data Warehouse database for individuals
with a confirmed diagnosis of COVID-19 who had a health system encounter between March 9 and April 6, 2020. For
initial analyses, we used patient data from March 9 to April 5, and randomly assigned (80:20) the patients to the
development dataset or test dataset 1 (retrospective). Patient data for those with encounters on April 6, 2020, were used in
test dataset 2 (prospective). We designed prediction models based on clinical features and patient characteristics during
health system encounters to predict mortality using the development dataset. We assessed the resultant models in terms
of the area under the receiver operating characteristic curve (AUC) score in the test datasets.
Findings Using the development dataset (n=3841) and a systematic machine learning framework, we developed a
COVID-19 mortality prediction model that showed high accuracy (AUC=0·91) when applied to test datasets of
retrospective (n=961) and prospective (n=249) patients. This model was based on three clinical features: patient’s age,
minimum oxygen saturation over the course of their medical encounter, and type of patient encounter (inpatient vs
outpatient and telehealth visits).
Interpretation An accurate and parsimonious COVID-19 mortality prediction model based on three features might
have utility in clinical settings to guide the management and prognostication of patients affected by this disease.
External validation of this prediction model in other populations is needed.