key: cord-0113837-1ttqsoyy authors: Zhou, Helen; Cheng, Cheng; Lipton, Zachary C.; Chen, George H.; Weiss, Jeremy C. title: Predicting Mortality Risk in Viral and Unspecified Pneumonia to Assist Clinicians with COVID-19 ECMO Planning date: 2020-06-02 journal: nan DOI: nan sha: 5bf7f468b763f181c31a5e1edc57bce9a6dbd00c doc_id: 113837 cord_uid: 1ttqsoyy Respiratory complications due to coronavirus disease COVID-19 have claimed tens of thousands of lives in 2020. Many cases of COVID-19 escalate from Severe Acute Respiratory Syndrome (SARS-CoV-2) to viral pneumonia to acute respiratory distress syndrome (ARDS) to death. Extracorporeal membranous oxygenation (ECMO) is a life-sustaining oxygenation and ventilation therapy that may be used for patients with severe ARDS when mechanical ventilation is insufficient to sustain life. While early planning and surgical cannulation for ECMO can increase survival, clinicians report the lack of a risk score hinders these efforts. In this work, we leverage machine learning techniques to develop the PEER score, used to highlight critically ill patients with viral or unspecified pneumonia at high risk of mortality or decompensation in a subpopulation eligible for ECMO. The PEER score is validated on two large, publicly available critical care databases and predicts mortality at least as well as other existing risk scores. Stratifying our cohorts into low-risk and high-risk groups, we find that the high-risk group also has a higher proportion of decompensation indicators such as vasopressor and ventilator use. Finally, the PEER score is provided in the form of a nomogram for direct calculation of patient risk, and can be used to highlight at-risk patients among critical care patients eligible for ECMO. Coronavirus disease COVID-19 has spread globally, resulting in millions of infections. Many cases of COVID-19 progress from Severe Acute Respiratory Syndrome (SARS-CoV-2) to viral pneumonia to acute respiratory distress syndrome (ARDS) to death. Extracorporeal membrane oxygenation (ECMO) can temporarily sustain patients with severe ARDS to bridge periods of time when oxygenation via the lungs cannot be achieved via mechanical ventilation. However, ECMO is expensive and applicable only for patients healthy enough to recover and return to a high functional status. While ECMO is more e ective when planned in advance [Combes et al., 2018] , applicable risk scores remain unavailable [Liang et al., 2020 , American College of Cardiology, 2020 . This paper introduces the Viral or Unspeci ed Pneumonia ECMO-Eligible Risk Score, or PEER score for short. This risk score focuses on a critical care population eligible for ECMO and uses measurements acquired at the time of would-be planning-early in the critical care stay. While other pneumonia-based risk scores exist [Fine et al., 1997 , Marti et al., 2012 , Charles et al., 2008 , Lim et al., 2003a , our PEER score speci cally addresses risks associated with viral or unspeci ed pneumonia in the critical care setting and the cohort potentially eligible for ECMO. Inclusion of unspeci ed pneumonia broadens the population studied, and is chosen because the infectious etiology of pneumonia often cannot be determined. Though limited by geographic availability, ECMO usage has increased 4-fold in the last decade [Ramanathan et al., 2020] . COVID-19 guidelines suggest ECMO as a late option in the escalation of care in severe ARDS secondary to SARS-CoV-2 infection [Liang et al., 2020 , Alhazzani et al., 2020 . However, early evidence from epidemiological studies of coronavirus , Yang et al., 2020 , Zhou et al., 2020 has been insu cient to establish ECMO's utility. A pooled analysis of four studies [Henry, 2020] showed mortality rates of 95% with the use of ECMO vs. 70% without, but the number of patients receiving ECMO was small, and none of these studies were controlled nor did they specify indications for ECMO. In this paper, to better understand the role of ECMO as a rescue for ventilation non-responsive, SARS-CoV-2 ARDS, we study risk of death in ARDS among ECMO-eligible patients. Treatment guidelines for ARDS suggest ECMO use in severe ARDS alongside other advanced ventilation strategies [Matthay et al., 2020] . World Health Organization interim guidance also suggests ECMO for ARDS [World Health Organization et al., 2020] , citing e ectiveness in ARDS and mortality reductions in Middle East Respiratory Syndrome (MERS). Despite these recommendations and the resources allocated to ECMO for these situations [Ramanathan et al., 2020] , risk scores tailored to ECMO consideration are lacking. Our study addresses this need by drawing from viral and source unidenti ed cases of pneumonia that escalate to critical care admissions, guided by the intuition that ARDS associated with these pneumonia are expected to better resemble COVID-19 ARDS than all-comer ARDS. We develop and evaluate a risk score for critical care patients with viral or unspeci ed pneumonia and who are eligible for ECMO. Our risk score is validated on two large, publicly available critical care databases and predicts mortality at least as well as other existing risk scores. We stratify our cohorts into low-risk and high-risk groups, and nd that the high-risk group also has a higher proportion of decompensation indicators such as vasopressor and ventilator use. Finally, we provide the risk score in the form of a nomogram for direct calculation of patient risk. We provide this risk score as an assistive tool to highlight at-risk patients among critical care patients eligible for ECMO. There are a number of pneumonia [Lim et al., 2003b , Fine et al., 1997 , Charles et al., 2008 , Lim et al., 2003a , Guo et al., 2019 , COVID-19 [Gong et al., 2020b , Jiang et al., 2020 , Gong et al., 2020a , hospitalization mortality [Zimmerman et al., 2006] , and ECMO risk scores [Schmidt et al., 2015] , but none are focused around the time of risk evaluation for ECMO candidacy. The pneumonia and COVID-19 risk scores are assessed on populations with lower acuity, while APACHE is not focused on patients with respiratory illness. Our risk score focuses at the stage of ECMO planning rather than among patients already receiving ECMO. Registry-based studies have also considered expected mortality variation in SARS-CoV-2 compared to that of other viral infections, including MERS, H1N1 u, and seasonal u. One MERS-related ARDS study of critically ill patients demonstrated a higher mortality rate than those in studies on COVID-related ARDS, but may be attributed to more severely ill patients at enrollment [Arabi et al., 2017] . A similar study of H1N1 reported lower mortality rates (12-17%), albeit considering a younger population with average age of 40 [Aokage et al., 2015] . Physiologic concerns have also been raised about the use of ECMO for SARS-CoV-2. While ECMO is primarily bene cial for respiratory recovery, some argue that a spike in all-cause death but not ARDSrelated death could indicate a limited role of ECMO [Henry and Lippi, 2020] . In addition, COVID-associated lymphopenia might be exacerbated by ECMO-induced lymphopenia which could mechanistically a ect a healthy immune response to infection. In ammatory cytokines and speci cally interleukin 6 elevation is associated with COVID-19 mortality and rises with the use of ECMO [Henry, 2020 , Bizzarro et al., 2011 . These expert voices do not argue for the avoidance of ECMO, but rather call for additional study. The eICU Collaborative Research Database [Pollard et al., 2018] contains data for 200,859 admissions to intensive care units (ICU) across multiple centers in the United States between 2014 and 2015. The MIMIC-III clinical database [Johnson et al., 2016] consists of data from 46,476 patients who stayed in critical care units of the Beth Israel Deaconess Medical Center between 2001 and 2012. Our analysis and model development is done primarily using data derived from eICU, and externally validated on data derived from MIMIC-III. Cohort Selection Data for the study cohort were extracted according to the inclusion and exclusion criteria summarized in Figure 2 .1. Our population of interest is patients with viral or otherwise unspeci ed non-bacterial, non-fungal, non-parasitic, and non-genetic pneumonia. While there are no absolute contraindications of ECMO, the therapy is reserved for patients likely to have functional recovery. Thus, patients over 70 years old were excluded since they would not be good candidates for ECMO, and patients under 18 were excluded because SARS-CoV-2 pneumonia progressing to hypoxic respiratory failure is exceedingly rare in this age group. Other relative contraindications to ECMO include stroke, intracranial hemorrhage, disseminated intravascular coagulation, cancer, liver disease, renal failure, patients undergoing surgery, and congestive heart failure. Over the course of each patient's hospital stay, the rst ICU stay was analyzed. Patients who died or were discharged within the rst 48 hours of being admitted were excluded to focus on the stage of critical care after initial entry when lower-risk oxygen supplementation strategies (e.g., ventilation) are being performed, and, methodologically, to provide a richer set of features for prediction. 6.0 (5.3-6.7) 6.1 (5.3-7.0) Calcium (mg/dL) 8.2 (7.7-8.6) 8.2 (7.8-8.6) Albumin (g/dL) 2.6 (2.2-3.1) 3.0 (2.6-3.5) Prothrombin time ( The eICU and MIMIC cohorts were extracted using string matching on diagnosis codes and subsequent clinician review of the descriptors (provided in Appendix A). Features were manually merged from several tables into their corresponding elements based on a process of visualization, query, and physician review. This involved harmonizing feature units (e.g. Farenheit to Celsius), removing impossible values (e.g. negative blood pressures), and merging redundant data elds (e.g., lling in the sum of the individual Motor, Verbal, and Eye Opening Score elds for the Glasgow Coma Score Total when it was missing). We provide additional feature and outcome extraction details in Appendix B. All features were combined into a xed-length vector, using the most recent value prior to 48 hours after ICU admission. Before imputation, approximately half of the features had missingness below 5%, and 80% of the features had missingness below 30%, however several variables have high missingness (Appendix C). Missing values were imputed using MissForest [Stekhoven and BÃijhlmann, 2011] and our model was not sensitive to MissForest imputation (Appendix F). Features Features were extracted from demographics, comorbidities, vitals, physical exam ndings, and laboratory ndings routinely collected in critical care settings. Numerical features were normalized, and categorical features were converted into binary features (one less than the number of categories). The 52 features provided to the model are listed in Appendix B. Outcomes Our primary outcome of interest is in-ICU mortality. Secondary outcomes indicating decompensation include (1) vasopressor use and (2) mechanical ventilation use. For each outcome, we de ne the time to event as the time to rst outcome or censorship, where censorship corresponds to discharge from the ICU. For details, see Appendix B. Lasso-Cox To predict patients' survival time, we use the Cox proportional hazards model [Cox, 1972] with L1 regularization, referred to as Lasso-Cox [Simon et al., 2011] . We choose Lasso-Cox for its ease of interpretation and calculation, owing to its selection of sparse models. * In particular, for a patient with covariates x ∈ R d , the predicted log hazard of the patient is β x (higher log hazard is associated with shorter survival time), where β ∈ R d is the vector of Cox regression coe cients that can be interpreted as log hazard ratios. During model tting, L1 regularization λ d j=1 |β j | is used to encourage only a few features to have nonzero β value, where λ > 0 is a user-speci ed hyperparameter. Evaluation Metrics To evaluate model performance, we consider concordance and calibration. Concordance (c-index) is a common measure of goodness-of-t in survival models [Harrell and et al., 1982] , de ned as the fraction of pairs of subjects whose survival times are correctly ordered by a prediction algorithm, among all pairs that can be ordered. To compute concordance con dence intervals we use 1000 bootstrapped replicates, each the size of the dataset and sampled with replacement. Calibration is evaluated by plotting the Kaplan-Meier observed survival probability versus the predicted survival probability. Using the R package hdnom [Xiao et al., 2016] , we construct our calibration plots (Figure 4 .2) with 1000 bootstrap resamplings for internal calibration, using 5 groups and a prediction time of 3 days. We divided the eICU cohort into a training set (70% of the data, n=2537) and test set (30%, n=1080). The eICU training set is used for model selection and analysis, whereas the eICU test set and entirety of the MIMIC cohort are used for model evaluation. Throughout our analysis, we compare * We also tried the Cox model with elastic-net regularization (combined L1 and L2 regularization) but found little to no gain in concordance. our risk score (PEER) to three pneumonia risk scores: CURB-65 [Lim et al., 2003b] , PSI/PORT [Fine et al., 1997] , and SMART-COP [Charles et al., 2008] ; and one COVID-19 risk score: GOQ [Gong et al., 2020b] . Model selection We select λ via 10-fold cross validation and grid search on the eICU training set to maximize concordance subject to su cient sparsity. We observe that λ = 0.01 gives the best trade-o between concordance (0.73) and number of features selected (18), as a 0.01 increase in concordance corresponds to 10 additional non-zero features. To check the stability of this hyperparameter choice, we imputed our data using ten random seeds and ran 10-fold cross validation on the resulting datasets. Across all runs, λ = 0.01 achieved concordance of approximately 0.73 and selected similar features and coe cients. Additional details about grid search, the concordance and sparsity tradeo , and robust selection of coe cients can be found in Appendix D, Appendix Figure F Code for data extraction and all model results is available at https://github.com/hlzhou/peer-score. As explained in Section 3, model selection yielded a L1-regularized Cox model with λ = 0.01. The learned hazard ratios, i.e. the PEER score, for normalized data are displayed in Table 2 and as a boxplot in Appendix Figure E .1. For reference, Appendix E contains the standard deviation and mean of each feature in the model. To assist with easy risk score calculation, we also provide a nomogram [Xiao et al., 2016] in Figure 4 .1 † . [Lim et al., 2003b] 0 To evaluate discriminative ability, we use concordance. On both the eICU and MIMIC datasets, our Lasso-Cox model (which we name the PEER score) achieves concordance greater than or comparable to that of existing risk scores (Table 3) . On the eICU test set, the PEER score achieves the highest concordance among the risk scores, 0.77, with the second highest, 0.73, attained by SMART-COP. On the MIMIC dataset, PEER and SMART-COP again achieve the highest concordances, but concordance degrades to 0.66 for both scores. One possible reason for this degradation is that arterial oxygen saturation (SaO2) is a signi cant feature and only 1.5% of these values are missing in eICU, but in MIMIC 72.6% of these values are missing (Appendix C). From the PEER score calibration curves (Figure 4 .2), we see one high risk group separate from the other low risk groups. While predicted survival probability of the high risk group is overestimated in the training set, it is estimated within con dence intervals in both test sets. To compare low-risk and high-risk populations (as de ned by our model), we set a threshold at the 90th percentile of the predicted risk value from the training set. We nd that over a 7 day time period, the Kaplan-Meier survival curves for the high and low risk subpopulations are clearly distinct In addition to mortality, decompensation can be indicated by vasopressor and ventilator use. As expected, vasopressors and ventilators are more commonly used in the high risk group (Figure 4.4) . Our PEER score achieves greater or comparable concordance to baselines on the eICU (in-domain) and MIMIC (out-of-domain) test sets. The Lasso-Cox model selects 18 features, making it easy to calculate. We nd that lower SaO2, which is associated with poorer oxygenation status, is predictive of decompensation. As expected, we also nd that old age is predictive of death. Red blood cell distribution width, associated with expanded release of immature red blood cells as a response to insu cient oxygen delivery to tissues, is also a strong risk factor for death in patients with COVID-19 [Gong et al., 2020a] . Three variables (pH, prothrombin time, and age) violate the proportional hazards assumption of the Cox model, and Lasso-Cox models shrink coe cients towards 0, so the hazard ratios themselves should be interpreted with caution. Stratifying each cohort into low risk and high risk subpopulations based the PEER score, there is a clear separation in their survival curves (Figure 4 .3) across all three datasets, with the low risk group observing a survival of 0.9 or greater after 7 days and the high risk group observing a survival of 0.75 or less. The mortality-based high risk groups also are associated with our proximate indicators of decompensation: vasopressor and ventilator use (Figure 4.4) . Compared to the survival curves for low and high risk subpopulations derived from other risk scores, the survival curves from the PEER score are the most distinctly separated (Appendix G Figure G.1) . Calibration plots for the PEER score also show a high risk group clearly separated from the rest ( Figure 4 .2). While the survival probability of the high risk group is overestimated in the training set, it is within error bars in the test set. For ECMO allocation, practically, accurate ranking of risk, as measured by concordance, may be more important than the probabilities predicted. While the PEER score outperformed other risk scores in the eICU test set, there was a decline in performance on the MIMIC test set, and the performance of the PEER score was comparable to that of SMART-COP. We also note that in MIMIC, arterial oxygen saturation (SaO2) has 72.6% missingness yet is an important feature of the PEER score. In contrast, it has 1.5% missingness in eICU. This demonstrates the importance of thinking critically about how our risk score, which was trained on the eICU cohort and depends on 18 speci c features, generalizes to the population to which the score is being applied. Limitations and Future Work Importantly our cohort is de ned not by COVID-19 positive pneumonia patients but instead by viral or unspeci ed pneumonia patients who are ECMO-eligible. While our risk score demonstrates good discriminative ability and is interpretable, there are several additional decisionmaking considerations beyond the scope of this paper. Clinicians interested in applying the risk score to COVID-19 pneumonia should consider how representative this population is of their own. Because ECMO is a constrained resource, there are also ethical questions about who should get treatment. This risk score does not attempt to address these questions, but simply provides relevant information to those making such decisions. More broadly, we hope to provide this risk score as a potential resource for future SARS-like diseases that require ECMO consideration. The cohort for model development and analysis on a held-out set was extracted from the eICU Collaborative Research Database Version 2.0. A cohort was also extracted from MIMIC-III Version 1.4 for external validation. For both the eICU and MIMIC cohorts, viral or unspeci ed pneumonia patients were included by string matching of International Classi cation of Disease (ICD 9) diagnosis code descriptions and subsequent clinician review of these descriptors. Using the patient and demographics tables, patients over 70 or under 18 were excluded. Other relative contraindications to ECMO were likewise excluded by string matching and clinician review of the resulting descriptors. OR ((lower(diagnosisstring) like '%hepatic%' OR lower(diagnosisstring) like '%hepatitis%') AND (diagnosisstring NOT IN ('gastrointestinal|post-GI surgery|s/p hepatic surgery', 'gastrointestinal|hepatic disease|toxic hepatitis', 'infectious diseases|GI infections|intra-abdominal abscess|hepatic|bacterial', 'burns/trauma|trauma -abdomen|hepatic trauma', 'gastrointestinal|abdominal/ general|intra-abdominal abscess|subhepatic', 'gastrointestinal|hepatic disease|hepatorenal syndrome', 'gastrointestinal|hepatic disease|hepatic infarction', 'cardiovascular|shock / hypotension|sepsis|sepsis with single organ dysfunction-acute hepatic failure', → 'toxicology|drug overdose|acetaminophen overdose|hepatic injury expected', 'gastrointestinal|hepatic disease|hepatic dysfunction|pregnancy related', 'gastrointestinal|hepatic disease|hepatic dysfunction', 'gastrointestinal|trauma|hepatic trauma', 'toxicology|drug overdose|acetaminophen overdose|hepatic injury unexpected'))) OR lower(diagnosisstring) like '%liver disease%' OR lower(diagnosisstring) like '%congestive heart failure%' OR ((lower(diagnosisstring) like '%renal%') AND (lower(diagnosisstring) not like '%adrenal%') AND (icd9code like '40%')); Here is the R code used to extract the MIMIC cohort: # exclude based in ICD codes exclusion = bind_rows( fread(paste0(mimicdir,"DIAGNOSES_ICD.csv")) %>% as_tibble() %>% inner_join(read_csv(paste0(mimicdir,"D_ICD_DIAGNOSES.csv")) %>% select(ICD9_CODE,LONG_TITLE), by="ICD9_CODE") %>% filter(str_detect(str_to_lower(LONG_TITLE), pattern="isseminated intra|cerebral hem|cerebral inf")) %>% filter(!str_detect(str_to_lower(LONG_TITLE), pattern="history")), fread(paste0(mimicdir,"DIAGNOSES_ICD.csv")) %>% as_tibble() %>% inner_join(read_csv(paste0(mimicdir,"D_ICD_DIAGNOSES.csv")) %>% select(ICD9_CODE,LONG_TITLE), by="ICD9_CODE") %>% filter(str_detect(str_to_lower(LONG_TITLE), pattern="urgical")) %>% filter(!str_detect(str_to_lower(LONG_TITLE), pattern=" not ")) %>% filter(SEQ_NUM < 3) # only remove if surgery in top 3) %>% select(SUBJECT_ID) %>% rename(subject_id=SUBJECT_ID) %>% distinct() exclusion = exclusion %>% bind_rows( codx %>% # codx.tsv list from MIT-LCP MIMIC extraction code select(liver_disease, renal_failure,congestive_heart_failure, subject_id) %>% mutate(anyofthem=liver_disease+renal_failure+congestive_heart_failure) %>% filter(anyofthem>0) %>% select(subject_id)) %>% distinct() # Additional filtering. # For brevity, the following code only includes the filter operations # (original code had many joins, mutations, etc. and will released upon publication) pna_cohort = pna_cohort %>% filter(as.numeric(age at icu admission)<70) %>% filter(! hosp_id %in% exclusion$subject_id) %>% filter(censor_or_deceased_days > 0) %>% filter(cancer==0) Features and outcomes in eICU were extracted from tables containing patient ICU stays, demographics, diagnoses, nurse charting, nurse assessments, periodic vitals, laboratory ndings, and treatment information. In MIMIC, this data were extracted from tables describing patient demographics, hospital encounters, ICU encounters, diagnoses, laboratory measurements, nurse charting events, event monitoring, and procedures. Features Features were manually merged into their corresponding elements based on a process of visualization, query, and physician review. For example, both eICU and MIMIC store the Glasgow Coma Score as a Total and as individual Motor, Verbal, and Eye Opening Scores respectively which requires a merge. The full list of 52 features our model had access to (prior to model selection): Table 4 contains the levels of missingness for the variables in Table 1 . Table 2 ). For reference, the means and standard deviations of the nonzero features are in Table 5 . MissForest [Stekhoven and BÃijhlmann, 2011 ] is a non-parametric missing value imputation technique which iteratively ts random forests to impute missing values in each column, starting with the column with the fewest missing values. The process repeats itself until the di erence between imputed arrays over successive iterations meets a stopping criterion. Our experiments use the missingpy package to perform MissForest imputation. To check the sensitivity of model selection to MissForest imputation, we re-ran the imputation algorithm with 10 di erent seeds and re-ran grid search to nd the best set of parameters on each of these 10 datasets. As discussed in the paper, Lasso-Cox with a penalty (λ) level of 0.01 consistently achieved high concordance (around 0.73) while only selecting approximately 18 same features out of a total of 52 features across the 10 di erent seeds. Selecting additional features by lowering the penalty level did not yield substantial gains in performance ( Figure F.1) . With a penalty level of 0.01, Lasso-Cox was consistent in the features selected and coe cients it learned. Figure F.2 shows that across all except one of the ten random forest imputations of the dataset (in which it selected one less feature), all 18 features were selected and the model learned relatively consistent coe cients. Figure F.2: Coe cients learned by the L1-regularized cox model with penalty 0.01 from ten random forest imputations of the data. The x-axis has the variable name (number of times that the variable has nonzero coe cient across 10 runs) G Survival Curves for High and Low Risk Groups Figure G.1 contains survival curves for high and low risk groups derived from the PEER score (our model), CURB-65, PSI/PORT, SMART-COP, and GOQ. The high/low risk groups for CURB-65 and PSI/PORT were created by following the criteria listed in their respective papers where former one to greater or equal to 3 and the later one to higher or equal to 130. For SMART-COP and GOQ, since we did not nd a cuto criterion in their papers, we just follow our criterion to set the cuto point as the 90th percentitle of the train set risk. Figure G .1: Comparison of survival curves corresponding to risk strata derived from the PEER score, CURB-65, PSI/PORT, SMART-COP, and GOQ Surviving sepsis campaign: guidelines on the management of critically ill adults with coronavirus disease 2019 (COVID-19) American College of Cardiology Extracorporeal membrane oxygenation for acute respiratory distress syndrome Critically ill patients with the Middle East Respiratory Syndrome: a multicenter retrospective cohort study Infections acquired during extracorporeal membrane oxygenation in neonates, children, and adults SMARTâĂŘCOP: A Tool for Predicting the Need for Intensive Respiratory or Vasopressor Support in CommunityâĂŘAcquired Pneumonia ECMO for severe acute respiratory distress syndrome Regression models and life-tables A prediction rule to identify low-risk patients with community-acquired pneumonia Multicenter development and validation of a novel risk nomogram for early prediction of severe 2019-novel coronavirus pneumonia A tool to early predict severe 2019-novel coronavirus pneumonia (covid-19): a multicenter study using the risk nomogram in wuhan and guangdong Clinical Features Predicting Mortality Risk in Patients With Viral Pneumonia: The MuLBSTA Score Evaluating the Yield of Medical Tests COVID-19, ECMO, and lymphopenia: a word of caution. The Lancet Respiratory Medicine Poor survival with extracorporeal membrane oxygenation in acute respiratory distress syndrome due to coronavirus disease 2019: Pooled analysis of early reports Towards an Articial Intelligence Framework for Data-Driven Prediction of Coronavirus Clinical Severity MIMIC-III, a freely accessible critical care database Handbook of COVID-19 prevention and treatment. The First A liated Hospital De ning community acquired pneumonia severity on presentation to hospital: an international derivation and validation study De ning community acquired pneumonia severity on presentation to hospital: An international derivation and validation study Prediction of severe community-acquired pneumonia: A systematic review and meta-analysis Treatment for severe ARDS from COVID-19. The Lancet Respiratory Medicine The eICU collaborative research database, a freely available multi-center database for critical care research Planning and provision of ECMO services for severe ARDS during the COVID-19 pandemic and other outbreaks of emerging infectious diseases. The Lancet Respiratory Medicine Predicting survival after ECMO for refractory cardiogenic shock: The survival after veno-arterial-ECMO (SAVE)-score Regularization paths for Cox's proportional hazards model via coordinate descent MissForestâĂŤnon-parametric missing value imputation for mixedtype data Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China World Health Organization et al. Clinical management of severe acute respiratory infection (sari) when COVID-19 disease is suspected: interim guidance hdnom: Building nomograms for penalized Cox models with highdimensional survival data. bioRxiv Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in Wuhan, China: a single-centered, retrospective, observational study. The Lancet Respiratory Medicine Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan rbcs', 'wbc', 'platelets', 'hemoglobin', 'hct', 'rdw', 'mcv', 'mch', 'mchc', 'neutrophils', 'lymphocytes', 'monocytes', 'eosinophils', 'basophils', 'bun', 'temperature A slightly larger set of features relevant to COVID-19 was initially considered (including smoking, Creactive protein, discharge to nursing home), but due to unreliable extraction and high missingness, these were excluded in both our model's training and in calculation of existing risk scores Outcomes Vasopressor use was de ned by use of norepinephrine, epinephrine, phenylephrine, vasopressin, milrinone, dobutamine, or dopamine. Mechanical ventilation was de ned by the documentation of mechanical ventilator use in the treatment table (eICU) or nurse charting (MIMIC) Below are the diagnosisstring's used to select patients with viral or unspeci ed pneumonia.'infectious diseases|chest/pulmonary infections|pneumonia|ventilator-associated', 'surgery|respiratory failure|ARDS|pulmonary etiology|pneumonia', 'infectious diseases|chest/pulmonary infections|pneumonia|community-acquired|viral', 'infectious diseases|chest/pulmonary infections|pneumonia', 'infectious diseases|chest/pulmonary infections|lung abscess|secondary to pneumonia', 'pulmonary|pulmonary infections|pneumonia|community-acquired|viral|respiratory syncytial', 'pulmonary|pulmonary infections|pneumonia|community-acquired', 'surgery|infections|pneumonia|hospital acquired (not ventilator-associated)', 'infectious diseases|chest/pulmonary infections|empyema|associated with pneumonia', 'pulmonary|pulmonary infections|pneumonia|hospital acquired (not ventilator-associated)', 'pulmonary|pulmonary infections|pneumonia|hospital acquired (not ventilator-associated)', 'infectious diseases|chest/pulmonary infections|pneumonia|opportunistic', 'surgery|respiratory failure|acute lung injury|pulmonary etiology|pneumonia', 'pulmonary|respiratory failure|acute lung injury|pulmonary etiology|pneumonia', 'infectious diseases|chest/pulmonary infections|pneumonia|community-acquired', 'surgery|infections|pneumonia', 'pulmonary|pulmonary infections|pneumonia|community-acquired|viral', 'infectious diseases|chest/pulmonary infections|pneumonia|hospital acquired (not ventilator-associated)', 'pulmonary|respiratory failure|ARDS|pulmonary etiology|pneumonia', 'pulmonary|pulmonary infections|pneumonia|hospital acquired (not ventilator-associated) |viral', 'transplant|s/p bone marrow transplant|idiopathic pneumonia syndrome -bone marrow transplant', 'pulmonary|pulmonary infections|lung abscess|secondary to pneumonia', 'infectious diseases|chest/pulmonary infections|pneumonia|community-acquired|viral| respiratory syncytial', 'pulmonary|pulmonary infections|pneumonia', 'pulmonary|pulmonary infections|pneumonia|ventilator-associated', 'pulmonary|pulmonary infections|pneumonia|opportunistic ', 'surgery|infections|pneumonia|community-acquired' In addition to ltering for patients between 18-70 years old, the following code was used to exclude other contraindications based on ICD codes. All resulting descriptors were clinician-reviewed: CREATE