key: cord-0956037-8qa5jh1k authors: Lin, Ju-Kuo; Chien, Tsair-Wei; Wang, Lin-Yen; Chou, Willy title: An artificial neural network model to predict the mortality of COVID-19 patients using routine blood samples at the time of hospital admission: Development and validation study date: 2021-07-16 journal: Medicine (Baltimore) DOI: 10.1097/md.0000000000026532 sha: f609bc765bb05b904f744b53cac6a2b96ae79d63 doc_id: 956037 cord_uid: 8qa5jh1k BACKGROUND: In a pandemic situation (e.g., COVID-19), the most important issue is to select patients at risk of high mortality at an early stage and to provide appropriate treatments. However, a few studies applied the model to predict in-hospital mortality using routine blood samples at the time of hospital admission. This study aimed to develop an app, name predict the mortality of COVID-19 patients (PMCP) app, to predict the mortality of COVID-19 patients at hospital-admission time. METHODS: We downloaded patient records from 2 studies, including 361 COVID-19 patients in Wuhan, China, and 106 COVID-19 patients in 3 Korean medical institutions. A total of 30 feature variables were retrieved, consisting of 28 blood biomarkers and 2 demographic variables (i.e., age and gender) of patients. Two models, namely, artificial neural network (ANN) and convolutional neural network (CNN), were compared with each other across 2 scenarios using: 1. raw laboratory versus normalized data and 2. training vs testing datasets (n = 361 and n = 106/361≅30%) to verify the model performance (e.g., sensitivity [SENS], specificity [SPEC], and area under the receiver operating characteristic curve [AUC]). An app for predicting the mortality of COVID-19 patients was developed using the model's estimated parameters for the prediction and classification of PMCP at an earlier stage. Feature variables and prediction results were visualized using the forest plot and category probability curves shown on Google Maps. RESULTS: We observed that: 1. the normalized dataset gains a relatively higher AUC(>0.9) when compared to that(<0.9) in the raw-laboratory dataset based on training data, 2. the normalized dataset in ANN yielded a high AUC of 0.96 that that(=0.91) in CNN based on testing data, and 3. a ready and available app, where anyone can access the model to predict mortality, for PMCP was developed in this study. CONCLUSIONS: Our new PMCP app with ANN model accurately predicts the mortality probability for COVID-19 patients. It is publicly available and aims to help health care providers fight COVID-19 and improve patients’ classifications against treatment risk. COVID-19 is a highly contagious infection caused by SARS-CoV2. [1] [2] [3] As of June 10, 2021, COVID-19 cases and deaths are approaching 1.75 billion and 3.78 million, respectively, worldwide in more than 191 countries/regions. [4] 1. 1 When a new disease (e.g., starts to spread, many questions emerge. [5, 6] One of the most frequently asked questions is how high is the case fatality rate (CFR) [5] and what is the model to predict patients at risk of high mortality. [1, 7] If we ignore the classification of patients at high risk with proper treatments, the condition of patients might rapidly deteriorate. [1] Numerous studies addressed that deceased patients in the COVID-19 epidemic initially had mild symptoms and then suddenly transitioned to a critical stage, or leading to death. [8] [9] [10] For example, over 75% of deceased patients in Italy presented mild symptoms(e.g., fever, dyspnea, and cough) at admission to the hospital. [3] In the COVID-19 pandemic, the shortage of resources and medical staff causes big problems (e.g., high case fatality rate [5] ) in the health-care system. [1] As such, a prognostic model to predict mortality at an earlier stage is required for the development. Traditional solutions to classify the mortality of COVID-19 patients (PMCP) merely 1. predicted the mortality of individual patients more than 10 days to survival or death (i.e., at least 10 days from admission to hospital to death or discharge), [2] 2. compared model accuracies without an app provided to readers, [11] [12] [13] 3. applied machine learning techniques with professional software [14] [15] [16] [17] instead of deep learning on the popular and friendly-use MS Excel (Microsoft Corp.), [18] [19] [20] and 4. displayed the prediction probability and the classification (e.g., death or survival) only with a statice measure (or diagram) [1, 14] rather than a dashboard that can be practiced to interpret the prediction result on their own. Accordingly, artificial intelligence (AI), helping healthcare providers efficiently and effectively classify COVID-19 patients is critical and of importance. Furthermore, many types of prediction methods that can be used in prediction mortality on COVID-19, such as Logistic regression, Naïve Bayes, Decision trees, Random Forests, Gradient tree boosting, [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] and other artificial neural networks(e.g., a Feed forward Neural Network, a Radial Basis Function Neural Network, a Multilayer Perceptron, a Recurrent Neural Network, a Modular Neural Network, or a Sequence-To-Sequence Model). [31] None of the research applied artificial neural network (ANN) or convolutional neural network (CNN) to develop an app used for PMCP at the time of hospital admission. Many studies have extracted feature variables and developed AI prediction models of mortality. [1, 2, [14] [15] [16] [17] [18] [19] [20] Some [1, 2, 11] applied blood samples to make a prediction of mortality for COVID-19 patients. However, none provided an app (or web application) with dynamical visualizations of prediction results. In machine learning, data normalization instead of raw data has been discussed in the literature. [32] Similar data dealt with different data formats(e.g., raw-laboratory and normalized data) were studid. [1, 2] Although using normalization methods to remove various technical biases was suggested, there are no previous studies evaluating the impacts of normalization on disease diagnosis. [32] Feature variables might be seen based on either the original observed scores [2] or the corresponding normalized data [1] in machine learning. We are motivated to investigate whether the PMCP performance would be different using raw or normalized data. Therefore, in this study, we aimed to 1. develop an ANN model to predict the mortality of COVID-19 patients at hospital-admission time, and 2. design an app, name PMCP app, on a public website so that all patients and medical staff could predict mortality of COVID-19 patients on their own. We downloaded patient records from two studies, including 361 COVID-19 patients in Wuhan, China, [2] and 106 COVID-19 patients in 3 Korean medical institutions. [1] A total of 30 feature variables were retrieved, consisting of 28 blood biomarkers and 2 demographic variables (i.e., age and gender) of patients. [2] Lin et al. Medicine (2021) 100:28 Medicine All data [33] (Supplemental Content 1, http://links.lww.com/MD2/ A274) used in this study were downloaded from the deposited datasets in the previous studies, [1, 2] which means the study is not necessary for ethical approval according to the regulation promulgated by the Taiwan Ministry of Health and Welfare. We replaced the missing data (i.e., 4.9% and 31.6% in training and testing sets, [1] respectively) with the mean value of each biomarker for training and testing data sets. Next, the mean and standard deviation (SD) were computed for each variable from the training set. Normalized data (i.e., mean = 0, SD = 1) were applied to the training and testing data sets [1] using the formula (=[observed scoresmean]/standard deviation [SD]). We then added 2 more features (i.e., age, and gender) to the 28 biomarkers and trained our AI model using 30 features. [2] Forest plots [34, 35] were drawn to present these 30 features. The standard mean difference (SMD) method was utilized to compare the differences in variables alone (like t-test) using the forest plot. The Chi-Squared test was conducted to assess the heterogeneity between variables. The forest plots (confidence interval [CI] plot) were drawn to display the effect estimates and their CIs for each study. Two models, namely, artificial neural network (ANN) and convolutional neural network (CNN), were compared with each other across 2 scenarios using First, the 361-patient training and 106-patient testing datasets came from the studies, [1, 2] where the training set was used to predict the testing set. Second, the accuracies (e.g., SENS, SPEC, and AUC) in training data used to predict the testing set were verified. The data are shown in reference. [33] The artificial neural network (ANN) and convolutional neural network (CNN) were analyzed using the 2 scenarios (i.e., data types and model sets) previously mentioned. The CNN has traditionally been performed on Microsoft (MS) Excel (Microsoft Corp.), [18] [19] [20] whereas the ANN has not been performed on MS Excel in the past. As illustrated in Figure 1 , the ANN process involves data input in Layer 1 where the data are combined with 2 types of parameters and run through the sigmoid function algorithms in Layers 2 and 3. Finally, as shown on the right side and bottom of Figure 1 , the prediction model was deemed complete when the total residuals were minimized through the MS Excel function of SUMXMY2 and Solver add-in. Comparisons of balanced accuracies across 2 scenarios were made in both ANN and CNN models based on the medians in respective models. Comparison of prediction models using the Weka software. To better understand the effectiveness and efficacy of the ANN and CNN models, several machine learning algorithms in the Weka software [36] were used to compare the ANN and CNN models and evaluate the high indicators of SENS, SPEC, balanced accuracy, and AUC in 2 data types (i.e., raw and normalized data) and 2 types of the model mentioned in the previous section. Developing an App for PMCP. An app, named PMCP app, for early PMCP, was designed and developed. Model parameters were embedded in the computer module. The results of the classification (i.e., death and survival) would instantly appear on smart phones. The visual representation with binary (i.e., deceased and survival) categorical probabilities are shown on a dashboard displayed on Google Maps. [18] [19] [20] An author-made MS Excel VBA module was applied to draw the forest plot and perform the ANN and CNN algorithms. The significance level of Type I error was set at 0.05. ANN and CNN were performed on MS Excel as well. A visual representation of the classification was plotted using two curves based on the Rasch model. [37] The study flowchart and the ANN modeling process are shown in Figure 2 and Supplemental Content 1, http://links.lww.com/MD2/A275. For training data, we used the blood test results obtained from 375 COVID-19 patients collected between January 10, 2020, and February 24, 2020, in Tongji Hospital, Wuhan, China. [2] Of [1] were displayed using the forest plot in Figure 3 , where the age showing the older with a high probability of death on the right-hand side due to older, and the gender on the left-hand side, indicating the male (i.e., coded 1 vs female coded 2) with a tendency toward death due to a lower value. As such, we can easily discriminate which variables with the tendency toward death are positively (the more) to the right panel or negatively (the less) to the left part. The highest Z-score (=À14.17) is at the variable of Lymphocyte and the least (=4.22) at INT (International standard ratio). The Q-statistic is 2,060,934 with degrees of freedom = 29 (P < .001), indicating that the SMD of the 30 variables is significantly different. Readers are invited to click on the links [38] and examine the detail of the feature variables on the online forest plot. Comparison of model performance in 2 types of data formats 3.2.1. AUC equals 0.96 in ANN using normalized data. Three parts are separated in Table 1 , including (A) raw data, (B) normalized data, and (C) the models from the previous study. [1] We can see that 1. the normalized dataset gains a higher AUC(>0.9) when compared to that (<0.9) in the raw-laboratory dataset when we referred to 95% CIs of AUC in CNN and ANN (see the footnote beneath Similarly, we examine the model performances using the training data to predict the testing data in Table 2 . The CNN has a higher balanced accuracy (=0.86) than that in ANN (=0.82) based on the raw-laboratory data. In contrast, the ANN has a higher balanced accuracy (=0.80) than that in CNN (=0.73) based on the normalized data. and CNN compared to other models. Basically, all those models have equivalently equal balanced accuracies, see the median of 0.92, 0.93, 0.92 (at the bottoms in these 3 panels of Table 1 ), and the footnote of AUC 95% CIs beneath Table 1 . The reason for explaining the difference in AUC (=0.87 and 1.0) using Random forest methods is worthy of investigation in the discussion section due to similar data used in our study and the previous research. [1] The median (=0.86) in the normalized data is higher than that (=0.84) in the raw-laboratory data, but the ANN has a higher balanced accuracy (=0.82) in raw-laboratory data than that (=0.80) in normalized data if the 95% CIs were ignored. If the 95% CIs of AUC are considered in the normalized data, the ANN has 0.72 and 0.88 in the range of AUC, equal to the CNN ranged between 0.65 and 0.81, and other models, but below the EDRnet with the boundary limits between 0.92 and 0.99 reported in the previous study. [1] The interface of the PMCP app is shown on the left-hand side of Figure 4 . Readers are invited to click on the links [39] and interact with the PMCP app. Notably, all 53 model parameters are embedded in the 30-item ANN model. Once the responses after clicking on the icon of the test [39] are submitted, the app generates the result (shown on the right-hand side of Fig. 4 ) as a classification of either possible death or survival on smart phones. An example in which the patient scored with a high probability (0.96) of survival is shown on the right-hand side of Figure 4 . The curve starts from the top-left corner to the bottom-right corner. The sum of the probabilities of death and survival is 1.0. The odds ratio can be calculated using the formula p/[1Àp] (= 0.04/0.96 = 0.04), indicating that this COVID19 patient has a high probability of survival within the later treatments in hospitalization. We observed that 1. the normalized dataset gains a higher AUC(>0.9) when compared to that(<0.9) in the raw-laboratory dataset based on training data, 2. the normalized dataset in ANN yielded a high AUC of 0.96 that that(=0.91) in CNN on testing data, and 3. a ready and available PMCPapp was developed for anyone who can practice the model to predict mortality of COVID-19 patients on their own. The sudden increase in COVID-19 cases drives high pressure to healthcare services and requires a rapid, accurate, and early clinical assessment of the disease severity in the hospital. [2] The highest Z-score(=À14.17) is at Lymphocyte, the fourth at lactate dehydrogenase (LDH) LDH=12.54, and the sixth at highsensitivity C-reactive protein (hs-CRP) (=12.35), see Figure 3 . Numerous studies have shown that the disease progression of COVID-19 is not only associated with lymphocyte, [2, 40, 41] LDH, [2, 7, [42] [43] [44] [45] and hs-CRP, [2, 7, 44, [46] [47] [48] [49] but also with other blood-based biomarkers, such as neutrophil counts(at the 5th in Fig. 3) , [48, 50, 51] albumin(at the third in Fig. 3) , [50, 52, 53] and prothrombin activity (at the 7th in Fig. 3 ). [50, [54] [55] [56] Similarly, Yan et al applied the mortality prediction model with 3 blood biomarkers (i.e., LDH, lymphocyte, and hs-CRP) to predict mortality with 90% accuracy. [2] Hu et al proposed 4 variables of age, hs-CRP, lymphocyte count, and d-dimer level to PMCP with AUC = 0.881. [14] Ko et al developed an AI model, EDRnet, to PMCP using 28 blood biomarkers obtained within 24 hours after hospital admission along with 2 demographic data of age and gender (similar to the current study) with high sensitivity (100%), specificity (91%), and accuracy (92%). [1] However, Yan et al [2] predicted mortality ten days before the occurrence of survival or death instead of blood biomarkers obtained within 24 hours. Instead, the mortality prediction at the time of admission can be substantially more informative for clinicians because the critical time regarding disease progression is 10 to 14 days from the onset of symptoms. [41, 48, 57] Many studies compared model performance as we did in Tables [11] [12] [13] and performed machine learning using professional software [14] [15] [16] [17] instead of the popular and friendly-use MS Excel (Microsoft Corp.) [18] [19] [20] as we did in Figure 1 . A few [1, 14] provided an app to display the prediction probability and the classification (e.g., death or survival) rather than an animationtype dashboard to interpret the prediction result as Figure 4 shown in this study. Besides 3 features, including 1. ANN model performed in Microsoft Excel, 2. an app combined with ANN and PMCP displayed on a dashboard, and 3. the online forest plot used to present feature variables, in this study, other 2 are 1. many prediction models compared in a study using the same data to present the model accuracy, and 2. model accuracy compared between raw data and normalized data. Table 2 Model comparison using training data to predict testing data (n = 106). [58, 59] was performed on MS Excel, which has not been reported in the literature, but is easily understood by readers who are familiar with Microsoft Excel. The animation-type dashboard enable readers practice the app on their own with an easy understanding of the classification resultswe. Furthermore, the online forest plot is unique and modern to display feature variables in comparison with each other, [38] unlike others commonly using a simple table with P value to present feature variables. It is novel and easy to understand 1. the direction of a feature toward death on either the left (with the less value) or right (with the more value) side, and 2. the importance of a feature with the Z-score. 3. For instance, Yan et al [2] plotted a decision rule using 3 key features and their thresholds to PMCP. In which, LDH 365 UI À1 denotes death (similar to the right side in Fig. 3 ), Lymphocyte 14.7% represents death (similar to the right side in Fig. 3) , and hs-CRO≥41.2 mgI À1 possibly stands for death (similar to the right side in Fig. 3 ). The different types of algorithms for classification in machine learning [60, 61] were compared each other, include logistic regression, support vector machine, naïve Bayes, random forest classification, ANN, CNN, [18] [19] [20] and k-nearest neighbor, by observing their 95% CIs. In addition, whether data in machine learning should be normalized has been evident in Tables 1 and 2 , which provide an reference to the furture study related to the machine learning. A single-center usually attempts to increase model performance. [62] As expected, the model accuracies reached 90% and 0.881 for 2 studies, [2, 14] respectively, in Tongji Hospital, Wuhan, China. [2] Surprisingly, Ko et al gained a higher (not lower as expected) accurate rate (92%, see Random forest at the bottom panel in Table 2 ) using data in China to predict the testing data in 3 Korean medical institutions. [1] Two questions were thus raised: why results of Random forest are different from 2 studies using similar data 1. in training datasets(e.g., 1 vs 0.89 in Table 1 ), and 2. in testing datasets (e.g., 0.90 vs 0.92 in Table 2 ) to externally validate the final predictive model. We doubt the inconsistent result (0.92 > 0.87 in testing and training data, respectively [1] ) that multicenter models reasonably exhibit poor performance, [62] which should be cautious and discussed in the future. The ANN exhibited better accuracy than CNN in normalized data ( Table 2) . We have not seen others using the ANN approach to predict PMCP in the literature, which is a breakthrough in this study. We have also not noticed any article incorporating indicators of raw and normalized data to compare model performance, which is also required for verifications in the future. More than 2114 articles searched using the keyword "artificial neural network" [Title] have been found in PubMed Central on December 26, 2020. [63] None of the articles used MS Excel to perform the ANN. The interpretations of the ANN concept and process, as well as the parameter estimations, are shown in Figure 1 , Supplemental Content 1, http://links.lww.com/MD2/ A275, and the app. [39] Readers can adjust the input values in the ANN model on their own and examine the differences in results. In addition to the performance of the ANN model (i.e., Balanced accuracy = 0.96 and 0.80 with normalized raw data in training and testing data, respectively), we considered its generalizability in the future. As did in the previous study, [1] researchers are encouraged to apply the original training data [33] to PMCP against the testing data of their own for COVD-19 patients in the future. The categorical probability curves are shown in Figure 4 . The binary categories (e.g., success and failure of an assessment in the psychometric field) have been frequently applied in health-related outcomes. [18] [19] [20] [64] [65] [66] However, we are the first to provide the categorical probability curves of the PMCP dashboard displayed on Google Maps (Fig. 4) . Although our model was designed to be specific to COVID-19 patients, it does not work for patients of pregnant and breastfeeding women, patients younger than 18 years, recordings with data material less than 80% complete, [2] and patients without a blood test within 1 day after the hospital admission according to the definition in the previous study. [1] Second, we have not discussed possible further improvement in predictive accuracy. Whether other feature variables (e.g., variables not shown in Fig. 3 ) should be applied to the ANN model to increase the accuracy rate is worth discussing. In the future, it would be useful to look for other variables that can improve the power of the PMCP model, such as underlying comorbidities. Third, the study was performed using the ANN model. Whether other prediction models not illustrated in Table 1 have higher accuracy than the ANN model has yet to be investigated. Fourth, the study patients in training data were taken from Tongji Hospital, Wuhan, China. [2] The model parameters estimated for PMCP are only suitable for the Chinese COVID-19 patients because geolocation is associated with socioeconomic status, which has been reported to be linked to the difference. [4] Similarly, the testing data were from 3 Korean medical institutions. [1] The results in Table 2 cannot be generalized to other disparate regions. Thus, a generalization of these PMCP findings (e.g., the model parameters) needs to be cautiously done because the sample only included patients with COVID-19 aged ≥18 years in China. Additional studies in other countries are required in the future to re-examine the feature variables that are similar to those used in this study (Supplemental Content 1: Abstract video of this study at https://youtu.be/rBVMehC7TgI). In this study, 1. ANN is performed on MS Excel, 2. an online app is built to display the results using the visual dashboard on Google Maps, and 3. the categorical probability curves based on the Rasch model are combined with the ANN prediction model. The novelty of the app with our ANN algorithm improves the accuracy of predicting PMCP up to AUC = 0.96 in the normalized training data. The integration of this app into the hospital information system is encouraged to the hospital, not limited to Korea as we did in this study. An artificial intelligence model to predict the mortality of COVID-19 patients at hospital admission time using routine blood samples: development and validation of an ensemble model An interpretable mortality prediction model for COVID-19 patients COVID-19 Surveillance Group. Characteristics of COVID-19 Patients Dying in Italy: Report Based on Available Data on COVID-19 Dashboard The computation of case fatality rate for novel coronavirus (COVID-19) based on Bayes theorem: An observational study Geographic risk assessment of COVID-19 transmission using recent data: An observational study Elevated lactate dehydrogenase (LDH) level as an independent risk factor for the severity and mortality of COVID-19 Early coagulation tests predict risk stratification and prognosis of COVID-19 Asymptomatic carrier state, acute respiratory disease, and pneumonia due to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2): Facts and myths World Health Organization declares global emergency: a review of the Machine learning to predict mortality and critical events in a cohort of patients with COVID-19 in New York City: model development and validation Physiological and socioeconomic characteristics predict COVID-19 mortality and resource utilization in Brazil Clinical features of COVID-19 mortality: development and validation of a clinical prediction model Early prediction of mortality risk among patients with severe COVID-19, using machine learning Machine learning for mortality analysis in patients with COVID-19 Medicine (2021) 100:28 www.md-journal Living risk prediction algorithm (QCOVID) for risk of hospital admission and mortality from coronavirus 19 in adults: national derivation and validation cohort study Qualitative and quantitative chest CT parameters as predictors of specific mortality in COVID-19 patients An app for classifying personal mental illness at workplace using fit statistics and convolutional neural networks: survey-based quantitative study An app developed for detecting nurse burnouts using the convolutional neural networks in microsoft excel: population-based questionnaire study An app for detecting bullying of nurses using convolutional neural networks and web-based computerized adaptive testing: development and usability study Generalized linear models The regression analysis of binary sequences Explaining odds ratios An essay towards solving a problem in the doctrine of chances. By the late Rev On the optimality of the simple bayesian classifier under zero-one loss Induction of decision trees Random Forests. Encyclopedia of Machine Learning and Data Mining Random decision forests Stochastic gradient boosting The Alternating Decision Tree Algorithm A Comprehensive Guide to Types of Neural Networks How does normalization impact RNA-seq disease diagnosis? Code, data, and results for this article Facilitating meta-analyses by deriving relative effect and precision estimates for alternative comparisons from a set of estimates presented by exposure level or disease category Significant effects of late evening snack on liver functions in patients with liver cirrhosis: A metaanalysis of randomized controlled trials The WEKA Data Mining Software: An Update Probabilistic Models for Some Intelligence and Attainment Tests Using the forest plot to interpret the feature variables PMCPapp used for input data Lymphopenia in severe coronavirus disease-2019 (COVID-19): systematic review and meta-analysis Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China Clinical evaluation of potential usefulness of serum lactate dehydrogenase (LDH) in 2019 novel coronavirus (COVID-19) pneumonia Hematologic, biochemical and immune biomarker abnormalities associated with severe illness and mortality in coronavirus disease 2019 (COVID-19): A meta-analysis Lactate dehydrogenase levels predict coronavirus disease 2019 (COVID-19) severity and mortality: a pooled analysis Lactate dehydrogenase: An old enzyme reborn as a COVID-19 marker (and not only) C-reactive protein levels in the early stage of COVID-19 Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China China Medical Treatment Expert Group for Covid-19. Clinical characteristics of coronavirus disease 2019 in China C-reactive protein correlates with computed tomographic findings and predicts severe COVID-19 early D-dimer and prothrombin time are the significant indicators of severe COVID-19 and poor prognosis Higher level of neutrophil-tolymphocyte is associated with severe COVID-19 The association of low serum albumin level with severe COVID-19: A systematic review and metaanalysis Elevated interleukin-6 and severe COVID-19: a meta-analysis COVID-19 coagulopathy: an evolving story Abnormal coagulation parameters are associated with poor prognosis in patients with novel coronavirus pneumonia Lupus anticoagulant and abnormal coagulation tests in patients with COVID-19 Epidemiologic and clinical characteristics of 91 hospitalized patients with COVID-19 in Zhejiang, China: a retrospective, multi-centre case series A deep artificial neural network-based model for prediction of underlying cause of death from death certificates: algorithm development and validation Artificial neural network (ANN) Machine learning utilising spectral derivative data improves cellular health classification through hyperspectral infrared spectroscopy Machine learning method using position-specific mutation based classification outperforms one hot coding for disease severity prediction in haemophilia 'A' Prediction of pneumonia 30-day readmissions: a single-center attempt to increase model performance Articles with title in artificial neural network in Pubmed Improving inpatient surveys: web-based computer adaptive testing accessed via mobile phone QR codes Item Response Theory: Applications to Psychological Measurement Predictors and outcomes of unplanned readmission to a different hospital Medicine (2021) 100:28 www.md-journal We thank Enago (www.enago.tw) for the English language review of this manuscript.