key: cord-0942290-biy7s4wg authors: Li, Jiaxia; Wan, Li; Feng, Yuan; Zuo, Huilin; Zhao, Qian; Ren, Jiecheng; Zhang, Xiaochu; Xia, Mingwu title: Laboratory Predictors of COVID-19 Pneumonia in Patients with Mild to Moderate Symptoms date: 2021-06-17 journal: Lab Med DOI: 10.1093/labmed/lmab015 sha: a0e6ef11aed767818a8f626c9ca5d3ce75c8f1ca doc_id: 942290 cord_uid: biy7s4wg OBJECTIVE: This research aims to develop a laboratory model that can accurately distinguish pneumonia from nonpneumonia in patients with COVID-19 and to identify potential protective factors against lung infection. METHODS: We recruited 50 patients diagnosed with COVID-19 infection with or without pneumonia. We selected candidate predictors through group comparison and punitive least absolute shrinkage and selection operator (LASSO) analysis. A stepwise logistic regression model was used to distinguish patients with and without pneumonia. Finally, we used a decision-tree method and randomly selected 50% of the patients 1000 times from the same specimen to verify the effectiveness of the model. RESULTS: We found that the percentage of eosinophils, a high–fluorescence-reticulocyte ratio, and creatinine had better discriminatory power than other factors. Age and underlying diseases were not significant for discrimination. The model correctly discriminated 77.1% of patients. In the final validation step, we observed that the model had an overall predictive rate of 81.3%. CONCLUSION: We developed a laboratory model for COVID-19 pneumonia in patients with mild to moderate symptoms. In the clinical setting, the model will be able to predict and differentiate pneumonia vs nonpneumonia before any lung computed tomography findings. In addition, the percentage of eosinophils, a high–fluorescence-reticulocyte ratio, and creatinine were considered protective factors against lung infection in patients without pneumonia. Since the outbreak of COVID-19 in Wuhan, China, in December 2019, the COVID-19 epidemic has developed rapidly. 1 By April 2020, the epidemic had affected most countries and regions in the world. 2 The disease has caused serious global health and social problems. Patients with COVID-19 with severe symptoms usually die of pneumonia within a short period of time after infection, whereas a small proportion of patients die of other causes. 3 Mild acute respiratory infection symptoms, such as fever, dry cough, and fatigue, usually occur in the early stages of COVID-19, 4 but those who develop acute respiratory distress syndrome, acute respiratory failure, multiple organ failure, and other fatal complications die rapidly. 5 Generally, patients infected with COVID-19 without pneumonia recover, and asymptomatic infection is not life-threatening. 6 However, a specific treatment method for COVID-19 has not been fully developed. 7 Because of decreased immunity and underlying diseases, 8 the symptoms and mortality associated with COVID-19 in older adults are more serious. 9 Older adult patients are more susceptible to viral infections and death 10 and have more underlying diseases, such as hypertension, hyperlipidemia, diabetes, and rheumatoid arthritis. [10] [11] [12] However, it is not clear whether age and underlying disease can predict pneumonia. Current research on COVID-19 has focused on the epidemiology and clinical characteristics of patients, but information on the susceptibility to pneumonia has not been clear. Pneumonia clearly plays a vital role in the prognosis of COVID-19. Therefore, we were committed to finding a way to identify whether a patient was susceptible to pneumonia before a chest computed tomography (CT) scan or before symptoms of pneumonia appear. Quickly identifying such patients will help prevent more serious cases of infection. It is a way to fight the death caused by COVID-19 infection. This research aimed to develop a model that can accurately distinguish patients with pneumonia from those without pneumonia in patients with COVID-19 and determine the factors that are significant in fighting infections in the lungs. The study investigated a group of patients at Hefei Second People's Hospital in China. Herein, we report our epidemiological, clinical, radiological, and laboratory examination results. We enrolled 50 patients who were diagnosed with COVID-19 at Hefei Second People's Hospital and Affiliated Hospital of Anhui Medical University. The inclusion criteria were as follows: patients with a confirmed diagnosis of COVID-19, patients with mild to moderate symptoms, men or women, patients with or without underlying diseases, and patients without any signs of death, including any symptoms of acute respiratory distress and/or failure of any organ. Patients with mild to moderate symptoms were defined by fever, fatigue, and smell and taste disorders, with or without respiratory symptoms (ie, coughing, sputum, and lung CT or X-ray showing pneumonia). They were managed in the hospital from January 2020 to April 2020, and as the final outcome all patients were discharged from the hospital. Patients were defined as having COVID-19 infection if they met any of the following criteria: (i) respiratory tract or blood specimens that were positive for SARS-CoV-2 per nucleic acid test using real-time fluorescent reverse-transcription polymerase chain reaction (RT-PCR) and/or (ii) through the SARS-CoV-2 gene sequencing method, a new virus found in the respiratory tract or blood specimens that was highly homologous to COVID-19. These data were used to construct a predictive model of pneumonia. The research was approved by the Institutional Review Board of Anhui Medical University and complied with the Declaration of Helsinki. Patients were told verbally that their data would be used for medical research anonymously. After obtaining their permission, we collected written informed consent. Previous medical history, age, and symptoms (fever, fatigue, smell and taste disorders, and respiratory symptoms) were recorded daily by resident physicians and nurses. The laboratory data after hospital admission were collected. Routine blood tests were performed multiple times over the course of the disease, and they were a part of the patient's standard care and testing. The blood test used 2 mL of blood from the cubital vein of the patient, and the blood was stored in an EDTA-dipotassium anticoagulation tube. Routine blood tests were completed using an automatic blood analyzer (Hitachi 7600 automatic analyser, Japan) and electrical impedance methods. The items obtained included the number of red blood cells, hemoglobin, white blood cells, platelet counts, absolute lymphocyte values, absolute intermediate cells, absolute neutrophils, lymphocyte percentage, intermediate cell percentage, percentage of neutral granulocytes, hematocrit, average red blood cell volume, average red blood cell hemoglobin, average hemoglobin concentration, red blood cell distribution width, average platelet volume, platelet distribution width, and platelet hematology. For blood biochemistry indexes, 4 mL of venous blood was drawn, and the supernatant was removed and put into an automatic biochemical instrument (Hitachi, Japan) for analysis. Enzyme assay was used to obtain the reaction rate data for enzyme kinetic analysis, which was used to test the myocardial enzyme spectrum and adenosine dehydrogenase. Total bile acid was detected using an enzymatic cycle method. γ-glutamyltransferase was measured using the gamma-glutamyl-p-nitroanilide (GPNA) substrate method. Carbon dioxide was measured using a phosphoenolpyruvate carboxylase (PEPC) enzymatic method. Uric acid was measured using the oxidase method. Apolipoprotein A1 and apolipoprotein B were measured using the immunoturbidimetric method. Inorganic phosphorus was measured using the phosphomolybdate method. High-density lipoprotein (HDL) cholesterol and C-reactive protein were measured using the immunoturbidimetric method. Magnesium was measured using the xylene blue method. Creatinine was measured using the sarcosine oxidase method. Urea was measured using the ureaseglutamate dehydrogenase method. Triglycerides were measured using the glycerol phosphate oxidase (GPO-PAP) method. Cystatin C was measured using a latex-enhanced immunoturbidimetric method. Total bile acid was measured using an enzyme cycle method. Total bilirubin was determined by using the vanadate oxidation method. Albumin was determined by using the bromocresol green method. Total protein was determined by using the biuret method. Hypersensitive C-reactive protein was determined by using immunofluorescence chromatography. Using the Siemens Somatom definition 64-row spiral CT scan, with patients in the supine position and the head at an incline, the researchers told patients to hold their breath during the scan. The scanning range was from the top to the bottom of both lungs and the cross-sectional area. The scanning parameters were as follows: tube voltage 120 kV, tube current 320 mA, matrix 512 × 512, layer thickness and layer moment 5 mm. 13 For data analyses, we selected the examination data taken at the most severe time in the course of the disease (that is, when the patient's self-reported symptoms, including fever, cough, chest pain, and/or muscle weakness, were the most severe). The baseline demographic and clinical characteristics of all participants at the time of enrollment are presented as continuous variables and categorical variables ( Table 1 ). The CT scan results divided all patients into 2 groups: patients with pneumonia and patients without pneumonia. The χ 2 test, 1-way analysis of variance, and Kruskal-Wallis tests were used to analyze the differences in these variables between the 2 groups. We used SPSS version 24 was used for statistical analyses, and G-power was used to determine whether each step in the statistics reached sufficient power. In the initial factor selection step, we selected potential predictors by comparing the 2 groups, and the factors showing group differences were considered candidate predictors. In the second factor selection step, regularized regression with least absolute shrinkage and selection operator (LASSO) variable selection was used. 14 The LASSO penalization selected important predictors by shrinking the coefficients of weaker predictors to zero and excluded predictors with estimated zero coefficients from the final sparse prediction model. To avoid model overfitting in the training samples, the variable selection used 10-fold cross-validation 15 to select the best adjustment or penalty level, which was measured by the Bayesian information criterion. 16 In the model development step, a stepwise logistic regression generalized an estimating equation, with the factors selected above entered as independent variables and the CT scan results entered as dependent variables. In the final validation step, a decision tree with the growth method of Chi-square automatic interaction detector (CHAID) and split sample tests were conducted, and random allocation of the whole sample (including all patients) was 50% for training and 50% for test sampling for 1000 times. The logistic model derived from the above was applied in the 50% randomly selected validation sample to calculate the predicted probabilities for each patient. All patients were actively treated after admission, with daily droplet isolation, contact isolation, and routine care for Class A infectious diseases. The daily medication regimen of patients included lopinavir/ritonavir tablets 1000 mg (500 mg tablet × 2), 2 mL nebulized saline by inhalation, injection of 5 million units of recombinant human interferon alpha 2b, Chinese medicine decoction, vitamin C tablet 0.2 g, arbidol tablet 0.2 g, thymosin enteric-coated tablet 15 mg, and chloroquine hydrogen sulfate tablet 300 mg. Patient symptoms all improved after 4 to 34 days of treatment, and all patients were discharged home from the hospital. The oropharyngeal swab nucleic acid test was assessed by the Centers for Disease Control and Prevention in Yaohai District, Hefei City. By using a fluorescent PCR method, a QIAamp Viral RNA Mini Kit (Qiagen, Hilden, Germany) was used to extract RNA from the specimens from each patient into 50 µL of eluate. Forward (5′-CCT ACTAAATTAAATGATCTCTGCTTTACT-3′) and reverse (5′-CAAGCTATAACGCAGCCTGTA-3′) primers targeted the S gene of COVID- 19 Routine urine analysis was completed using urine dip strips and a dry chemical method, along with a urine analyzer and colorimetry. The main parameters evaluated in the urine included urine color, urine pH, urine specific gravity, qualitative protein, and microscopic examination. Routine fecal examination was the direct microscopic examination of slides under a microscope. The main parameters analyzed in feces included stool color and hardness, mucus, and a microscopic examination of the stool for helminthiasis (hookworm eggs, roundworm eggs, and whipworm eggs) and other parasites. Among 50 patients with COVID-19 with mild to moderate symptoms, 26 had positive signs on lung CT leading to a COVID-19 pneumonia diagnosis, and 24 were diagnosed with COVID-19 infection without pneumonia. Twenty-three patients had underlying diseases before they entered the hospital, including hypertension (n = 10), hyperlipidemia (10), diabetes (5), rheumatoid arthritis (1), chronic bronchitis with emphysema (2), cervical spondylosis (1), femoral head necrosis (1), cerebral infarction (1), chronic hypothyroidism (1), fatty liver (1), and chronic superficial gastritis (2); in addition, 1 patient was pregnant and another was breastfeeding. The demographic characteristics of the two groups are shown in Table 1 . Patients with pneumonia (ages 50.42 ± 14.18 years) were older than the patients without pneumonia (ages 40.25 ± 18.32 years) (P < 0.05). Otherwise, the 2 groups were similar regarding the time of hospital stay, sex composition, ratio of underlying disease, and ratio of family cluster outbreaks. In the initial factor selection step, we found that patients without pneumonia had a greater number of lymphocytes e107 and eosinophils, a higher percentage of eosinophils, a greater number of basophils, a greater absolute value of reticulocytes, a higher percentage of reticulocytes, and a higher ratio of high-fluorescent reticulocytes than patients with pneumonia, P <.05 ( Table 2) . G-power analyses determined that the effect size of these factors between the 2 groups ranged from 0.63 to 1.28, and the power of each comparison ranged from 0.49 to 0.97. Patients without pneumonia also had higher levels of calcium, lower creatinine, a higher urea/creatinine ratio, lower globulin, a higher albumin/globulin ratio, lower creatine kinase, lower lactate dehydrogenase, a higher HDL/cholesterol ratio, higher pre-albumin, higher apolipoprotein A1, and lower serum amyloid A (SAA) levels in blood than patients with pneumonia (P <.05; Table 2 ). G-power analyses determined that the effect size of these factors between the 2 groups ranged from 0.75 to 0.93, and the power of each comparison ranged from 0.60 to 0.75. In the second factor selection step, we observed that the percentage of eosinophils, the absolute value of reticulocytes, the highfluorescence-reticulocyte ratio, creatinine, the albumin/ globulin ratio, and lactate dehydrogenase survived the LASSO penalty and were selected as potential predictive factors (see Tables 3 and 4) . With a backward stepwise logistic regression model (see Table 5 ), age, the percentage of eosinophils, the absolute value of reticulocytes, the high-fluorescence-reticulocyte ratio, creatinine, the albumin/globulin ratio, and lactate dehydrogenase were entered as independent variables, and the pneumonia group was entered as the dependent variable. We found that the percentage of eosinophils, the highfluorescence-reticulocyte ratio, and creatinine had better discriminatory power than the other factors. The predictive rate of the model was 77.1%, with χ 2 = 35.25, Cox-Snell R 2 = .52, and P <.05. In the final validation step, a decision tree with the growth method of CHAID and split sample (ie, all patients) tests were conducted. The logistic model derived from above was applied in the 50% randomly selected validation sample to calculate predicted probabilities for each patient for 1000 times. We observed that the validation samples had an overall predictive rate of 81.3%. The study included 50 patients with mild to moderate symptoms of COVID-19. We have established a laboratory model that can predict pneumonia through readily available laboratory measures. The model showed good discrimination, and external verification was satisfactory. This is the first study to determine the important factors that may fight COVID-19 pneumonia or potential protective factors in patients with mild to moderate symptoms. Generally, age and underlying disease are regarded as risk factors for COVID-19 pneumonia. Older adults are generally at higher risk of chronic diseases and are more susceptible to infection. 17 Age is a risk factor for poor prognosis in patients with COVID-19, partly because age-related immune dysfunction is caused by low-grade chronic inflammation. 18 In addition, older adult patients may also have other risk factors, such as comorbidities and sarcopenia. 19, 20 For example, a history of hypertension is an important risk indicator of the MuLBSTA score, a viral pneumonia death warning model. 21 Hypertension has been found to be a predictor of death in patients with COVID-19. 18 However, in this study, neither age nor underlying disease was an important factor in distinguishing the pneumonia group. Although the age of patients with pneumonia was relatively higher than for those without pneumonia, this finding may have been because our patients had mild to moderate symptoms rather than severe symptoms, different from previous studies. With the predictive model, we found that the percentage of eosinophils, the high-fluorescence-reticulocyte ratio, and creatinine in the blood were good predictors or discriminators for patients with and without pneumonia. Patients without pneumonia had a higher percentage of eosinophils, a greater high-fluorescence-reticulocyte ratio, and lower creatinine levels than patients with pneumonia. Eosinophils are produced by bone marrow stem cells and account for 1% to 5% of the total number of white blood cells in the blood. 22 Eosinophils play an important role in adaptive immune function, specifically resisting viruses. 23 As immune modulators, eosinophils are not only associated with the effector arm of adaptive immunity but also trigger a polarized adaptive response process. 24 In diseases caused by respiratory viruses, the blood and immune organs are severely damaged 25 and viruses directly inhibit the proliferation of bone marrow cells. Some clinical studies have found that viral infections cause a decrease in the percentage of eosinophils in routine blood tests. 26 The relatively higher percentage of eosinophils in our patients without pneumonia may indicate better adaptive immunity and a better polarized adaptive response process, which helped these patients avoid lung infection. Studies have shown that a variety of viral infections cause bone marrow hematopoietic arrest and inhibit bone marrow cell proliferation. [27] [28] [29] After the virus invades, it binds to red blood cell membrane proteins and damages red blood cell production, resulting in a decrease in the number of red blood cell lines and reticulocytes in the blood. 30 The number of reticulocytes has been used to clinically judge the severity of viral infections and the hematopoietic function of bone marrow. 31 Viral infection in this study was associated with decreased reticulocytes, and the relatively higher level of the high-fluorescence-reticulocyte ratio indicated better bone marrow cell proliferation, which was also a factor that helped patients without pneumonia avoid lung infection. The abnormal levels of urea, creatinine, and the urea/creatinine ratio were clinically indicative of impaired renal function. 32, 33 In addition to damaging the respiratory system, COVID-19 has also been found to harm the kidneys and liver. 34 One study showed that approximately 3% to 10% of patients with COVID-19 had abnormal renal function, including a significant increase in creatinine and/or blood urea nitrogen. 35 In this study, patients without pneumonia had a lower level of creatinine, indicating a better kidney function. The laboratory model showed good discriminatory power with a predictive rate of 77.1%, a sensitivity of 100.00%, and a negative predictive value of 100.00%. The validation samples (ie, patients) had an overall predictive rate of 81.3%. Among the patients with mild to moderate symptoms of COVID-19, age and underlying diseases were not significant in the discrimination between the pneumonia and nonpneumonia groups. The percentage of eosinophils, the high-fluorescence-reticulocyte ratio, and creatinine in the blood were good discriminators between the 2 groups. Patients without pneumonia had a higher percentage of eosinophils, a greater high-fluorescence-reticulocyte ratio, and lower creatinine levels than patients with pneumonia. The relatively higher percentage of eosinophils may indicate better adaptive immunity and a better polarized adaptive response process. The relatively higher level of the high-fluorescence-reticulocyte ratio indicated better bone marrow cell proliferation, and the lower level of creatinine may have indicated better kidney function. These factors may be protective factors for patients without pneumonia against lung infection. LM A koronavírus okozta COVID-19-pandémia. Korábbi tapasztalatok és tudományos evidenciák 2020. március végén Mathematical modeling of the spread of the coronavirus disease 2019 (COVID-19) taking into account the undetected infections. The case of China Case-fatality rate and characteristics of patients dying in relation to COVID-19 in Italy Cardiac involvement in a patient with coronavirus disease 2019 (COVID-19) Risk factors associated with acute respiratory distress syndrome and death in patients with coronavirus disease 2019 pneumonia in Wuhan, China Auxiliary role of mesenchymal stem cells as regenerative medicine soldiers to attenuate inflammatory processes of severe acute respiratory infections caused by COVID-19 Research and development on therapeutic agents and vaccines for COVID-19 and related human coronavirus diseases Systems biological assessment of immunity to mild versus severe COVID-19 infection in humans Age-related morbidity and mortality among patients with COVID-19 The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak Clinical features of patients infected with 2019 novel coronavirus in Wuhan Attempted clinico-topographic correlation in aphasias caused by non-thalamic subcortical lesions The clinical and chest CT features associated with severe and critical COVID-19 pneumonia Regression shrinkage and selection via the lasso The Elements of Statistical Learning Bayes factors Prevalence of comorbidities and its effects in patients infected with SARS-CoV-2: a systematic review and meta-analysis Clinical and laboratory predictors of inhospital mortality in patients with COVID-19: a cohort study in Wuhan, China Prevalence and comorbidity of insomnia and effect on functioning in elderly populations Frequency of sarcopenia and associated factors among hospitalized elderly patients Clinical features predicting mortality risk in patients with viral pneumonia: the MuLBSTA score Eosinophils in innate immunity: an evolving story The eosinophil in infection Viral infections: beneficial role of eosinophils The role of peripheral blood eosinophil counts in COVID-19 patients HIV-1 infection of bone marrow hematopoietic progenitor cells and their role in trafficking and viral dissemination Infant hypervitaminosis A causes severe anemia and thrombocytopenia: evidence of a retinol-dependent bone marrow cell growth inhibition Mechanisms of dengue virus-induced bone marrow suppression A case report of cytomegalovirus infection in a patient with pure red cell apiasis A summary of the diagnostic and prognostic value of hemocytometry markers in COVID-19 patients The elevated serum urea:creatinine ratio in canine babesiosis in South Africa is not of renal origin Correlation between selected prognostic factors and postoperative course in liver transplant recipients Interleukin-6 use in COVID-19 pneumonia related macrophage activation syndrome Relation between chest CT findings and clinical conditions of coronavirus disease (COVID-19) pneumonia: a multicenter study We thank Anhui Medical University for giving ethical approval. We are also grateful for the laboratory personnel working in all targeted laboratories for giving permission and participation in the data collection activities. The original data for this study are available from the corresponding authors upon request. The study was ethically approved by the Institutional Review Committee of Anhui Medical University, and official permission was obtained from all participating health facilities. Data from each facility were presented anonymously using code. All discordant results were also communicated to the respective laboratories. The study was supported by the fund of Anhui Province Key Research and Development Plan Project (1804h08020233). The authors report no biomedical financial interests or potential conflicts of interest.