key: cord-0910954-1izmo6v0 authors: Spinato, Giacomo; Fabbris, Cristoforo; Conte, Federica; Menegaldo, Anna; Franz, Leonardo; Gaudioso, Piergiorgio; Cinetto, Francesco; Agostini, Carlo; Costantini, Giulio; Boscolo‐Rizzo, Paolo title: COVID‐Q: Validation of the first COVID‐19 questionnaire based on patient‐rated symptom gravity date: 2021-09-22 journal: Int J Clin Pract DOI: 10.1111/ijcp.14829 sha: ca68a549d48fa762e1b17c76320bd710b086d93f doc_id: 910954 cord_uid: 1izmo6v0 OBJECTIVES: The aim of the present study was to develop and validate the CoronaVirus‐Disease‐2019 (COVID‐19) Questionnaire (COVID‐Q), a novel symptom questionnaire specific for COVID‐19 patients, to provide a comprehensive evaluation that may be helpful for physicians, and evaluate the questionnaire's performance in identifying subjects at higher risk of testing positive. MATERIALS AND METHODS: Consecutive non‐hospitalised adults who underwent nasopharyngeal‐throat swab for severe‐acute‐respiratory‐syndrome‐coronavirus‐2 (SARS‐CoV‐2) detection at Treviso Hospital in March 2020, were enrolled. Subjects were divided into positive (cases) and negative (controls). All subjects answered the COVID‐Q. Patients not able to answer COVID‐Q because of clinical conditions were excluded. Parallel Analysis and Principal Component Analysis identified items measuring the same dimension. The Item Response Theory (IRT)‐based analyses evaluated the functioning of item categories, the presence of clusters of local dependence among items, item fit within the model and model fit to the data. RESULTS: Answers obtained from 230 cases (113 males; mean age 55 years, range 20‐99) and 230 controls (61 males; mean age 46 years, range 21‐89) were analysed. Six components were extracted with parallel analysis: asthenia, influenza‐like symptoms, ear and nose symptoms, breathing issues, throat symptoms, and anosmia/ageusia. The final IRT models retained 27 items as significant for symptom assessment. The total questionnaire's score was significantly associated with positivity to the molecular test: subjects with multiple symptoms were more likely to be affected (P < .001). Older age, male gender presence of breathing issues and anosmia/ageusia were significantly related to positivity (P < .001). Comorbidities had not a significant association with the COVID‐19 diagnosis. CONCLUSION: COVID‐Q could be validated since the evaluated aspects were overall significantly related to infection. The application of the questionnaire to clinical practice may help to identify subjects who are likely to be affected by COVID‐19 and address them to a nasopharyngeal swab in order to achieve an early diagnosis. COVID-19 is an emerging infectious disease because of SARS-CoV-2, whose worldwide outbreak led, during late 2019 and during 2020, to the current pandemic condition. [1] [2] [3] The clinical presentation of COVID-19 ranges from the complete absence of symptoms to a severe acute respiratory distress syndrome, potentially leading to death. 1, 4, 5 In symptomatic patients, the most common presentation includes fever, myalgia and fatigue, associated with upper and lower respiratory tract involvement, resulting in nasal congestion, sore throat, anosmia, ageusia, dry cough and dyspnoea. 4 In addition to these respiratory symptoms, gastrointestinal, 6 cardiovascular 7 and neurological 8, 9 manifestations have also been described. Among the upper airways symptoms, the new onset of an altered sense of smell or taste has been regarded with particular interest as a clinical hallmark of early SARS-CoV-2 infection, potentially able to discriminate subjects with COVID-19 from those with other types of acute respiratory tract infections. [10] [11] [12] [13] [14] [15] [16] [17] Given the critical importance of a comprehensive assessment of the severity of COVID-19 clinical presentation, risk factors and comorbidities useful to characterise patients who will develop a severe COVID-19 have been discussed. 18 On the other hand, currently there is a lack of validated and standardised means to evaluate symptoms. Clinical evaluation models based on patient-reported outcomes (PROs) have been regarded with increasing interest, in particular in the field of pneumology, to evaluate conditions such as asthma, pneumonia, and chronic obstructive pulmonary disease (COPD). [19] [20] [21] [22] To date, only a PRO questionnaire, with adequate content validity and psychometric properties, has been developed and validated to measure the severity, functional impact and response to therapy of acute respiratory tract infections in adults. 23 Such reporting tool, the Acute Respiratory Tract Infections Questionnaire (ARTIQ), 23 resulted to be effective in discriminating between participants with and without acute respiratory tract infections. To the best of our knowledge, no standardised tool has been proposed yet for a uniform and comprehensive evaluation and reporting of symptoms in COVID-19 patients. The present study aimed to develop and validate the COVID-Q, a novel questionnaire specific for COVID-19 patients in order to provide a comprehensive and standard clinical evaluation. A secondary goal of the present study was to evaluate the performance of the questionnaire in identifying subjects at higher risk of being tested positive for COVID-19. This study was approved by the Ethical Committees of the Academic Hospital of Treviso (Italy) and the Hospital of Belluno (Italy), and it was conducted in accordance with the principles of the Helsinki Declaration. Data were examined in compliance with Italian privacy and sensitive data laws. All participants gave their consent to be enrolled in this investigation and to the treatment of their personal data for scientific purposes. The study was conducted in the Hospital of Treviso. All patients both in the case and the control group answered the COVID-Q. The COVID-Q included items focused on general, ENT, respiratory and gastrointestinal symptoms, as well as psychological symptoms, drug assumption and changes in daily activity. Such items in part belonged to the ARTIQ 23 and the SNOT-22 24 questionnaires, and the remaining were developed ex novo by us to better characterise symptoms and clinical conditions specific to COVID-19, based on the data available from the Atlanta CDC guidelines (https://www. cdc.gov/coron aviru s/2019-ncov/sympt oms-testi ng/sympt oms. html). All items based on the ARTIQ were scored on a three-point scale (0 = none, 1 = a little, 2 = a lot) in line with the ARTIQ itself, while the questions focused on anosmia and ageusia were scored on a scale from 0 (none) to 5 (complete loss of smell or taste), is derived from the SNOT-22. An additional section of the questionnaire focused on demographic data, including age, sex, smoking status (0 = never, 1 = former smoker, 2 = current smoker) and comorbidities (0 = absent, 1 = present). All subjects in both groups answered the questionnaire via a telephone interview, which took place within 19 days from the first diagnosis in cases and within 43 days from the last negative test in controls. Consecutive subjects who underwent nasopharyngeal and throat swabs for SARS-CoV-2 detection were included in the study. People were tested in the hospital of Treviso from 4 to 24 March 2020. This paper shows the COVID-Q, a novel symptom questionnaire specific for COVID-19 patients. The aim is to provide a comprehensive evaluation that may be helpful to clinicians in order to suspect SARS-CoV-2 infection or not. Tests of SARS-CoV-2 RNA by polymerase chain reaction on nasopharyngeal and throat swabs were performed according to the World Health Organization recommendation (https://www.who.int/ emerg encie s/disea ses/novel -coron aviru s-2019/techn ical-guida ncepubli catio ns?publi catio ntype s=01bc7 99c-b461-4a52-8c7d-294c8 4cd7b2d). Reasons for being tested were symptoms, close contact with affected individuals or working in public health services. Non-hospitalised adults (≥18 years) were considered eligible for the study. The whole sample was split into two arms: "cases" who tested positive for SARS-CoV-2 infection and "controls" who tested negative. The inclusion criteria were the consent to take part in the study and the positive test for COVID-19 (for inclusion in the cases group). The main exclusion criteria were clinical conditions leading to the inability to answer the COVID-Q (severe respiratory syndrome, hospitalisation in the intensive care unit, cognitive impairment). All subjects in both groups answered the questionnaire via a telephone interview. The statistical analyses were performed only on 41 of the items from the ARTIQ and SNOT-22, since the remaining regarded selfadministrated drugs and social activities, which were not taken into account because of the peculiar circumstances of data collection. Indeed, participants were tested during the first lockdown phase in Italy, when social and work activities had been heavily restricted and people were advised not to take any medication without consulting with their physician. Item response theory (IRT) models for polytomous items were fit on each of the components identified by the PCA. We first compared the fit of a freely estimated Generalized Partial Credit Model (GPCM) 25 to a common-slope GPCM, in which the slopes (ie, discrimination parameters) of all items are constrained to be equal. We evaluated the two alternatives according to the Log-Likelihood value and to the penalised-likelihood AIC 26 and BIC 27 criteria, which apply penalties for a greater number of free parameters/model complexity. Higher values of Log-likelihood and lower values of AIC and BIC indicate a better fitting and more parsimonious model. The IRT-based analyses evaluated the functioning of item categories, the presence of clusters of local dependence among items, item fit within the model and model fit to the data. Items were selected through an iterative procedure. Items for which the three response categories did not function as intended were recoded into two categories: 0 = absence of symptom, 1 = presence of symptom. Local dependence was measured by the Q3 index: residual correlations between items were compared with the average residual correlation within the dataset, and the greatest deviances were taken to suggest possible item redundancy or underlying dimensions. Item fit was based on the scaled version of the chi-square statistic χ 2 *: P-values smaller than .05 were considered as indicating significant misfit and led to the exclusion of the item. Finally, the omnibus fit was evaluated by the M2* limitedinformation test and was considered acceptable when P values were above the .05 threshold. When needed, the DETECT method in package sirt 28 was used to further investigate and clarify patterns of item clustering. The items retained in the IRT models were then used to produce scale and total scores, computed as raw sums, and represented the final validated version of the questionnaire. Item selection was based on the above-illustrated analyses, but also informed by current literature on COVID-19. For instance, we chose to retain especially frequent or characteristic symptoms of the disease, such as muscle or body aches, loss of taste or smell (https:// www.cdc.gov/coron aviru s/2019-ncov/sympt oms-testi ng/sympt oms.html). We estimated the extent to which self-reported symptoms could help identify COVID-19 by means of two different logistic regression models. In both models, the diagnosis (1 = positive, 0 = negative) was the dependent variable, and age, sex, smoking status and presence of comorbidities, mean-centred, were used as covariates. The predictor of interest in the first model was the total questionnaire score. In the second model, the predictors of interest were the scores on each of the scales identified through the PCA and IRT analyses. By design, affected individuals were over-represented in our sample compared with the population: this was taken into account by applying the correction for rare events suggested by King and Zeng 29 and implemented in the relogit module of the R package Zelig. 30 Public-domain data for the Veneto region provided by the Italian Civil Protection Agency (updated to 16 May 2020; https://github. com/pcm-dpc/COVID -19/tree/maste r/dati-regioni) indicated that the proportion of positive cases over swabs was 3.8%. About 17% of positive patients were hospitalised with symptoms, whereas 83% were not hospitalised. Thus, we estimated that, given a swab, the probability of testing positive without being hospitalised was about 3.1%. We used this as the estimate of the tau parameter for the rareevent logistic regression. 34 The discriminative ability of the model was evaluated by the c-statistic. Finally, we identified questionnaire scores associated with specific risk thresholds for COVID-19. A sample of 460 participants (175 males and 285 females; mean age =51 years, range 20-99 years) was recruited. Among participants, there were 230 cases (113 males and 117 females; mean age 55 years, range 20-99 years) and 230 controls (61 males and 169 females; mean age 46 years, range 21-89). The telephone interview took place within 19 days from the first diagnosis in cases and within 43 days from the last negative test in controls. The latter time frame resulted to be reliable because, being the controls negative, there was no evolution in their clinical condition, which was comparable with the moment of the first swab. The prevalence of smoke and comorbidities in cases and controls is shown in Table S1 . An unadjusted negative binomial regression analysis, with the total count of comorbidities as outcome and the group (ie, cases or controls) as a predictor, revealed that the number of comorbidities per person was significantly higher among cases (β = 0.659, P < .001). Chi-squared tests (or, when the number of observed comorbidities in one or both groups was 10 or lower, Fisher's exact tests) showed that hypertension, diabetes, and cardiovascular diseases were significantly more frequent among cases, compared with controls (P < .001, P = .001 and P = .043, respectively). No significant differences in the distribution of the remaining comorbidities or in the proportion of smokers emerged between the two groups. Liver diseases were excluded from all analyses, having been observed in only one case. Parallel analysis led to the extraction of five components, which corresponded to as many clinical presentation patterns: asthenia (including fatigue, sleep disorders and depression mood), influenza-like symptoms (including fever, stomach ache and headache), ear and nose symptoms, breathing issues and throat symptoms. Two symptom classes (chest pain and anosmia/ageusia), did not load clearly on any of the principal components. Chest pain was therefore excluded from further analyses, while the symptom of anosmia/ageusia, given its relevance to COVID-19 symptomatology, was retained as a single item, separated from the other five symptom classes. The five detected components shared significant weak to moderate correlations (rs from 0.15 to 0.52) and together accounted for 71.05% of the observed variance. The unconstrained GPCM for Asthenia, Influenza, Breathing issues and Throat symptoms showed the best fit to their relative data, according to AIC, BIC and Log Likelihood. Only in the case of ear/nose symptoms, the fixed-slope (ie, 1PL) GPCM model showed a comparatively better fit to the data and thus was preferred. The comparative statistics of the full and constrained models are presented in Table S2 . Of the original nine items regarding asthenia, items "vertigo" and "felt dizzy" were removed because of a significant misfit to the model. Examination of the M2* index showed that the model estimated on the remaining seven items did not fit the data adequately. The Q3 index showed a high residual correlation (r = 0.51)-compared with the dataset's average (r = −0.12)-between item "been in a bad mood" and "been irritable," suggesting the presence of a secondary dimension associated with mood alterations. Once these items were removed, leaving a final set of five items, the P-value of the M2* index was greater than .05, indicating that a good model-data fit was achieved. The items regarding influenza-like symptoms did not fit a single GPCM model. Further examination using the DETECT method indicated the presence of a "fever" cluster (including only three items: "feeling feverish," "sweat" and "chills"), distinct from the remaining 5 items, which assessed more specifically gastro-intestinal issues. We decided to consider these two clusters separately. The IRT model of the "gastro-intestinal" cluster led to dichotomising item "vomiting" and excluding item "loss of appetite" for the misfit, thus resulting in a four-item scale. As a result of the very low number of items, we did not fit an IRT model to the fever cluster. Regarding ear and nose symptoms, despite no significant item misfit or problematic category functioning, the initial model did not adequately fit the data, according to the M2* index. Examination of residual correlations showed high residuals between items "muscle pain" and "joint pain" and between items "painful pressure in the ears" and "painful sinuses" (residual r = 0.40 for a mean residual correlation of r = −0.12 and residual r = 0.08, for a mean residual correlation of r = −0.15, respectively). We therefore excluded items "joint pain" and "painful pressure in the ears" to reduce redundancy. Item "muscle pain" no longer fit the reduced model and was excluded in turn. However, since "muscle or body aches" are indicated as one of the most common symptoms of COVID-19, we decided to retain it as a single item next to the final five items assessing ear and nose symptoms. Regarding breathing issues, item "difficulty in thinking clearly" and "difficulty in going about your daily business" were removed because of misfit, whereas item "being so unwell you had to stay in bed" was removed because of high residual correlations with item "felt tired" (r = 0.48, mean residual correlation of the model r = −0.13). Item "coughing up mucus" was instead dichotomised to improve the functioning of answer categories. The final model included six items. All five items associated with throat symptoms fit the model well, and the model offered a good representation of the data. The final IRT models retained 27 items: 5 assessing asthenia, 3 assessing fever, 4 assessing gastro-intestinal symptoms, 5 assessing nose symptoms, 6 assessing breathing problems and 4 assessing throat symptoms. Items "coughing up mucus" and "vomiting" were recoded as dichotomous (0 = symptom absent. 1 = symptom present). Items "anosmia/ageusia" and "muscle pain" were not included in any of the models but were kept as single items because of their relevance in diagnosing COVID-19. Table 1 shows item parameters and item fit index for the final models. Table 2 shows model fit indices. Cronbach's alpha was from 0.69 to 0.92 for all subscales and for the total scale. Table 3 reports the questionnaire in its final validated form, which contains only the items retained in the IRT models. Table S3 reports descriptive statistics for variables used in the logistic linear regressions. The effects of the total questionnaire score in predicting COVID-19 diagnosis are reported in Table S4 shows the results from the model examining individual scale scores (asthenia, gastrointestinal symptoms, fever, ear/nose symptoms, breathing issues, throat symptoms, anosmia/ageusia, and muscle pain), which is shown in Table S5 . The total score on the questionnaire was significantly associated with positivity at the molecular test for SARS-CoV-2: as expected, individuals displaying a larger number of symptoms or experiencing them more often, as reported in the questionnaire, were significantly more likely to be affected by COVID-19 (exp(β) = 1.203, P < .001). Note: P, P-values associated with the scaled χ 2 statistic. Abbreviations: Discrimination, ability of the item to discriminate between individuals with different symptom frequencies; S_ χ 2 , scaled χ 2 statistic indicating item-model fit; Threshold 1, symptom frequency value that marks the cutoff between adjacent answer categories: 0 = "no symptom" and 1 = "I experience this symptom a little;" Threshold 2, cutoff value between category 1 = "I experience this symptom a little" and 2 = "I experience this symptom a lot". a Dichotomous item: 0 = symptom absent, 1 = symptom present. Table 4 (c = 92.69%). According to the analyses of the present study, six subscales and two individual items, each accounting for a separate dimension of COVID-19 symptomatology, were identified in order to validate the novel questionnaire. Such questionnaire presented some analogy with the ARTIQ, 23 although some clinical categories of the latter, such as "ENT" and the "influenza" symptoms, in our novel assessment model, turned into more homogeneous components. Besides providing a reliable tool for assessing and reporting symptoms, the COVID-Q also provided a quantitative score which appeared to correlate with the risk of testing positive for SARS-CoV-2 infection. In particular, higher scores on the "breathing issues" subscale (ie, "dry cough," "coughing up mucus," "problems with breathing," "wheezing," "shortness of breath" and "felt tired") and on the anosmia/ageusia item (ie, "loss of smell or taste") were significantly associated with a higher chance of a positive SARS-CoV-2 test. Many other available studies on COVID-19 were focused on patientreported symptoms, also collected via non-conventional means of clinical assessment, including tele-medicine 33 and the use of social networks. 34 However, to date, a comprehensive and specific tool to collect and scale symptoms from COVID-19 patients has not been described yet. The COVID-Q may potentially fill this gap, providing a validated and standardised assessment tool to support clinical data collection, also in the absence of a direct contact with the patient, such as in the tele-medicine field. Although this study also provided score thresholds to define patients at higher risk of testing positive for the SARS-CoV-2 infection, the COVID-Q should not be used as a diagnostic tool. Rather, its application to those who did not receive a microbiological test may help to discriminate COVID-related symptoms, potentially aiming to identify subjects who should be tested. To Practitioners to suggest self-isolation. Asthenia "Being awake most of the night" 0 (none) 1 (a little) 2 (a lot) "Difficulty falling asleep" 0 (none) 1 (a little) 2 (a lot) "Waking up several times at night" 0 (none) 1 (a little) 2 (a lot) "Poor quality of sleeping" 0 (none) 1 (a little) 2 (a lot) "Not feeling yourself" 0 (none) 1 (a little) 2 (a lot) Gastrointestinal symptoms "Vomit" 0 (absent) 1 (present) "Nausea" 0 (none) 1 (a little) 2 (a lot) "Diarrhoea" 0 (none) 1 (a little) 2 (a lot) "Abdominal pain" 0 (none) 1 (a little) 2 (a lot) Fever "Feeling feverish" 0 (none) 1 (a little) 2 (a lot) "Sweat" 0 (none) 1 (a little) 2 (a lot) "Chills" 0 (none) 1 (a little) 2 (a lot) Ear and nose symptoms "Headache" 0 (none) 1 (a little) 2 (a lot) "Runny nose" 0 (none) 1 (a little) 2 (a lot) "Blocked nose" 0 (none) 1 (a little) 2 (a lot) "Sneezing" 0 (none) 1 (a little) 2 (a lot) "Watery eyes" 0 (none) 1 (a little) 2 (a lot) Breathing issues "Problems with breathing" 0 (none) 1 (a little) 2 (a lot) "Wheezing" 0 (none) 1 (a little) 2 (a lot) "Shortness of breath" 0 (none) 1 (a little) 2 (a lot) "Coughing up mucus" 0 (absent) 1 (present) "Dry cough" 0 (none) 1 (a little) 2 (a lot) "Felt tired" 0 (none) 1 (a little) 2 (a lot) Throat symptoms "Swollen glands" 0 (none) 1 (a little) 2 (a lot) "Sore throat" 0 (none) 1 (a little) 2 (a lot) "Hoarseness" 0 (none) 1 (a little) 2 (a lot) "Tickles in the throat" 0 (none) 1 (a little) 2 (a lot) Therefore, possible score differences may be encountered by using the questionnaire. At last, the present study has been conducted in a single geographical area, so its application in different areas of other Countries may be useful in order to observe any possible score differences. In conclusion, in this study, we developed and proposed a novel questionnaire to assess and scale the COVID-19-associated symptoms. Its application to COVID-19 patients could improve the reliability in collecting clinical data and assessing the severity of symptoms, while its extension to subjects who have not received a COVID-19 diagnosis yet might help to identify those at higher risk of being affected and, therefore, needing a microbiological test. Abbreviations: b, unstandardised regression coefficient; COPD, chronic obstructive pulmonary disease; Exp(b), exponential transformation of b; P, P-values associated with Z; SE, standard measurement error associated with Exp(b);Z, z-test statistic. a Gender was coded as 0 for females and 1 for males. b Smoking status was coded as 0 for never, and 1 for former or current smokers. c Hypertension, Diabetes, Cardiovascular and Cerebrovascular diseases, Tumours, COPD, and Renal impairment were coded as 0 for absent and 1 for present. d These predictors were mean-centred for the regression analysis. Rare-event logistic regression predicting SARS-CoV-2 infection with unweighted sum of the breathing symptoms scale and of the anosmia/ ageusia item. COVID-19 symptom questionnaire's partial score used as predictor The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding The outbreak of COVID-19: an overview What we know so far: COVID-19 current clinical knowledge and research Clinical features of patients infected with 2019 novel coronavirus in Wuhan Review article: gastrointestinal features in COVID-19 and the possibility of faecal transmission Cardiovascular considerations for patients, health care workers, and health systems during the COVID-19 pandemic Central nervous system manifestations of COVID-19: a systematic review COVID-19 in otolaryngologist practice: a review of current knowledge Features of anosmia in COVID-19 Association of chemosensory dysfunction and COVID-19 in patients presenting with influenzalike symptoms Is oro/nasopharyngeal swab for SARS-CoV-2 detection a safe procedure? Complications observed among a case series of 4876 consecutive swabs Predominance of an altered sense of smell or taste among long-lasting symptoms in patients with mildly symptomatic COVID-19 Evolution of altered sense of smell or taste in patients with mildly symptomatic COVID-19 Alterations in smell or taste in mildly symptomatic outpatients with SARS-CoV-2 infection New onset of loss of smell or taste in household contacts of home-isolated SARS-CoV-2-positive subjects Self-reported alteration of sense of smell or taste in patients with COVID-19: a systematic review and meta-analysis on 3563 patients Features of severe COVID-19: a systematic review and meta-analysis The Wisconsin Upper Respiratory Symptom Survey (WURSS): a new research instrument for assessing the common cold Development and validation of a questionnaire to measure asthma control The community-acquired pneumonia symptom questionnaire: a new, patient-based outcome measure to evaluate symptoms in patients with communityacquired pneumonia Development and validation of an improved, COPD-specific version of the St The development and validation of a multidimensional sum-scaling questionnaire to measure patient-reported outcomes in acute respiratory tract infections in primary care: the acute respiratory tract infection questionnaire Psychometric validity of the 22-item sinonasal outcome test A generalized partial credit model: application of an EM algorithm A new look at the statistical model identification Procedures for Psychological, Psychometric, and Personality Research Logistic regression in rare events data Everyone's Statistical Software Risk management during COVID-19: safety procedures for otolaryngologists Immunosenescence and Inflammaging: Risk Factors of Severe COVID-19 in Older People COVID-19 transforms health care through telemedicine: Evidence from the field Self-reported COVID-19 symptoms on Twitter: an analysis and a research resource Additional Supporting Information may be found in the online version of the article at the publisher's website. How to cite this article An informed consent has been obtained for any procedure involving the patients described in this article. The manuscript has not been submitted to more than one journal for simultaneous consideration.The manuscript has not been published previously (partly or in full).All of the authors have participated in the planning, writing or revising the manuscript. The authors have declared no disclosures. Research Data are not shared.