key: cord-0945420-eqfirkv4 authors: Mika, Justyna; Tobiasz, Joanna; Zyla, Joanna; Papiez, Anna; Bach, Małgorzata; Werner, Aleksandra; Kozielski, Michał; Kania, Mateusz; Gruca, Aleksandra; Piotrowski, Damian; Sobala-Szczygieł, Barbara; Włostowska, Bożena; Foszner, Paweł; Sikora, Marek; Polanska, Joanna; Jaroszewicz, Jerzy title: Symptom-based early-stage differentiation between SARS-CoV-2 versus other respiratory tract infections—Upper Silesia pilot study date: 2021-06-30 journal: Sci Rep DOI: 10.1038/s41598-021-93046-6 sha: 1165bbbd45a41915f19072ad9d20d5bc4d9229ae doc_id: 945420 cord_uid: eqfirkv4 In the DECODE project, data were collected from 3,114 surveys filled by symptomatic patients RT-qPCR tested for SARS-CoV-2 in a single university centre in March-September 2020. The population demonstrated balanced sex and age with 759 SARS-CoV-2( +) patients. The most discriminative symptoms in SARS-CoV-2( +) patients at early infection stage were loss of taste/smell (OR = 3.33, p < 0.0001), body temperature above 38℃ (OR = 1.67, p < 0.0001), muscle aches (OR = 1.30, p = 0.0242), headache (OR = 1.27, p = 0.0405), cough (OR = 1.26, p = 0.0477). Dyspnea was more often reported among SARS-CoV-2(-) (OR = 0.55, p < 0.0001). Cough and dyspnea were 3.5 times more frequent among SARS-CoV-2(-) (OR = 0.28, p < 0.0001). Co-occurrence of cough, muscle aches, headache, loss of taste/smell (OR = 4.72, p = 0.0015) appeared significant, although co-occurrence of two symptoms only, cough and loss of smell or taste, means OR = 2.49 (p < 0.0001). Temperature > 38℃ with cough was most frequent in men (20%), while loss of taste/smell with cough in women (17%). For younger people, taste/smell impairment is sufficient to characterise infection, whereas in older patients co-occurrence of fever and cough is necessary. The presented study objectifies the single symptoms and interactions significance in COVID-19 diagnoses and demonstrates diverse symptomatology in patient groups. Demographic and epidemiological profile. 3 ,114 patients filled the surveys (Table 1) . Among them, about 25% were tested positive for SARS-CoV-2 infection, whereas the remaining 75% got negative test results. The population was balanced regarding sex. Among all patients, 62% reported symptoms of infection and the remaining 38% were categorised as healthy. The age of patients ranged from 1 to 98 years, with a median of 42. 1,201 people were aware of their blood type, among which the most common were type 0 (39%) and type A (34%), and the least common was type AB (almost 10%). Detailed characteristics of the sample are provided in Supplementary Table S1. Demographic profile of symptomatic patients. First, we investigated the completeness of the surveys in the subset of 1,941 symptomatic patients. About half of the queries had more than 30% missing information, including five characteristics that did not have answers provided by more than 60% of patients. The other half of the queries had information provided by all patients (10 characteristics) or by most of the patients (4 queries). Supplementary Figure S1 shows the distribution of missing values. Next, we examined the demographic and epidemiological profile of symptomatic patients regarding the COVID-19 diagnosis versus other respiratory tract infections. Among 1,941 patients presenting any symptoms, 30% (n = 586) were infected with SARS-CoV-2, and the remaining 70% (n = 1355) got negative test results (Table 1 ). There were no statistically significant differences, neither in the number of men/women with and without COVID-19 nor the relation between the age of patients and their diagnosis (Table 2) . Noteworthy, pairwise analysis of age and sex regarding COVID-19 diagnosis showed significant differences in median age between men and women in the group of people infected www.nature.com/scientificreports/ with SARS-CoV-2 (p = 0.0140). Women get a positive diagnosis of COVID-19 when being older compared to men, however, with negligible effect (OR = 1.01). Blood type was evenly distributed in both groups of patients. As expected, having contact with an infected person significantly twofold increased the odds of being SARS-CoV-2 infected ( Table 3 ). The duration of symptoms is slightly longer for people not infected with SARS-CoV-2, however careful examination of results showed that the effect is driven by people reporting many days of observing symptoms (> 10) in the group of COVID-19 negative diagnosis. The median duration of symptoms in both groups equals four days. Symptoms and diseases differentiating COVID-19 from other infections. Next, we focused on a list of symptoms (Table 3 ) and coexisting diseases ( Table 2 ) to identify the ones that have a significant association with COVID-19. The strongest co-occurrence was observed between loss of taste or smell and COVID-19 disease (Fig. 1, Panel A) . The odds of getting a positive COVID-19 diagnosis are more than three times bigger among people with loss of taste or smell (Fig. 1, Panel B) . The observed OR value corrected after the subgroup size imbalance interprets as the medium effect. Other symptoms coexisting with early SARS-CoV-2 infection were high temperature (> 38°C), muscle aches, headache and cough, although, for all, the effect is small or negligible. On the other hand, a negative association with COVID-19 was observed for dyspnea, gastrointestinal symptoms (GI symptoms) and tiredness. These symptoms were less frequent in the group of patients with early SARS-CoV-2 infection. The observed effects were medium level at most as the odds of getting a positive COVID-19 diagnosis were about Table 1 . Summary of data based on demographic profile. One patient did not provide information regarding sex. Table 2 . The differences in prevalence of studied clinical characteristics between patients positive and negative for SARS-CoV-2. * % calculated referring to N of particular characteristics in SARS-CoV-2(-) and SARS-CoV-2( +). ** Characteristics for sex were provided considering men as reference. *** Unadjusted p-value. www.nature.com/scientificreports/ two times lower for patients observing symptoms mentioned above (Fig. 1, Panel B) . Notably, a negative relation with SARS-CoV-2 infection was also observed for the allergies with a small effect. Table 3 also presents the summary of the chosen quality indices for the single symptom logistic regression (LR) based predictor of SARS-CoV-2 status. The mean value and its 95% confidence interval obtained in tenfold cross-validation are provided there. The mean AUC (Area Under the Curve) varies from 50.7% (loss of appetite) to 63.4% (loss of taste/smell). The maximum negative predictive value (NPV), equal to 87.7%, was observed for GI symptoms with a positive predictive value (PPV) at the level of 47.0%. Pairwise association of symptoms. Subsequently, we examined the correlation between every pair of surveyed queries. The highest correlations were observed between age and coexisting diseases: lung disease, kidney disease, diabetes, heart disease and hypertension (Fig. 2) , which is in agreement with the general observations. A medium-level association was observed between loss of appetite and saturation (i.e. baseline Sp02; negative association) and duration of symptoms (positive association). Loss of appetite also coincides with patient age and maximum temperature with a small effect. While considering the headache, it co-occurs with a series of symptoms: muscle aches, sore throat and dizziness with positive small but very close to medium effect, with sex and loss of taste or smell with small positive effect, and with age with the small negative association. Another Table 3 . The differences in prevalence of studied symptoms between patients positive and negative for SARS-CoV-2 and the quality indices for the logistic regression based predictor for the chosen single symptoms (obtained from tenfold cross-validation). * % calculated referring to N of particular characteristics in SARS-CoV-2(-) and SARS-CoV-2( +). ** Unadjusted p-value. www.nature.com/scientificreports/ positive association (with small or close to medium effect) were observed between duration of symptoms and tiredness, loss of appetite, loss of taste or smell, cough and skin reactions. Interestingly, maximum temperature correlates negatively with a list of characteristics: allergies, contact with infection and conjunctivitis, with small but close to medium effect. Moreover, it correlates negatively with cough, lung disease, sore throat, dyspnea, and saturation with a small effect. Similarly, a negative correlation was observed between saturation and age, tiredness, and most of the coexisting diseases (with small effect). Occurrence and co-occurrence of symptoms. Next, we examined if the co-occurrence of symptoms is specific for COVID-19 diagnosis. Table 4 and Fig. 3 shows the frequency of patients observing one symptom only (and no other signs from the list) or a particular combination of symptoms from a list of four queries: temperature > 38℃, cough, dyspnea, and loss of taste or smell. Table 4 also presents the value of the prediction quality indices for the occurrence and co-occurrence of the analysed symptoms derived from the whole data set. The highest effect was observed for the co-occurrence of two symptoms only: cough and dyspnea, with no temperature > 38℃ and no loss of taste or smell, which seemed protective against COVID-19 and was about 3.5 times more frequent among people with negative COVID-19 diagnosis (p < 0.0001, NPV = 86%, PPV = 38%, DOR = 3.82). Observation of only high temperature or only cough speaks against COVID-19 too (OR = 0.42 and 0.67 respectively, p < 0.0001 and p = 0.0201). The COVID-19 prediction indices for such situations are NPV = 81%, PPV = 37% and DOR = 2.49 for the observation of high temperature (> 38℃) only, and NPV = 73%, PPV = 36% and DOR = 1.50 for cough only, respectively. Among the single symptom occurrence, only loss of taste/smell brings a significant increase of COVID-19 odds with OR = 2.09 and p = 0.0166, and NPV = 66%, PPV = 53% and DOR = 2.11. The co-occurrence of cough and loss of taste/smell without high temperature and dyspnea increases slightly the risk of SARS-CoV-2 infection (p < 0.0001, OR = 2.49, NPV = 67%, PPV = 55%, DOR = 2.52). Observing the co-occurrence of the third symptom: high temperature or even presence of all four symptoms, do not significantly increase the odds (OR = 2.67 and OR = 2.00, respectively). Adding four more symptoms to the co-occurrence analysis presented a more complex www.nature.com/scientificreports/ picture of the associations with COVID-19 disease (see Supplementary Figure S2 ). Muscle aches, headache, sore throat and dizziness have been inspected for co-appearance with symptoms mentioned above. Occurrence of muscle aches or headache only gives nonsignificant OR for COVID-19 diagnosis, while combination with some of the main symptoms changes OR significantly. For example, the co-occurrence of headache with cough and loss of taste/smell increases the estimate of odds ratio from OR = 2.49 to OR = 3.74, while cooccurrence of muscle ache with the two main symptoms does not impact their OR (2.49 versus 2.15). However, the co-occurrence of the four mentioned above symptoms (cough, loss of smell/taste, headache and muscle ache) results in higher OR = 4.72. The prediction power of the symptom-based classifiers. Additionally, to support the early-stage symptom-based decision on the patient being or not SARS-CoV-2 positive, the LR-based predictors were also built for a different subset of the nine main symptoms considered (N = 398). The summary of the results is presented in Supplementary Table S2 Profiles of patients infected with SARS-CoV-2 regarding sex and age subsets. Finally, considering hormonal and immune response differences between men and women, as well as between younger and older people, we performed separate analyses for the mentioned subsets. The age threshold was set to 46 years as indicated by the Gaussian Mixture Model (GMM) decomposition of the age characteristic 28 . Sex subsets. Analysis of association between COVID-19 diagnosis and a single query showed that both subsets are characterised by associations between COVID-19 diagnosis and contact with an infected person, temperature > 38℃, dyspnea, and loss of taste or smell. Cough, muscle aches, and headache appear significantly more frequently only in SARS-CoV-2 infected men, whereas GI symptoms are less frequent only in men tested positive. The odds for women are not significant (see Supplementary Table S3 ). On the other hand, decreased odds of sore throat and heart disease are significant exclusively in women. Notably, the correlation between lung disease and saturation is observed only in men and not in women (see Supplementary Figure S3 www.nature.com/scientificreports/ Age subsets. Only in the younger subset, we have an association between COVID-19 diagnosis and sex, where odds of being infected with SARS-CoV-2 are about 0.7 times lower in women than in men (see Supplementary Table S3 ). Both subsets are characterised by the coincidence of COVID-19 diagnosis and contact with the infection, temperature > 38℃, dyspnea and loss of taste or smell. In addition, younger patients have a positive association between COVID-19 diagnosis and i) headache and ii) muscle aches, where odds of being infected with SARS CoV-2 are about 1.5 times bigger while having these symptoms. Younger people also show negative associations between positive COVID-19 diagnosis and sore throat, dizziness, GI symptoms, and asthma. On www.nature.com/scientificreports/ the other hand, older people infected with SARS-CoV-2 observe cough more often than not infected people and have heart diseases less often. Analysis of pairwise correlation between all queries showed a bigger number of at least small correlations for older patients. Not only do we observe a correlation between coexisting diseases, but also a negative correlation between age and a list of symptoms, including saturation, cough, headache or skin reactions (see Supplementary Figure S3 , Panel b). Co-occurrence of four main symptoms with COVID-19 diagnosis analysis showed that most of the associations are significant for younger people and not for older ones (Fig. 4) . From the subset of people who answered all questions about temperature > 38℃, cough, dyspnea and loss of taste or smell, the following symptoms were associated with COVID-19 diagnosis among older patients: (1) cough and dyspnea (as a protective factor), (2) temperature > 38℃, cough and loss of taste or smell (risk factor) and (3) high temperature (above 38℃) only, with no other symptoms (serves as the protective factor against SARS-CoV-2 infection). Younger people had the same relations as the entire population, with an exception to temperature > 38℃ and cough exhibiting lack of association. Considering analysis only on a subset of patients infected with SARS-CoV-2 (see Supplementary Figure S5 , Panel b), the biggest percentage of younger patients observed a combination of temperature > 38℃ and cough, as well as the co-occurrence of loss of taste or smell and cough (about 16% both combinations). Older people more frequently observe co-occurrence of temperature > 38℃ and cough (21%), as well as the cooccurrence of loss of taste or smell, cough and high temperature (17%). The main findings regarding the analysis of the single symptoms are in compliance with the already published reports [29] [30] [31] [32] [33] [34] . The main importance of this study is the frequency and correlation analyses among the symptoms and clinical characteristics of SARS-CoV-2 infected versus control group consisting of people with suspected SARS-CoV-2 infection, which was not confirmed through PCR tests of material from upper respiratory system swabs. This approach enables not only characterising the COVID-19 symptomatology but also a determination of prognostic significance for each symptom in COVID-19 diagnostics. Moreover, dependencies between symptoms and their interactions have been determined, giving a comprehensive view of the disease. For instance, in the studied population, dyspnea as a symptom, often indicated as crucial in SARS-CoV-2 diagnostics 35, 36 , occurred more frequently in non-infected individuals. This fact may suggest the subjectivity of the symptom and possible connection to stress resulting from fear of SARS-CoV-2 infection or other underlying diseases. Alternatively, it may signal that dyspnea and fatigue (also less frequent among SARS-CoV-2 infected) may develop at later disease stages 37 . Noteworthy, the occurrence of cough and dyspnea without fever or smell and taste impairment significantly diminishes the risk of testing SARS-CoV-2 positive, contrary to patients with acute lung disease or atypical lung inflammation. A similar situation occurs in gastrointestinal symptoms, which are associated with SARS-CoV-2 symptomatology; however, their appearance was much rarer compared to other infections www.nature.com/scientificreports/ that may have been treated incorrectly as COVID-19 symptoms. Consequently, the occurrence of GI symptoms alone presented little prognostic value. Remarkably, we were able to distinguish certain sets of symptoms based on correlation analysis which form a logical entity and discriminate their pathogenic association. For instance, the headache was linked with muscle pain and loss of taste and smell, forming a neurological panel 38 which correlated negatively with age. Furthermore, loss of appetite was observed to a greater degree in older patients with longer disease duration and higher temperature. Nonetheless, the higher temperature was found in men and people who pointed out having neither asthma, allergies, nor dyspnea. It might indicate that the following population is more susceptible to anergy. The observed in our study, positive association between baseline SpO2 and allergies, mainly respiratory allergies, is currently a very important topic. It is well known that innate antiviral responses are decreased but also an expression of SARS-Cov2 receptor ACE2 upregulated in epithelial cells is chronic rhinosinusitis 39, 40 which could aggravate the course of COVID-19. On the other hand, majority of those patients receive different medications modifying the course of disease, which include inhaled corticosteroids, short-or long-acting β agonist. Interestingly, recent data suggest that inhaled corticosteroids could decrease ACE2 expression 40 but, even more importantly, reduced the likelihood of needing urgent medical in the early stages of COVID-19 as shown in STOIC randomised controlled trial 41 . This association undoubtedly calls for further investigations. The dependency analysis and co-occurrence of symptoms showed that in the entire group of single symptoms, loss of taste and smell proved to be the most strongly discriminating. Additionally, in the case of symptom interactions, combining loss of taste and smell with temperature > 38°C and cough does not increase the odds ratio significantly but shrinks its confidence interval range, therefore, recognition precision. Further, interesting associations were discovered in groups of men and women, e.g., the significance of headache in men. It may be attributed to more frequent migraines and tension headaches in women who do not take notice of such pains 42 . Moreover, age appears to be a significant factor-the symptomatology changes around the age of 46. It is clearly visible that in younger people, only taste or smell impairment is sufficient, whereas, in older patients, for similar measurements to be observed, there is a need for the co-occurrence of fever and cough. It is of significance to note that the findings in this work are based on data collected in a single hospital centre, and therefore, depending on the demographic profile, some variation may be observed between regions. Another limitation of the study may be attributed to the lack of follow-up surveys which could enhance the understanding of early stage symptoms' impact. It also leads to consider the absence of information on mortality rates in this specific group of patients that inhibits from profiting fully from information regarding, e.g. comorbidities. The presented results not only objectify the significance of single symptoms and their interactions in COVID-19 diagnostics but also demonstrate diverse symptomatology in particular groups of individuals, e.g. younger versus older, men versus women. The essential observation may be concluded through the fact that clearly in younger people, loss of taste and smell is a sufficient symptom for SARS-CoV-2 infection when in older patients, the symptom needs to co-occur with fever and cough. The results of this study may prove beneficial in the process of patient selection for PCR or antigen testing towards SARS-CoV-2 infection by means of e.g., remote clinical services, especially in the case where testing the entire population is infeasible, or web-based self-assessment tools. Following analysis showed that patients infected with SARS-CoV-2 more frequently observe the loss of taste or smell, high temperature > 38℃ and cough. Notably, dyspnea reported by other research groups as more frequent among people with positive COVID-19 diagnosis here was less common among patients infected with SARS-CoV-2 at an early disease stage. Similarly, co-occurrence of cough and dyspnea appeared with lower rates in people with positive COVID-19 diagnosis. On the other hand, co-occurrence of loss of taste or smell with a list of symptoms was more frequent among people with positive COVID-19 diagnosis, including combinations with (1) temperature > 38℃, (2) cough, or (3) temperature > 38℃ and cough. Noteworthy, different responses are observed by people younger and older than 46 years. Older patients infected with SARS-CoV-2 more often observe cough and its co-occurrence with high temperature and loss of taste or smell. Younger people with positive COVID-19 diagnosis frequently report headaches and muscle aches, in addition to the main symptoms of loss of taste or smell, high temperature > 38℃ and cough. Worth noticing that in younger people, only taste or smell impairment is sufficient, whereas, in older patients, for similar measurements to be observed, there is a need for the co-occurrence of fever and cough. Data acquisition. Data were collected in the course of DECODE project (Data drivEn COVID-19 DEtection) 43 from paper surveys presented to the patients of the Department of Infectious Diseases and Hepatology, Bytom, Poland who were tested for SARS-CoV-2 infection. Patients were tested due to the presence of any symptoms of respiratory tract infection and/or direct contact with SARS-CoV-2 infected subjects (asymptomatic group). In all subjects included vital signs, complete physical examination and oxygen saturation prior to O 2 supplementation were performed. SARS-CoV-2 infection was determined by use of RT-PCR from nasopharyngeal swabs with 2019-nCOV Triplex RT-qPCR Detection Kit, Vazyme with a lower limit of detection 200 copies of RNA/mL. The study was approved by Bioethics Committee of Silesian Medical Chamber in Katowice, Poland. Patients provided written informed consent to participate in the study. We complied with all relevant ethical regulations and guidelines. Surveys were collected in the time range from March 3rd to September 30th, 2020. During the whole time of data acquisition, every patient who observed any symptom or had contact with an infected person was allowed to participate in the study. Patients reported their health state at the early stage of the disease. Captured data consist of demographics, previous contacts with SARS-CoV-2 infection, current www.nature.com/scientificreports/ symptoms, medical history including co-medications and vital signs with oxygen saturation. The collection of surveys was not supervised, thus resulting in a number of missing information. Every survey was completely anonymised to prevent personal data leak. Next, a database was created, including information from all survey queries. Comparative analysis. Statistical analysis was applied to determine queries having different characteristics between patients infected and not infected with SARS-CoV-2. The odds ratio (further denoted as OR) was estimated and accompanied by its 95% confidence interval (CI) as well as test for OR difference to one 44 . For categorical variables (i.e. blood type and all discrete having presence/absence state), Wilson frequency estimator was used 45 in the case of frequencies < 10%. For continuous variables, OR was computed from logistic regression coefficients. The significance level for statistical testing was set to 0.05. Moreover, the odds ratios served as a measure of effect size. Correlation analysis. To find associations between the analysed features, the following effect size coefficients were calculated: (1) Spearman's correlation coefficient between two continuous variables (further denoted as R), (2) Cramer's V coefficient (further denoted as V) between two discrete variables 46 , and (3) Wendt's biserial coefficient of correlation (further denoted as RBCC) between continuous and discrete variable 47 . All pairwise comparisons were examined. For the unbiased estimation of V coefficient, χ2 statistics for the independence test was computed with Yates continuity correction 48 in the case of low occurrence counts (n < 10). Effect size interpretation. The effect size quantification was done for absolute values of relevant statistics. Interpretation of OR is defined as small, medium and large for the relative thresholds considering allocation ratio for every query independently 49 . Interpretation of effect size for R, V and RBCC coefficients is the same and is defined as small, medium and large for the following thresholds: 0.1, 0.3 and 0.5, respectively 50 . The logistic regression models for single characteristic and for the combinations of co-occurring symptoms were built. Patients who failed to provide the necessary information in terms of particular characteristics used for a model construction were excluded from the training process. Each model aims to classify a patient as SARS-CoV-2 ( +) or (−) based on the set of input features. The models were evaluated with a tenfold cross-validation procedure. All available symptomatic samples were split into 10 subsets with preserved SARS-CoV-2 ( +)/(−) distribution. Youden index maximisation served for the selection of probability cut-off value from the range 0.1-0.9. Mean values and their 95% confidence intervals of performance measures are provided. The datasets generated during and/or analysed during the current study are available from the corresponding author on request. COVID-19: to be or not to be; that is the diagnostic question COVID-19: specific and non-specific clinical manifestations and symptoms: the current state of knowledge Prevalence and duration of acute loss of smell or taste in COVID-19 patients Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in Wuhan, China: a singlecentered, retrospective, observational study Novel Coronavirus Pneumonia Emergency Response Epidemiology Team. The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (COVID-19) in China Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study Clinical characteristics of coronavirus disease 2019 in China Clinical features of patients infected with 2019 novel coronavirus in Wuhan China Clinical course and outcomes of patients with severe acute respiratory syndrome coronavirus 2 infection: a preliminary report of the first 28 patients from the Korean Cohort Study on COVID-19 Comparison of seven commercial SARS-CoV-2 rapid point-of-care antigen tests: a single-centre laboratory evaluation study Cochrane COVID-19 Diagnostic Test Accuracy Group Rapid, point-of-care antigen and molecular-based tests for diagnosis of SARS-CoV-2 infection Clinical and radiological diagnosis of non-SARS-CoV-2 viruses in the era of Covid-19 pandemic A model to predict SARS-CoV-2 infection based on the first three-month surveillance data in Brazil Seroprevalence of SARS-CoV-2 antibodies in people with an acute loss in their sense of smell and/or taste in a community-based population in London, UK: An observational cohort study Comorbidity and its Impact on Patients with COVID-19. SN Compr Acute kidney injury associated with COVID-19: A retrospective cohort study (1234567890) Scientific Reports | (2021) 11:13580 | Patterns of COVID-19 testing and mortality by race and ethnicity among United States veterans: A nationwide cohort study Comorbidities associated with mortality in 31,461 adults with COVID-19 in the United States: A federated electronic medical record analysis Clinical features and treatment of COVID-19 patients in northeast Chongqing Rapid Epidemiological Analysis of Comorbidities and Treatments as risk factors for COVID-19 in Scotland (REACT-SCOT): a population-based case-control study Children with Covid-19 in pediatric emergency departments in Italy Clinical progression of patients with COVID-19 in Shanghai China Infection fatality rate of SARS-CoV2 in a super-spreading event in Germany BCG vaccine protection from severe coronavirus disease 2019 (COVID-19) Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19 disease Immersion in an emergency department triage center during the Covid-19 outbreak: first report of the Liège University hospital experience Symptom-based screening for COVID-19 in healthcare workers: the importance of fever GaMRed-adaptive filtering of high-throughput biological data Real-time tracking of self-reported symptoms to predict potential COVID-19 Self-Reported loss of smell and taste in SARS-CoV-2 patients: primary care data to guide future early detection strategies COVID-19 diagnosis and management: a comprehensive review Neurological symptoms, manifestations, and complications associated with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease 19 (COVID-19) Diagnosis of COVID-19 based on symptomatic analysis of hospital healthcare workers in belgium: observational study in a large belgian tertiary care center during early COVID-19 outbreak An analysis of patient self-reported COVID-19 symptoms during the first wave of the pandemic in Ireland Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO clinical characterisation protocol: prospective observational cohort study The early natural history of SARS-CoV-2 infection: Clinical observations from an urban, ambulatory COVID-19 clinic Neurological symptoms as a clinical manifestation of COVID-19: implications for internists Decreased expression of type I (IFN-β) and type III (IFN-λ) interferons and interferon-stimulated genes in patients with chronic rhinosinusitis with and without nasal polyps Distinct expression of SARS-CoV-2 receptor ACE2 correlates with endotypes of chronic rhinosinusitis with nasal polyps Inhaled budesonide in the treatment of early COVID-19 (STOIC): a phase 2, open-label, randomised controlled trial Estrogen, migraine, and vascular risk Classification supporting COVID-19 diagnostics based on patient survey data Explaining odds ratios Probable inference, the law of succession, and statistical inference The two-dimensional case in Dealing with a common problem in social science: A simplified rank-biserial coefficient of correlation based on the statistic Contingency table involving small numbers and the χ2 test Relative effect sizes for measures of risk Statistical power analysis for the behavioral sciences JJ, JP conceived the concept of the study. JJ, DP, BSS coordinated patient recruitment. JM, JT AP, JZ, MB, AW, MKa, MKo, BW participated in data collection and preprocessing. PF curated the database. JP, M, AG contributed to the data mining processing. JP supervised the statistical analysis and methodology. JM, JT, AP, JZ were responsible for statistical analysis, visualisation and validation. JJ and JM wrote the manuscript. All authors edited and approved the final version of the manuscript. The authors declare no competing interests. Correspondence and requests for materials should be addressed to J.P.Reprints and permissions information is available at www.nature.com/reprints. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.