key: cord-1039237-251gwpqu
authors: Lulbadda, Kushan Tharuka; Kobbekaduwa, Dhanushka; Guruge, Malika Lakmali
title: The impact of temperature, population size and median age on COVID-19 (SARS-CoV-2) outbreak
date: 2020-09-28
journal: Clin Epidemiol Glob Health
DOI: 10.1016/j.cegh.2020.09.004
sha: 8815dfe36dc30ae8e8fd302b4209711659f4ef16
doc_id: 1039237
cord_uid: 251gwpqu

OBJECTIVES: This study aims to explore the association between the spread of COVID-19 and external parameters. In this regard, the temperature, population size, median age, and health care facilities of 58 different countries are considered as external factors.
METHODS: A negative binomial regression model was fitted to identify the associations between the factors and cases of COVID-19 during the study periods.
RESULTS: The temperature, population size, and median age are positively associated with the spreading rate of COVID-19. There is no evidence that case counts of COVID-19 decline in countries with better health care facilities. An empirical model is also presented to estimate the number of cases within a country using the external parameters.
CONCLUSIONS: It is impossible to express the change in the number of cases for a unit increase in each of the variables, because the effect of a change in a single variable depends on the values of the other variables. However, the findings of this study provide useful implications for the authorities and decision-makers to take specific precautionary measures.

The recent coronavirus (COVID-19) epidemic 1 has had a significant impact in many countries, and it warrants the closest scrutiny. Therefore, it is essential to discuss the history of this virus family 2 before considering its behavior. Coronaviruses are a large family of viruses that cause infections in humans and animals. In humans, the viruses can cause illnesses ranging in severity from the common cold to Severe Acute Respiratory Syndrome (SARS).
3 Research is currently focused on developing novel therapeutic options such as antivirals and vaccines. 4 There are several examples of earlier outbreaks. In 2003, the severe acute respiratory syndrome coronavirus (SARS-CoV) 5 triggered an outbreak of SARS in China and Hong Kong. Middle East respiratory syndrome coronavirus (MERS-CoV) 6 started an epidemic in Saudi Arabia and other countries in 2012. Both outbreaks were controlled by a combination of human intervention and still unknown natural conditions. In 2020, coronaviruses became a familiar name among people. All viruses from this family are zoonotic. 7 They originate in animals and can then spread to humans after mutation. More than 40 coronaviruses 8 have been named by the International Committee on Taxonomy of Viruses. Four of these are community-acquired and have been circulating continuously through the human population for a very long time. Fig. 1 illustrates these categories. Coronaviruses are a family of enveloped viruses. 9 This means that they are covered in an oily coat called a lipid bilayer, studded with proteins that stick out like the spikes of a crown; this gives them their name, corona being the Latin word for crown.

A few country-level external factors have direct as well as indirect effects on the outbreak of COVID-19. Among them, the average temperature, the current population of the country, the overall health system performance (measured by the health care rank), and the median age of the population are considered in this study. Gross domestic product (GDP) at Purchasing Power Parity (PPP) is added to the study to capture the economic dimension.
For the analysis, the top 58 countries that recorded more than 2000 confirmed cases in the initial 60 days, starting from the first case, are used. The paper is organized as follows. Section 2 covers the worldwide distribution of the virus. Section 3 discusses the impacts of external factors on the virus. Section 4 describes the method of analysis. Section 5 provides the results of the investigation, and the last section presents the discussion with conclusions.

Received 26 June 2020; received in revised form 25 August 2020; accepted 11 September 2020. https://doi.org/10.1016/j.cegh.2020.09.004

COVID-19 has spread around the world and has been declared a pandemic by the World Health Organization (WHO). According to WHO data, 3,096,626 confirmed cases had been reported worldwide by the end of April 2020. 10 WHO has categorized the world into six regions, and the distribution of cases by the end of April 2020 is shown in Fig. 2. The virus is spread by small droplets formed by sneezing, coughing, and talking. Rather than traveling through the air for long distances, the droplets typically fall to the ground or onto surfaces. 11 The virus is most transmissible during the first three days after the onset of symptoms. 12 Spread is also possible before symptoms appear, and from people who do not show symptoms. Suggested preventive actions include washing hands with soap for at least 20 s, maintaining a distance of 1 m from each other, wearing a face mask in public places, and quarantine for suspected or infected people. 13 Moreover, authorities worldwide have imposed travel restrictions, workplace hazard controls, and facility closures with lockdowns to mitigate the risk.

Much remains unknown about how the virus spreads through the environment. The high variability in the behavior and characteristics of the virus can be considered the major reason for this unpredictable outbreak.
14 The situation urgently calls for collaboration with environmental researchers and scientists to identify the environmental features and other external factors that affect the transmission of the deadly virus. Many research studies and laboratory experiments are under way to determine how factors such as temperature, wind, and humidity affect the spreading rate. The following sections describe the external parameters under consideration.

Temperature 15 is one of the most important factors to consider. The virus has been reported to remain active for 8-10 days on dry surfaces, and it survives at the human body temperature of 37°C. However, the threshold temperature needed to destroy the virus is still unknown. Several studies analyzing a laboratory-grown copy of the coronavirus (SARS-CoV-2) have indicated that heat has a direct impact on the behavior of the virus. Heating at 60°C for an hour has successfully suppressed numerous deadly viruses such as Ebola, but this temperature may be insufficient to destroy SARS-CoV-2. Researchers have found that heating the virus to 92°C for 15 min can make it completely inactive. Even though numerous scholars have made valuable discoveries under laboratory conditions, the real-life situation is complicated and cannot be controlled. 16 A group of scientists at University College London has suggested that a higher number of coronavirus cases were reported in February, while the rates decreased in summer. This is attributed to some improvement of the immune system 17 in the human body with increasing temperature.

The spread of a deadly virus can be amplified as the number of people increases. Therefore, it is vital to consider the transmission of COVID-19 in relation to the population size of a geographical area.
In one study, Receiver Operating Characteristic (ROC) analysis was used to obtain the sensitivity of population size with respect to the transmission rate. The results suggested that areas with a population greater than 1.7 million can have an enhanced transmission rate. 18 The primary concern regarding the spread of COVID-19 is human-to-human transmission. The history of pandemics indicates that a disease in a particular population can cross international boundaries. The first outbreak of cholera in 1816, the Spanish flu pandemic in 1918, and the Asian flu pandemic in 1957 are examples of outbreaks that started in a single area and then spread to numerous parts of the world. 19

During a pandemic, the availability of health care facilities is a critical aspect irrespective of the disease that is being spread. In the case of COVID-19, health care services are a crucial concern because of the need for testing services, personal protective equipment for health officials, and more space to isolate confirmed patients. A different kind of study was carried out by a group of scholars to analyze the Mexican indigenous population and its capacity to respond to COVID-19. The overall conclusion was that this population is more vulnerable to COVID-19 because of the identified social and health factors along with the shortage of proper health care facilities. 20 Hand washing stations, hygienic equipment, and proper waste disposal management systems are considered primary health care facilities. However, these necessary facilities were reported to be insufficient in most hospitals in developing countries. 21 Therefore, during a pandemic, the available health care facilities can contribute in many ways to mitigating or controlling the spread of viruses.

A virus can infect a person regardless of age, but the response of the human body to the virus can vary depending on the age of the person.
Weaker immunity in most older people is one of the identified reasons. In the COVID-19 pandemic, several published reports have stated that most of the non-survivors are relatively old. The presence of more than one disease in older patients, such as diabetes, kidney disease or hypertension, increases the risk by weakening immunity. 22 A study conducted in the Philippines concluded that age has a positive influence on COVID-19 mortality. The results showed that COVID-19 patients older than 60 years are 8.15 times more likely to die than patients below 60 years. 23 Therefore, an in-depth analysis of age as a risk factor can be significant in safeguarding senior citizens worldwide by minimizing the spread of COVID-19.

The GDP of a country is defined as the total market or monetary value of all the finished goods and services produced within the country in a given period. 24 This value is used to compare the standard of living between countries. 25 GDP per capita based on PPP is GDP per capita converted to an international currency using purchasing power parity rates. Moreover, GDP is a tool used to measure the economic progress of a country. 26 Therefore, the influence of the economy on the spreading rate of COVID-19 can be related to the GDP. Hence, to understand the connection between the economic status of a country and the transmission rate of COVID-19, GDP per capita based on PPP can be considered a useful tool.

Atmospheric factors such as rainfall, humidity, air quality index, and temperature are dynamic factors that can modify the spread of deadly viruses. Aside from these environmental conditions, several more factors, such as the demography of the considered area, the foreign affairs of the country, technological expansion with medical development, lockdown time, and national net wealth, could affect the outbreak situation.
Table 1 lists the top 58 countries that recorded more than 2000 confirmed cases in the initial 60 days from the day the first case was reported; these data, obtained from the WHO COVID-19 dashboard, are used for the analysis. The secondary data considered for the statistical analysis are: the average temperature for the considered period, as given on the weather to visit website 27 and the accuweather website; 28 the population size of the country for 2020, from the worldometer website; 29 the current health care ranking by WHO, which considers the overall health system performance; 30 the median age of the country for 2020, from the United Nations World Population Prospects 2019; 31 and the GDP of the country as a PPP value, from the International Monetary Fund World Economic Outlook. 32 Various media outlets have explained the extraordinary increase in the number of incidents reported in the country. However, delays in establishing safety rules and regulations, authorities prioritizing the economy, and the exhaustion of health resources and facilities are considered to be the major reasons. As the behavior of this outbreak is unpredictable, it is difficult to describe it by statistical analysis.

The analysis is conducted using the negative binomial regression model. 33 In this analysis, the number of cases recorded during the first 60 days from the first case is used, and all the independent variables are covariates. Since the dependent variable (number of cases) consists of count data, the Poisson regression model, which is the basic statistical model for count data, was fitted first, for all possible models of the variables. However, the Pearson chi-square values were far greater than one for all of those models, which indicates overdispersion, so the negative binomial regression model was adopted. This analysis is based on secondary data, which was collected from several sources, as mentioned above. The data set consists of 290 secondary data values: the five factors discussed above for each of the 58 countries.
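The overdispersion check that motivates the move from Poisson to negative binomial regression can be illustrated with a short sketch. It computes the Pearson chi-square statistic divided by the residual degrees of freedom for a Poisson fit; values far above one suggest that the Poisson assumption (variance equal to the mean) does not hold. The function name `pearson_dispersion` and the toy counts are hypothetical and are not taken from the study's data.

```python
def pearson_dispersion(observed, fitted, n_params):
    """Pearson chi-square / residual degrees of freedom for a Poisson fit.

    observed : list of observed counts
    fitted   : list of fitted Poisson means (same length, all > 0)
    n_params : number of estimated regression parameters
    """
    if len(observed) != len(fitted):
        raise ValueError("observed and fitted must have the same length")
    df = len(observed) - n_params
    if df <= 0:
        raise ValueError("need more observations than parameters")
    # Under a Poisson model the variance equals the mean, so each squared
    # Pearson residual is (y - mu)^2 / mu.
    chi_square = sum((y - mu) ** 2 / mu for y, mu in zip(observed, fitted))
    return chi_square / df


if __name__ == "__main__":
    # Hypothetical toy counts, NOT the study's data. An intercept-only
    # Poisson model fits every observation with the sample mean.
    counts = [1, 3, 50, 2, 44]
    mean = sum(counts) / len(counts)
    ratio = pearson_dispersion(counts, [mean] * len(counts), n_params=1)
    print(f"Pearson chi-square / df = {ratio:.2f}")  # far above 1 here
```

A ratio near one is consistent with a Poisson model, while a much larger ratio, as in this toy example, is the usual reason for switching to a negative binomial model, which has an extra parameter for the additional variance.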
This analysis is done to identify the association between the number of cases and the variables. Table 2 shows the mean and standard deviation of each of those factors, which gives an idea of the dispersion of each one. The selected model is described in Equation (1) as follows:

$$
\begin{aligned}
\text{Number of Cases} = \exp\big[\, & 6.661 + 0.073\,X_1 - 2.337\times10^{-8}\,X_2 + 0.023\,X_3 + 3.347\times10^{-5}\,X_4 \\
& + 2.457\times10^{-9}\,X_2X_3 - 1.139\times10^{-12}\,X_2X_4 - 6.735\times10^{-11}\,X_1X_2X_3 \\
& + 6.072\times10^{-14}\,X_1X_2X_4 - 4.793\times10^{-8}\,X_1X_3X_4 \,\big]
\end{aligned}
\tag{1}
$$

where $X_1$ = average temperature in °C, $X_2$ = population size, $X_3$ = median age, and $X_4$ = GDP per capita (PPP).

The healthcare facility rank has been removed from the above model because it showed no association with the number of cases. Its impact on the number of cases is therefore considered negligible. According to the model, the number of cases cannot be described by the considered factors individually, because of the interactions among them; it is the interactions of the factors that describe the situation well. Equation (1) can be used to predict the number of cases within a country. However, it is important to note that the values substituted into the formula should be within the range of the values given in this study.

This section provides the results of the statistical analysis. A goodness of fit test (using Pearson Chi-square) was carried out to test the validity of the model. The negative binomial regression model fits the data well for the above factors. Moreover, to check the significance of the model, the omnibus test was carried out. According to the test result, it can be concluded that the negative binomial regression model for this data is significant at the 5% level. Further, the significance of the model parameters was checked by comparing p values with a 0.05 significance level.
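Equation (1), and the point that no single factor can be read in isolation, can be sketched in code. The following is a minimal sketch, not the authors' implementation: the coefficients are transcribed from Equation (1), while the names `COEF`, `predicted_cases`, and `temperature_log_effect`, and the sample inputs, are hypothetical. The second function evaluates the change in log(cases) per additional degree Celsius, which depends on the other three inputs through the interaction terms.

```python
import math

# Coefficients transcribed from Equation (1). The dictionary keys and all
# function names below are illustrative assumptions.
COEF = {
    "intercept": 6.661,
    "x1": 0.073,          # average temperature (deg C)
    "x2": -2.337e-8,      # population size
    "x3": 0.023,          # median age
    "x4": 3.347e-5,       # GDP per capita (PPP)
    "x2x3": 2.457e-9,
    "x2x4": -1.139e-12,
    "x1x2x3": -6.735e-11,
    "x1x2x4": 6.072e-14,
    "x1x3x4": -4.793e-8,
}


def linear_predictor(x1, x2, x3, x4):
    """Exponent of Equation (1) for the given inputs."""
    c = COEF
    return (
        c["intercept"]
        + c["x1"] * x1 + c["x2"] * x2 + c["x3"] * x3 + c["x4"] * x4
        + c["x2x3"] * x2 * x3
        + c["x2x4"] * x2 * x4
        + c["x1x2x3"] * x1 * x2 * x3
        + c["x1x2x4"] * x1 * x2 * x4
        + c["x1x3x4"] * x1 * x3 * x4
    )


def predicted_cases(x1, x2, x3, x4):
    """Estimated number of cases according to Equation (1)."""
    return math.exp(linear_predictor(x1, x2, x3, x4))


def temperature_log_effect(x2, x3, x4):
    """Change in log(cases) per +1 deg C with the other inputs held fixed.

    Collects every term of Equation (1) that contains X1.
    """
    c = COEF
    return (
        c["x1"]
        + c["x1x2x3"] * x2 * x3
        + c["x1x2x4"] * x2 * x4
        + c["x1x3x4"] * x3 * x4
    )


if __name__ == "__main__":
    # Hypothetical inputs; they should lie within the ranges of the study data.
    inputs = dict(x1=15.0, x2=5.0e7, x3=35.0, x4=30000.0)
    print("predicted cases:", round(predicted_cases(**inputs)))
    print("log effect of +1 deg C:",
          temperature_log_effect(inputs["x2"], inputs["x3"], inputs["x4"]))
```

Because the temperature effect is a function of population size, median age, and GDP per capita, its size and even its sign can differ from one country to another, which is why a single per-unit effect cannot be quoted for any one variable.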
According to the model effects shown in Table 3, although the main effects of average temperature, population size and median age are not significant, the interaction terms involving those three variables are significant in the model. Therefore the main effects of those variables cannot be omitted from the model. To check model adequacy, residual diagnostics were used as follows. To check for outliers (unusual observations), standardized Pearson residuals were used. All the standardized Pearson residuals were found to lie between −2 and +2 except for the last three observations, which can be considered outliers in this data set. According to Fig. 3, all the observations lie within −2 and +2 of standardized deviance residuals except one observation. That is, about 95% of the residuals have an absolute value below 2.0. Therefore, it is evident that the model adequately fits the data. Further, the assumption of linearity in the model parameters is satisfied, since there is no pattern in the plot. So the assumed model fits the data and describes the situation correctly.

The primary intention of this work is to enhance the knowledge of the spread of COVID-19 in relation to external parameters such as temperature, population size, median age, and health care facilities. The results were obtained using a negative binomial regression model. They suggest that population size, average temperature and median age do not have much impact when they are considered individually within the model. However, their interactions with other variables have a significant impact on the number of cases. Therefore, the effect of those variables on the number of cases cannot be ignored in the model. In contrast, healthcare facilities within a country showed a negligible effect on the spread of the virus. GDP was included to incorporate the economic status of a country and make the analysis more practical.
Because there are interactions among the factors, it is difficult to state precisely how the number of cases changes for a unit increase in any one variable, since that change depends on the values taken by the other variables as well. Finally, an empirical formula has been presented to estimate the number of cases reported within a country under certain conditions. The analysis conducted in this study can be useful in undertaking precautionary measures to mitigate or minimize the spread throughout the world. The recommendations provided by this study will be valuable in imposing preventive measures in the future to face a similar kind of pandemic.

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Ethical approval was not required for this work as freely available open data were used.

The authors have no competing or conflicting interests.

A review ON corona virus ( COVID-19 ) Severe Acute Respiratory COVID-19: consider cytokine storm syndromes and immunosuppression Middle East Respiratory Syndrome Coronavirus Transmission Bat flight and zoonotic viruses The SARS, MERS and novel coronavirus(COVID-19) epidemics, the newest and biggest global health threats: what lessons have we learned? Fusion of Enveloped Viruses in Endosomes COVID-19) Dashboard. World health organization How COVID-19 Spreads Accessed date Stop the Novel Corona Viral Infection Now Stop the Novel Corona Viral Infection Now How to identify factors affecting COVID-19 transmission New evidence for significant effect of air temperature on COVID-19 transmission Evaluation of heating and chemical protocols for inactivating SARS-CoV-2. bioRxiv Does the Immune System's Ability to Fight Viruses Change with the Seasons? 
University of SURREY The sensitivity and specificity analyses of ambient temperature and population size on the transmission rate of the novel coronavirus (COVID-19) in different provinces of Iran Scatter plot for standardized deviance residuals vs mean predicted Classroom-Resources/Illuminations/Interactives/Pandemics-How-Are-Viruses-Spread/, Accessed date Critical review of social , environmental and health risk factors in the Mexican indigenous population and their capacity to respond to the COVID-19 Healthcare Facilities in Developing Countries a High Risk for Coronavirus Transmission Estimation of risk factors for COVID-19 mortality -preliminary results. medRxiv Age as a risk factor of COVID-19 mortality in the Philippines Gross domestic product (GDP) Real GDP per capita, how to calculate it, and data since 1947 Does high GDP mean economic prosperity? Invest Accessed date: 27 The World 2020. World population review List of Countries by Projected GDP per capita On ridge estimators for the negative binomial regression model The authors wish to thank the following individuals who assisted with the medical background of this research study and proofreading of