key: cord-0933956-p70i6swe
authors: Cássaro, Fábio A.M.; Pires, Luiz F.
title: Can we predict the occurrence of COVID-19 cases? Considerations using a simple model of growth
date: 2020-08-01
journal: Science of The Total Environment
DOI: 10.1016/j.scitotenv.2020.138834
sha: 95267da7356fbae7d075d3175a96a12d876db7bf
doc_id: 933956
cord_uid: p70i6swe

Abstract

This study presents a simple model to follow the evolution of the COVID-19 (CV-19) pandemic in different countries. The cumulative distribution function (CDF) and its first derivative were employed for this task. The simulations showed that it is almost impossible to predict, based on the initial CV-19 cases (first, second, or third week), how the pandemic will evolve. However, the results presented here revealed that this approach can be used as an alternative to the exponential growth model, traditionally employed as a prediction model, and can serve as a valuable tool for investigating how protective measures are changing the evolution of the pandemic.

Some European countries and, more recently, the United States of America have been making headlines around the world as important epicenters of the widespread COVID-19 (CV-19; severe acute respiratory syndrome coronavirus 2) pandemic. Unfortunately, this is mainly because of the high number of daily cases and deaths these countries have been facing and reporting (Saglietto et al., 2020). A question that could be raised is: could one, based on initial observations of the growth rate of COVID-19 cases, estimate how the cases would evolve? It is a challenging and complex question, as many aspects should be considered in answering it, especially those associated with social mobility restrictions and community transmission characteristics.
Even so, based on a reasonable mathematical model, some scenarios can be drawn that serve as a warning of how severe a specific situation can become (Biswas and Sen, 2020). It is well known that, as time passes, the number of CV-19 cases rises rapidly and then stabilizes after some time. The cumulative curve presents what is called step-like behavior. Frequently, the exponential model of growth is chosen for fitting and forecasting future cases (Remuzzi and Remuzzi, 2020). Nevertheless, even a good prediction model will start to deviate from the actual data in just a few weeks and, without any adjustment, becomes useless for this task. The idea of this study is to provide a more realistic growth model of confirmed CV-19 cases, so that anyone can promptly evaluate how mobility restrictions (for instance, social isolation) are changing the growth rate of the virus.

There is a very simple and concise mathematical function that behaves like a step function. It is known as the cumulative distribution function (CDF), whose expression is presented in Eq. (1) (Zandbergen and Chakraborty, 2006), where a_1, a_2, D_o, and p are adjustment parameters and D is any particular day after the first CV-19 cases were detected. Its first derivative dN/dD, presented in Eq. (2), provides the number of new cases to be expected using the adjusted parameters found in Eq. (1).

Because China has been declared free from new cases, it can be used as an example of how the proposed model (Eq. 1) describes the behavior of the confirmed cases and the new confirmed cases (Fig. 1). The model fits the number of confirmed cases well, with r^2 = 0.9912. The most poorly fitted region is probably related to an unusual event on the 21st day of the pandemic when, unlike on the other days, 14 thousand daily cases were officially confirmed in China (red square dot in Fig. 1) (Worldometers.info, 2020).

Based on the model, the Italian cases are presented in Fig. 2. Italy, unfortunately, made headlines around the world because it was one of the first countries to report enormous daily death numbers. Recently, an article was published presenting asymmetrical epidemic curves related to CV-19 cases in some European countries (Saglietto et al., 2020; Remuzzi and Remuzzi, 2020). The authors emphasized that some projections, based on an exponential model of growth, predicted more than 30 thousand cases for Italy by March 15, around 5 weeks after the first confirmed cases (Armocida et al., 2020; Remuzzi and Remuzzi, 2020). It is known that the exponential model, although it can produce a good estimate, is adequate to describe the number of confirmed cases only for a short period, in general one or two weeks from the start of the pandemic, as it quickly starts to deviate from the actual numbers as time passes. In Fig. 3, the exponential model (EM) is compared to the actual numbers. The EM was obtained using pandemic information from the first 14 days of confirmed cases. The EM (Fig. 3, red dashed line) deviates from the data by more than 100% on the 21st day, i.e., around one week after the EM was conceived.

Eqs. 1 and 2 were also employed to fit the data from some other European countries (Spain, Germany, and Austria), as presented in Fig. 4. As in the case of Italy, the dashed lines are only predictions calculated using the CDF and its derivative (Eqs. 1 and 2). Differences in extreme weather conditions can explain the differences observed in the spread of the virus between countries. Recently, Tosepu et al. (2020) reported the influence of climatic conditions on the spread of CV-19 in Indonesia. As an example, the correlations between the confirmed and new confirmed cases and the predicted ones are presented in Fig. 5 for Austria.
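The bodies of Eqs. (1) and (2) are not reproduced in this text, so the following is only a minimal sketch under a stated assumption: that Eq. (1) is the four-parameter logistic form N(D) = a_2 + (a_1 - a_2) / (1 + (D/D_o)^p), which agrees with the parameter descriptions given in the text (a_1 for small D, a_2 for large D, D_o near the inflection, p for the growth rate). The function names, the noise-free synthetic data, and the log-linear exponential fit are illustrative choices, not the authors' procedure. The sketch shows qualitatively why an exponential model fitted to the first 14 days overshoots a step-like curve by day 21.

```python
import numpy as np


def cdf_model(day, a1, a2, d0, p):
    """Assumed form of Eq. (1): cumulative cases on a given day.

    a1 is the small-day limit, a2 the large-day plateau, d0 the day near
    the inflection, and p controls how abruptly the curve rises.
    """
    day = np.asarray(day, dtype=float)
    return a2 + (a1 - a2) / (1.0 + (day / d0) ** p)


def cdf_model_derivative(day, a1, a2, d0, p):
    """Analytic dN/dD of cdf_model (new cases per day); requires day > 0."""
    day = np.asarray(day, dtype=float)
    ratio = (day / d0) ** p
    return (a2 - a1) * p * ratio / (day * (1.0 + ratio) ** 2)


# Hypothetical, noise-free "observations" generated from the model itself,
# using the average parameters quoted in the text (D_o = 32, p = 4.6)
# and the normalized limits a1 = 0, a2 = 1.
days = np.arange(1, 61)
cases = cdf_model(days, a1=0.0, a2=1.0, d0=32.0, p=4.6)

# Exponential model: straight-line fit of log(cases) against day,
# using only the first 14 days.
slope, intercept = np.polyfit(days[:14], np.log(cases[:14]), 1)
em_day21 = np.exp(intercept + slope * 21)

# Relative deviation of the exponential extrapolation on day 21
# (cases[20] is the synthetic value for day 21).
rel_dev_day21 = abs(em_day21 - cases[20]) / cases[20]
print(f"relative deviation on day 21: {rel_dev_day21:.1%}")
```

Because the synthetic curve is noise-free and follows the assumed form exactly, the size of the printed deviation should not be compared with the value reported for the Italian data; only the direction of the effect carries over.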
The r^2 values from the correlations indicate that the predicted cases correlate better with the confirmed cases than with the new confirmed ones. This happens because the number of new confirmed cases, despite its visible trend, is significantly more variable than the number of confirmed cases, as can be seen in Fig. 4.

The parameters of the CDF adjustment for some European countries and China are presented in Table 1. Parameters a_1 and a_2 are related to the extrapolation of the curves for small (pandemic emergence) and large (pandemic stabilization) values of D, respectively. D_o is near the inflection of the pandemic curve (quite close to the maximum of the curve of new cases, Eq. 2), and p is related to its growth rate. Larger values of p are related to a more abrupt growth of the pandemic curve at its beginning, and vice versa. Except for China and Norway, the values of D_o and p are close to 30 and 4.5, respectively. The averages of these parameters, followed by their standard deviations, are D_o = (32 ± 4) days and p = (4.6 ± 0.5). This means that the inflection of the growth curve (Eq. 1) occurs around 32 days after the first CV-19 cases are detected, and the curve of new daily cases has its maximum around this day. A value of p of this magnitude indicates that the CV-19 cases have a huge rate of growth 2 or 3 weeks after the first detected ones.

Fig. 3. Exponential model (EM) used for fitting the actual data up to 14 days from the first case. On the 21st day, the deviations between the actual data and the EM are larger than 100%.

As a final example, Fig. 6 presents curves generated by Eqs. 1 and 2 for distinct combinations of values of p and D_o. For comparison, a_1 and a_2 were chosen as 0 and 1, respectively, so that the curve minimum and maximum are 0 and 1. From Fig. 6 it is seen that an increase in p (from 3 to 6) makes the curve steeper at the beginning of the process. Also, higher values of p make the peak of Eq. 2 more pronounced and less spread out (p = 6 as compared to the others). Increasing D_o delays the inflection point and makes the peak more diffuse (the curve is flattened). Conversely, diminishing D_o in Eq. 1 accelerates the approach to its final value (a_2 = 1).

Fig. 6. The effect of varying p (top) and D_o (bottom) values in Eq. 1 (solid line) and Eq. 2 (dashed line). D_o = 32 and p = 4.6 are the average values of these parameters for the European modeled cases.

As a final consideration, the answer to the first question is that it is almost impossible to predict, based on the first cases of CV-19 (first 2 or 3 weeks), how the pandemic will evolve. This is because many factors have to be taken into account, for instance: the dynamics of the spread, population demographics, restrictions on social mobility, individual protection measures (use of protective masks and hygiene procedures), virus incubation time, transmission rates, meteorological factors, etc. Nevertheless, more realistic models can reveal reliable aspects of pandemic evolution. They can also serve as a valuable tool for decision-makers of any country to investigate how protective measures are changing the evolution of CV-19 cases.

CRediT authorship contribution statement

Fábio A.M. Cássaro: Conceptualization, Methodology, Writing - original draft. Luiz F. Pires: Investigation, Writing - original draft.

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

The authors would like to thank Ms. Kristy Lam from the Department of Natural Resources & Environmental Management (NREM), the University of Hawai'i at Mānoa, for assistance with the paper review. LFP would like to acknowledge the financial support provided by the Brazilian National Council for Scientific and Technological Development (CNPq) through Grant 304925/2019-5 (Productivity in Research).

References

Armocida et al., 2020. The Italian health system and the COVID-19 challenge.
Biswas and Sen, 2020. Space-time dependence of coronavirus (COVID-19) outbreak. arXiv.
Remuzzi and Remuzzi, 2020. COVID-19 and Italy: what next?
Saglietto et al., 2020. COVID-19 in Europe: the Italian lesson.
Tosepu et al., 2020. Correlation between weather and COVID-19 pandemic in Jakarta.
Zandbergen and Chakraborty, 2006. Improving environmental exposure analysis using cumulative distribution functions and individual geocoding.
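As a supplementary sketch of the parameter discussion around Fig. 6, the effect of p and D_o on the curve of new cases can be explored numerically. This again assumes that Eq. (1) has the form N(D) = a_2 + (a_1 - a_2) / (1 + (D/D_o)^p), with a_1 = 0 and a_2 = 1 as in the text. The values p = 3, 4.6, and 6 follow the text; the D_o values 25 and 40 are hypothetical, since the values used in Fig. 6 are not stated here. The function names are illustrative.

```python
import numpy as np

# Fine grid of days (step 0.01); it starts at 1 because the formula
# below divides by the day.
DAYS = np.linspace(1.0, 120.0, 11901)


def new_cases_curve(day, d0, p, a1=0.0, a2=1.0):
    """dN/dD for the assumed form N = a2 + (a1 - a2) / (1 + (day/d0)**p)."""
    ratio = (day / d0) ** p
    return (a2 - a1) * p * ratio / (day * (1.0 + ratio) ** 2)


def peak_of_new_cases(d0, p):
    """Return (day, height) of the maximum of the new-cases curve on DAYS."""
    curve = new_cases_curve(DAYS, d0, p)
    idx = int(np.argmax(curve))
    return float(DAYS[idx]), float(curve[idx])


# Vary p at fixed D_o = 32: the peak should grow taller with larger p.
for p in (3.0, 4.6, 6.0):
    day, height = peak_of_new_cases(32.0, p)
    print(f"D_o=32.0, p={p}: peak near day {day:.1f}, height {height:.4f}")

# Vary D_o at fixed p = 4.6 (25 and 40 are hypothetical values):
# the peak should move later and become lower as D_o increases.
for d0 in (25.0, 32.0, 40.0):
    day, height = peak_of_new_cases(d0, 4.6)
    print(f"D_o={d0}, p=4.6: peak near day {day:.1f}, height {height:.4f}")
```

For this assumed form, setting the second derivative to zero places the maximum of the new-cases curve at D_o ((p - 1)/(p + 1))^(1/p), which lies slightly before D_o. That is consistent with the description of D_o as near, rather than exactly at, the inflection.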