key: cord-0989009-0mx8gpap authors: Abdeljaoued-Tej, I. title: COVID-19 data analysis and modeling in Palestine date: 2020-04-29 journal: nan DOI: 10.1101/2020.04.24.20078279 sha: 45b2c2a129723a1caed0d61d5970524d41192105 doc_id: 989009 cord_uid: 0mx8gpap We estimate an actual number of infected cases in Palestine based on the 18-day effect from infection to death. We find that the number of cases in April 22 varies between 506 and 2 026 infected cases. We also focus on the reproductive number in Palestine based on population dynamics with two SEIR models. Dataset is from 5 March to 22 April 2020. With a transmission rate equal to 4.55 106, on May 22, the simulations predict 11 014 total infected cases in the optimistic scenario and 113 171 in the worst one. The crest of the pandemic is from 22 to 27 May 2020. The reproductive number R0 is equal to 1.54 for a fixed fraction of 0.6 of symptomatic cases that are reported and for a removal rate of 7. Palestinian COVID-19 mortality number is equal to 6 per million. It is small compared to countries neighboring Palestine. The infected number is equal to 88.4 per million, which is less than most of its neighbors. The basic reproduction number is still greater than 1. Changes to the transmission rate (over time) would be advisable, to fall R0 below the critical threshold. The novel coronavirus SARS-CoV-2 (COVID-19), has spread to many countries. Given the fragile health systems in most countries, and especially in war zones, we studied the data available in Palestine as well as their possible evolution. We address the following fundamental issues concerning this epidemic: How will the epidemic evolve in Palestine concerning the number of reported cases and unreported cases? How will the number of unreported cases influence the severity of the epidemic? Most of the Palestinian territories live under a blockade and apartheid. What is the effect of these oppressive policies on the epidemic? To answer these questions, we developed mathematical models that recover from data of reported cases and the number of unreported cases for the COVID-19 epidemic in Palestine. 1 We use a set of reported data to model the epidemic in Palestine: data of Table 4 is shared by the Palestinian Ministry of Health. It represents the epidemic transmission in Palestine. The first case was detected on March 5, 2020 . Four deaths are reported on 22 April 2020 with more than 466 total number of infected cases. The three phases of COVID-19 epidemics can be decomposed as a linear phase, an exponential growth, and a decreasing stage. The linear growth in the number of reported cases (from 5 March to 31 March) is where the number of daily reported cases is almost constant day after day. The second phase of the epidemic corresponds to an exponential increasing phase, it starts on April 1st, 2020. The third phase of the epidemic corresponds to a time-dependent exponentially decreasing transmission rate, due to major public interventions and social distancing measures [5] . Our analysis identifies the epidemics in Palestine as in the exponential phase. The main objective of this study is the estimation of the average number of infections one case can generate throughout the infectious period. It is the basic reproduction number of an infectious agent. Section 2 presents the data set and an estimation of an actual number of infected cases in Palestine based on the 18-day effect from infection to death. introduces two deterministic compartmental model based on the clinical progression of the disease and the epidemiological status of the individuals [7] . The SEIR model studies the disease transmission. Four compartments are present: Susceptible, Exposed, Infectious, and Recovered. The cumulative number of reported symptomatic infectious cases at time t, denoted by CR(t) is computed. We then construct numerical simulations and compare them with data. 2 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The data available in [2] gives the number of COVID-19 positive cases per day, the number of deaths due to COVID-19, and the number of reported recoveries. But this repository does not contain data from Palestine. The data entry was done manually 1 . The daily growth rate of the infected cases in where C j is the number of reported COVID-19 cases at time j. The median of the daily growth rate of reported cases in Palestine from March 6 to 22 th April 2020 is equal to 0.07. The total number of deaths is 4. It represents 6.7 ‰ of total reported cases. Patients who die on a given day j were infected much earlier, so the mortality rate denominator should be the total number of patients infected at the same time as those who died [12] . The mortality rate is defined by where M j is the total number of deaths at time j. The median mortality rate during this period is equal to 0.03. The progression of the number of cases reported over the last eighteen days P 18 is known in each country. It depends on the rate of containment and its effectiveness: Figure 3 shows the progression rate of COVID-19 in Palestine. The median of P 18 is equal to 3.95 and it is used to compute the estimated number of reported and unreported cases [1] . We estimate the number of cases in Palestine as of April 22 th , 2020 by modulating the estimated mortality rate. In Figure 4 we take different values of T : from 0.5 to 2%, where P 18 = 3.38 with an account delays of 18 days between the onset of symptoms (or asymptomatic) and death. The more unfavorable case is 2 026 infected (unreported and unreported cases) and the optimistic simulation predicts 506 infected cases on 22 April. 1 Data is available on https://www.worldometers.info/coronavirus/ and at https://corona.ps/details On the other hand, we scale the number of deaths per 1,000 population due to COVID-19 related to the number of registered infected people per million population. To measure a death ratio, a delay of 18 days is used between the mortality number and the infected number. This delay is justified by the days between infection and death, which is about 17 to 19 days [3] . The mortality ratio per thousand to infected cases per million is shown in Table 1 . eatures are detailed in Figure 13 . The relationships between deaths a given day j and the number of cases 18 days before j − 18 is linear for several countries. 4 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. Assume that infected individuals were not infectious during the incubation period [13] . Assume population growth rate and death rate are zero. Assume people exhibit consistent behaviors before and during the epidemic [14] . Assume no quarantine or other mitigation intervention is implemented. All these assumptions directly influence the model, which is dependent on several parameters. See [10] for a more precise idea of the evolution of these assumptions. The purpose of the model is to predict forward in time the future number of cases in a time-line of the epidemic from early reported case data. A typical SEIR (susceptible, exposed, infectious, removed) model is done in [4] . It can be described as a system of differential equations: where, t 0 is the beginning date of the epidemic, t ≥ t 0 is time in days, S(t) is the number of susceptible at time t, E(t) is the number of exposed at time t, I(t) is the number of infectious at time t, R(t) is the number of removed, which includes the number of recovered and dead at time t, N (t) is the population at time t and N (t) = S(t) + E(t) + I(t) + R(t). Between S and I, the transition rate is where β is the average number of contacts per person per time, multiplied by the probability 5 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org/10.1101/2020.04.24.20078279 doi: medRxiv preprint of disease transmission in a contact between a susceptible and an infectious subject, and I/N is the fraction of contact occurrences that involve an infectious individual. The transition rates are α and γ. We modify the model to estimate the number of unreported infected and asymptomatic cases out of the total number of cases. We propose to use a mathematical model, which recovers from data of reported cases, the number of unreported cases for the COVID-19 epidemic in Palestine. A sophisticated SEIR model is developed in [11] , but we choose to folow the approach of [5] and the model in [6] Figure 5 produces Figure 6 . The parameters are listed in Table 2 . The model contains a symptom class with both reported and unreported cases. The SEIR is given by the following system of ordinary differential equations: Here, S(t) is the number of individuals susceptible to infection at time t, I(t) is the number of asymptomatic infectious individuals at time t, R(t) is the number of reported symptomatic infectious in-6 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org/10.1101/2020.04.24.20078279 doi: medRxiv preprint dividuals (i.e. symptomatic infectious with severe symptoms) at time t, and U (t) is the number of unreported symptomatic infectious individuals (i.e. symptomatic infectious with mild symptoms) at time t. This system is supplemented by initial data We assume that the removal rate ν is the sum of the removal rate of reported symptomatic infectious individuals, and of the removal rate of unreported symptomatic infectious individuals due to all other causes, such as mild symptom, or other reasons. The cumulative number of reported symptomatic infectious cases at time t is denoted by CR(t). We assume that CR(t) has the following special form: We obtain the model starting time of the epidemic t 0 : We fix S 0 to 5 077 760, which corresponds to the total population of Palestine. We assume that the variation in S(t) is small during the period considered, and we fix ν, η, f . We can estimate the parameter τ and the initial conditions U 0 and I 0 from the cumulative reported cases CR(t). We then construct numerical simulations and compare them with data. We obtain I(t) = I 0 exp(χ 2 (t − t 0 )) and I 0 = χ 2 f ν . We must have U (t) = U 0 exp(χ 2 (t − t 0 )). So, by substituting these expressions into 7 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org/10.1101/2020.04.24.20078279 doi: medRxiv preprint previous identities, we obtain: We fix τ in such a way that the value χ 2 becomes the dominant eigenvalue of and (I 0 , U 0 ) is the positive eigenvector associated with this dominant eigenvalue χ 2 . Thus, we apply implicitly the Perron-Frobenius theorem. Moreover, the exponentially growing solution (I(t), U (t)) that we consider (which is starting very close to (0, 0)) follows the direction of the positive eigenvector associated with the dominant eigenvalue χ 2 . The basic reproductive number becomes: The first infected case in Palestine is documented on March 5, 2020. It is used as the first day of the forecast for Palestine. Fitting Palestinian data from April 15 to April 22 enables to compute CR(t) = χ 1 exp(χ 2 t) − 1. We find χ 1 = 30.91 and χ 2 = 3.16. This model shows that the starting of the epidemic is February 15, 2020 (the value of t 0 = −59). As long as the number of reported cases follows CR(t), we can predict the future values of reported cases in Figure 11 . For χ 1 = 30.91, χ 2 = 0.06 and χ 3 = 1, we obtain prediction in Table 3 . Thus, the exponential formula overesti- mates the number of infected cases after 4 months (see Figure 12) . From now on, we fix the fraction 8 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org /10.1101 /10. /2020 For N a total population equal to 5 077 760, we suppose that S 0 = N/100 is the susceptible population to be infected by COVID-19. Figure 7 and Figure 8 show that the range between the most naive model and the slightly more elaborate one is between 113 000 and 11 014 infected cases. But both converge towards the first peak of the pandemic at the end of May 2020. The results of this model, if correct, show very good management of the epidemic regarding the number of deaths compared to its neighboring countries. For these values of parameters, R 0 = 1.54. In the literature, the value of R 0 varies from 1.4 to 3.9 [8] . Our simulations in Figure 9 give amounts that belong to this range. We estimated when the 9 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. An epidemic outbreak of a new human coronavirus COVID-19 will occur in Palestine. The unreported cases and the disease transmission rate are useful information. We estimate an actual number of infected cases in Palestine based on the 18-day effect from infection to death. We find that the number of cases in April 22 varies between 506 and 2 026 infected cases. To recover these effect of unreported cases, we applied a method developed in [7] to predict the evolution of a COVID-19 epidemic in Palestine, based on reported case data. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org /10.1101 /10. /2020 To estimate the reproductive number R 0 , we initially get the time-dependent function of reported cases CR(t). With the SEIR method used, the reproductive number is very sensitive to the data set: Formula for CR(t) is descriptive for the reported case data for Palestine, and R 0 is still greater than 1. To reduce R 0 to less than 1, public measures should be taken, such as isolation, quarantine, and public closings. These measures exacerbate the spread of this disease and mitigate the final size of the epidemic [11] . The model used in this paper incorporates social distancing measures through the transmission rate τ = 4.55 10 −6 . The consequences of late public interventions may have severe consequences for the epidemic outcome. Changes to this rate (over time) would be advisable when the epidemics will peak. In Figure 9 , we graph R 0 as a function of f and 1/ν, to illustrate the importance of these values in the evolution of the epidemic. The accuracy of these values depend on the input of medical and biological epidemiologists. We find that minimum value of R 0 is equal to 1.18 (f = 0.7 and 1/ν = 3), the maximum is equal to 3.3 (f = 0.1 and 1/ν = 8) and the median is equal to 2.24. Figure 9 : In this figure, we use 1/η = 7 days, and we plot the basic reproductive number R 0 as a function of f and 1/ν using τ = 4.55 10 −6 , which corresponds to the data for Palestine in Table 4 Figure 10: Number of infected cases and deaths. The data are divided into different groups. These number are normalized per million for reported infected cases or per miles for death cases i . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . Table 4 ii . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org/10.1101/2020.04.24.20078279 doi: medRxiv preprint Figure 13 : Number of deaths at t 0 as a function of the number of cases eighteen days earlier (t 0 -18 days). The data are divided into three groups according to the number of deaths (depending on whether the number of deaths is lower in the country) iii . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted April 29, 2020. Table 4 : Data for Palestine -accumulated reported cases, CR(t), and casualties iv . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted April 29, 2020. . https://doi.org/10.1101/2020.04.24.20078279 doi: medRxiv preprint Estimation of Tunisia COVID-19 infected cases based on mortality rate. medRxiv An interactive web-based dashboard to track covid-19 in real time. The Lancet infectious diseases Estimating the number of infections and the impact of non-pharmaceutical interventions on covid-19 in 11 european countries Modeling infectious diseases in humans and animals Understanding unreported cases in the covid-19 epidemic outbreak in wuhan, china, and the importance of major public health interventions The parameter identification problem for sir epidemic models: identifying unreported cases Predicting the number of reported and unreported cases for the covid-19 epidemic in south korea, italy, france and germany Novel coronavirus 2019-ncov: early estimation of epidemiological parameters and epidemic predictions. medRxiv Transmission of 2019-ncov infection from an asymptomatic contact in germany An updated estimation of the risk of transmission of the novel coronavirus (2019-ncov) Estimation of the transmission risk of the 2019-ncov and its implication for public health interventions Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet infectious diseases Nowcasting and forecasting the potential domestic and international spread of the 2019-ncov outbreak originating in wuhan, china: a modelling study A comparative study on the clinical features of COVID-19 pneumonia to other pneumonias The author would like to thank Pr. Ahmed Abbes for his valuable feedback, comments and rewarding discussions.