key: cord-0820575-9zeqigqa authors: magal, pierre; Webb, Glenn title: Predicting the number of reported and unreported cases for the COVID-19 epidemic in South Korea, Italy, France and Germany date: 2020-03-24 journal: nan DOI: 10.1101/2020.03.21.20040154 sha: 1207759681b7b27543e1c5785b0cdb4007e61d68 doc_id: 820575 cord_uid: 9zeqigqa We model the COVID-19 coronavirus epidemic in South Korea, Italy, France, and Germany. We use early reported case data to predict the cumulative number of reported cases to a final size. The key features of our model are the timing of implementation of major public policies restricting social movement, the identification and isolation of unreported cases, and the impact of asymptomatic infectious cases. In a previous work [2] , we developed a model of the COVID-19 epidemic in China. The purpose of the model was to predict forward in time the future number of cases in a time-line of the epidemic from early reported case data. The model in [2] focused on the Chinese government imposed public policies designed to contain the epidemic. Our goal here is to apply this analysis to the COVID-19 epidemics in South Korea, Italy, France, and Germany. In an early phase of the epidemic, the reported case data grows exponentially, which corresponds to a constant transmission rate. We assume that government measures and public awareness cause this early constant transmission rate to change to a time dependent exponentially decreasing rate. We identify the early constant transmission rate using a method developed in [1] . We then identify the time dependent exponentially decreasing transmission rate from reported case data, and project forward the time-line of the epidemic course. We refer to [3, 4] for more results about this topic. Our model incorporates the following essential ingredients of COVID-19 epidemics: (1) the number of asymptomatic infectious individuals (with very mild or no symptoms), (2) the number of symptomatic reported infectious individuals (with severe symptoms) and (3) the number of symptomatic unreported infectious individuals (with mild symptoms). COVID-19 epidemics can be decomposed into three phases: Phase I: The first phase of the epidemic corresponds to a linear growth in the number of reported cases, where the number of daily reported case is almost constant day after day; We use data from the Korean Center for Disease Control, the Italian Ministry of Health, the French Public Agency of Health, and the Robert Koch Institute of Germany. The cumulative reported cases are given in Figure 1 and the daily reported cases are given in Figure 2 . is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The model consists of the following system of ordinary differential equations: This system is supplemented by initial data Here t ≥ t 0 is time in days, t 0 is the beginning date of the model of the epidemic, S(t) is the number of individuals susceptible to infection at time t, I(t) is the number of asymptomatic infectious individuals at time t, R(t) is the number of reported symptomatic infectious individuals at time t, and U (t) is the number of unreported symptomatic infectious individuals at time t. The transmission rate at time t is τ (t). Asymptomatic infectious individuals I(t) are infectious for an average period of 1/ν days. Reported symptomatic individuals R(t) are infectious for an average period of 1/η days, as are unreported symptomatic individuals U (t). We assume that reported symptomatic infectious individuals R(t) are reported and isolated immediately, and cause no further infections. The asymptomatic individuals I(t) can also be viewed as having a low-level symptomatic state. All infections are acquired from either I(t) or U (t) individuals. The fraction f of asymptomatic infectious become reported symptomatic infectious, and the fraction 1 − f become unreported symptomatic infectious. The rate asymptomatic infectious become reported symptomatic is ν 1 = f ν, the rate asymptomatic infectious become unreported symptomatic is The cumulative number of reported cases at time t is given by the formula and the cumulative number of unreported at time t is given by the formula The parameters and initial conditions of the model are given in Table 1 and a flow diagram of the model is given in Figure 3 . Fraction of asymptomatic infectious that become reported symptomatic infectious fixedt ν 1 = f ν Rate at which asymptomatic infectious become reported symptomatic fitted Rate at which asymptomatic infectious become unreported symptomatic fitted 1/η Average time symptomatic infectious have symptoms fixed Table 1 : Parameters and initial conditions of the model. . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. We assume f = 0.6 or f = 0.1, which means that 40% or 90% of symptomatic infectious cases go unreported. The actual value of f is unknown. We assume η = 1/7, which means that the average period of infectiousness of both unreported symptomatic infectious individuals and reported symptomatic infectious individuals is 7 days. We assume ν = 1/7, which means that the average period of infectiousness of asymptomatic infectious individuals is 7 days. These values can be modified as further epidemiological information becomes known. We assume that in the second phase of a COVID-19 epidemic, the cumulative number of reported cases CR(t) grows approximately exponentially, according to the formula: We fix the value χ 3 = 1. The values of χ 1 and χ 2 are fitted to cumulative reported case data in the early phase of the epidemic, when it is recognized that CR(t) is growing exponentially (i.e. we use an exponential fit χ 1 exp(χ 2 t) to fit the data CR(t) + 1). We assume the initial value S 0 , corresponds to the population of the region of the reported case data. The value of the susceptible population S(t) is assumed to be only slightly changed by removal of the number of people infected in the beginning of the second phase. The other initial conditions are The value of the transmission rate τ (t), during the second phase of the epidemic, when the cumulative number of reported cases grows approximately exponential, is the constant value The initial time for the beginning of the early exponential growth phase is The value of the basic reproductive number is These formulas for I 0 , U 0 , t 0 , τ 0 , and R 0 were derived in [1] . Their numerical values, which are technically theoretic, are essential for identification of the exponential growth rate of CR(t) in the second phase. During the second phase τ (t) ≡ τ 0 is constant. When strong government measures such as isolation, quarantine, and public closings are implemented, the third phase begins. The actual effects of these 4 . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. . measures are complex, and we use an exponential decrease for a time-dependent decreasing transmission rate τ (t) in the third phase to incorporate these effects. The formula for τ (t) during the third phase is The date N and the value of µ are chosen so that the cumulative reported cases in the numerical simulation of the epidemic aligns with the cumulative reported case data after day N , when the public measures take effect. In this way we are able to project forward the time-path of the epidemic after the government imposed public restrictions take effect. We illustrate τ (t) in Figure 3 for a typical case. The daily number of reported cases from the model can be obtained by computing the solution of the following equation: (4.7) In this section we apply the method described in section 4 to the data coming from South Korea, Italy, France and Germany [5, 6, 7, 8]. In Table 2 to March 6 for Germany. The parameters ν = 1/7 and η = 1/7. The values of I 0 U 0 , τ 0 , and t 0 are obtained by using (4.2) to (4.4). . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. . https://doi.org/10.1101/2020.03.21.20040154 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. . https://doi.org/10.1101/2020.03.21.20040154 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. Figure 12 : Graphs of the model simulation and data as in Figure 11 , for a longer time period. . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. 14 . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint Figure 14 : Graphs of the model simulation and data as in Figure 13 , for a longer time period. We have applied a method developed in [1] and [2] to predict the evolution of a COVID-19 epidemic in a geographical region, based of reported case data in that region. Our method uses early data, when the epidemic is in its second phase and growing exponentially, corresponding to a constant transmission rate. In [1] we demonstrated a method to identify this constant transmission rate. When public measures are begun in order to ameliorate the epidemic, a third phase begins, which we model with a time-dependent exponentially decreasing transmission rate in [2] . In [2] we applied this method to mainland China, and demonstrated the ability of our model to predict the forward time-line of the epidemic. In Figure 4 in [2] , we showed how the prediction unfolded week by week, with increasing agreement with reported case 15 . CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted March 24, 2020. . In this work we use the cumulative number of reported cases, the daily number of reported cases, and the weekly number of reported cases for four countries. With this data, we project the future number of cases, both reported and unreported, in each country. We assume a fraction f of total cases are unreported. We simulate our model with f = 0.6 and f = 0.1 for each country. The fractions f = 0.6 and f = 0.1 correspond to lesser and greater final sizes, respectively. For South Korea, the epidemic has subsided, because of the major measures that were implemented to restrict public distancing. For Italy, France, and Germany, the epidemics are still rising, because these measures have been implemented only recently. Our model incorporates social distancing measures through the time dependent transmission rate τ (t) in the third phase. This rate involves µ (the rate of exponential decay corresponding to these measures) and N the day at which they become effective. It is evident that these measures should start as early as possible, and should be as strong as possible. The consequences of late public interventions may have severe consequences for the epidemic outcome (as illustrated in Figure 6 in [2] ). In the case of South Korea, the peak of the epidemic occurred approximately February 29. In Figures 5 and 6 , we see that our model agrees very well the data for South Korea. Accordingly to our model, the daily number of cases reaches a maximum of approximately 700 cases near the turning point February 29. Compared to South Korea, the public interventions in Italy, France, and Germany were relatively late. The peak of the epidemic occurs in Italy around April 9, in France around April 14, and in Germany around May 1. The maximum daily number of cases is between 10 000 and 15 000 in Italy, between 12 000 and 20 000 in France, and between 200 000 and 500 000 in Germany. These numbers will likely remain high for an extended period of several months for these three countries, without further major public measures. Conflicts of Interest: Declare conflicts of interest or state "The authors declare no conflict of interest." Understanding unreported cases in the 2019-nCov epidemic outbreak in Wuhan, China, and the importance of major public health interventions Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data, medRxiv Estimates of the severity of COVID -19 disease, medRxiv Early dynamics of transmission and control of COVID-19: a mathematical modelling study