key: cord-0538372-jdy0os21 authors: Liu, Zhihua; Magal, Pierre; Seydi, Ousmane; Webb, Glenn title: Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data date: 2020-02-27 journal: nan DOI: nan sha: ea32db7bca64cdc76cf1afeeadb337ba3f396be9 doc_id: 538372 cord_uid: jdy0os21 We model the COVID-19 coronavirus epidemic in China. We use early reported case data to predict the cumulative number of reported cases to a final size. The key features of our model are the timing of implementation of major public policies restricting social movement, the identification and isolation of unreported cases, and the impact of asymptomatic infectious cases. Many mathematical models of the COVID-19 coronavirus epidemic in China have been developed, and some of these are listed in our references [4, 7, 9, 10, 11, 12, 13, 14, 15] . We develop here a model describing this epidemic, focused on the effects of the Chinese government imposed public policies designed to contain this epidemic, and the number of reported and unreported cases that have occurred. Our model here is based on our model of this epidemic in [5] , which was focused on the early phase of this epidemic (January 20 through January 29) in the city of Wuhan, the epicenter of the early outbreak. During this early phase, the cumulative number of daily reported cases grew exponentially. In [5] , we identified a constant transmission rate corresponding to this exponential growth rate of the cumulative reported cases, during this early phase in Wuhan. On January 23, 2020, the Chinese government imposed major public restrictions on the population of Wuhan. Soon after, the epidemic in Wuhan passed beyond the early exponential growth phase, to a phase with slowing growth. In this work, we assume that these major government measures caused the transmission rate to change from a constant rate to a time dependent exponentially decreasing rate. We identify this exponentially decreasing transmission rate based on reported case data after January 29. We then extend our model of the epidemic to the central region of China, where most cases occurred. Within just a few days after January 29, our model can be used to project the time-line of the model forward in time, with increasing accuracy, to a final size. The model consists of the following system of ordinary differential equations: This system is supplemented by initial data Here t ≥ t 0 is time in days, t 0 is the beginning date of the model of the epidemic, S(t) is the number of individuals susceptible to infection at time t, I(t) is the number of asymptomatic infectious individuals at time t, R(t) is the number of reported symptomatic infectious individuals at time t, and U (t) is the number of unreported symptomatic infectious individuals at time t. Asymptomatic infectious individuals I(t) are infectious for an average period of 1/ν days. Reported symptomatic individuals R(t) are infectious for an average period of 1/η days, as are unreported symptomatic individuals U (t). We assume that reported symptomatic infectious individuals R(t) are reported and isolated immediately, and cause no further infections. The asymptomatic individuals I(t) can also be viewed as having a low-level symptomatic state. All infections are acquired from either I(t) or U (t) individuals. The parameters of the model are listed in Table 1 and a schematic diagram of the model is given in Figure 1 . Fraction of asymptomatic infectious that become reported symptomatic infectious fixed ν 1 = f ν Rate at which asymptomatic infectious become reported symptomatic fitted Rate at which asymptomatic infectious become unreported symptomatic fitted 1/η Average time symptomatic infectious have symptoms fixed Table1 Table 1 : Parameters of the model. We plot the data for daily reported cases and the cumulative reported cases in Figure 2 . We assume f = 0.8, which means that 20% of symptomatic infectious cases go unreported. We assume η = 1/7, which means that the average period of infectiousness of both unreported symptomatic infectious individuals and reported symptomatic infectious individuals is 7 days. We assume ν = 1/7, which means that the average period of infectiousness of asymptomatic infectious individuals is 7 days. These values can be modified as further epidemiological information becomes known. In our previous work, we assumed that in the early phase of the epidemic (January 20 through January 29), the cumulative number of reported cases grew approximately exponentially, according to the formula: These values of χ 1 , χ 2 , and χ 3 were fitted to reported case data from January 20 to January 29. We assumed the initial value S 0 = 11, 000, 000, the population of the city Wuhan, which was the epicenter of the epidemic outbreak. The other initial conditions are The value of the transmission rate τ (t), during the early phase of the epidemic, when the cumulative number of reported cases was approximately exponential, is the constant value The initial time is The value of the basic reproductive number is These parameter formulas were derived in [5] . After January 23, strong government measures in all of China, such as isolation, quarantine, and public closings, strongly impacted the transmission of new cases. The actual effects of these measures were complex, and we use an exponential decrease for the transmission rate τ (t) to incorporate these effects after the early exponential increase phase. The formula for τ (t) during the exponential decreasing phase was derived by a fitting procedure. The formula for τ (t) is where January 24 and µ = 0.12 are fitted from on-going reported case data after January 24. In Figure 3 , we plot the graph of τ (t). We numerically simulated the model (2.1) to project forward in time the time-line of the epidemic after the government imposed interventions. We set τ (t) in (4.1) to S 0 τ (t)/1, 400, 050, 000, where S 0 = 11, 000, 000 and 1, 400, 050, 000 is the population of mainland China, excluding Hong Kong, Macao and Taiwan. We set S(t 0 ) in (2.2) to 1, 400, 050, 000. We set t 0 = 5.0, I(t 0 ) = 3.7, U (t 0 ) = 0.2 and R(t 0 ) = 0. In Figure 4 , we plot the graphs of CR(t) (cumulative reported cases), CU (t) (cumulative unreported cases), R(t), and U (t) from the numerical simulation. In Figure 5 we plot the graphs of the reported cases R(t) and the infectious pre-symptomatic cases I(t). The blue dots are obtained from the reported cases data ( Table 2) for each day beginning on January 26, by subtracting from each day, the value of the reported cases one week earlier. Our model transmission rate τ (t) can be modified to illustrate the effects of an earlier or later implementation of the major public policy interventions that occurred in this epidemic. The implementation one week earlier (24 is replaced by 17 in (4.1)) is graphed in Figure 6 (top). All other parameters and the initial conditions remain the same. The total reported cases is approximately 4, 500 and the total unreported cases is approximately 1, 100. The implementation one week later (24 is replaced by 31 in (4.1)) is graphed in Figure 6 (bottom). The total reported cases is approximately 820, 000 and the total unreported cases is approximately 200, 000. The timing of the institution of major social restrictions is critically important in mitigating the epidemic. The number of unreported cases is of major importance in understanding the evolution of an epidemic, and involves great difficulty in their estimation. The data from January 20 to February 15 for reported cases in Table 2 , was only for tested cases. Between February 11 and February 15, additional clinically diagnosed case data, based on medical imaging showing signs of pneumonia, was also reported by the Chinese CDC. Since February 16, only tested case data has been reported by the Chinese CDC, because new NHC guidelines removed the clinically diagnosed category. Thus, after February 15, there is a gap in the reported case data that we used up to February 15. The uncertainty of the number of unreported cases for this epidemic includes this gap, but goes even further to include additional unreported cases. We assumed previously that the fraction f of reported cases was f = 0.8 and the fraction of unreported cases was 1 − f = 0.2. Our model formulation can be applied with varying values for the fraction f . In Figure 7 , we provide illustrations with the fraction f = 0.4 (top) and f = 0.6 (bottom). The formula for the time dependent transmission rate τ (t) in (4.1) involves a new value for τ 0 and µ for each case. The final size of the epidemic when f = 0.4 is approximately 164,700 cases, and the final size of the epidemic when f = 0.6 is approximately 110,700 cases. From these simulations, we see that estimation of the number of unreported cases has major importance in understanding the severity of this epidemic. The number of days an asymptomatic infected individual is infectious is uncertain. We simulate in Figure 8 the model with ν = 1/3, which means that asymptomatic infected individuals are infectious on average 3 days before becoming symptomatic. The result is a small decrease in the final size of the epidemic, as compared to the case that ν = 1/7. We illustrate the importance of the level of government imposed public restrictions by decreasing the value of µ in formula (4.1) for τ (t). In Figure 9 we set µ = 0.09 instead of µ = 0.12, and the result is a significant increase in the number of cases. We have developed a model of the COVID-19 epidemic in China that incorporates key features of this epidemic: (1) the importance of the timing and magnitude of the implementation of major government public restrictions designed to mitigate the severity of the epidemic; (2) the importance of both reported and unreported cases in interpreting the number of reported cases; and (3) the importance of asymptomatic infectious cases in the disease transmission. In our model formulation, we divide infectious individuals into asymptomatic and symptomatic infectious individuals. The symptomatic infectious phase is also divided into reported and unreported cases. Our model formulation is based on our work [5] , in which we developed a method to estimate epidemic parameters at an early stage of an epidemic, when the number of cumulative cases grows exponentially. The general method in [5] , was applied to the COVID-19 epidemic in Wuhan, China, to identify the constant transmission rate corresponding to the early exponential growth phase. In this work, we use the constant transmission rate in the early exponential growth phase of the COVID-19 epidemic identified in [5] . We model the effects of the major government imposed public restrictions in China, beginning on January 23, as a time-dependent exponentially decaying transmission rate after January 24. With this time dependent exponentially decreasing transmission rate, we are able to fit with increasing accuracy, our model simulations to the Chinese CDC reported case data for all of China, forward in time through February 15, 2020. Our model demonstrates the effects of implementing major government public policy measures. By varying the date of the implementation of these measures in our model, we show that had implementation occurred one week earlier, then a significant reduction in the total number of cases would have resulted. We show that if these measures had occurred one week later, then a significant increase in the total number of cases would have occurred. We also show that if these measures had been less restrictive on public movement, then a significant increase in the total size of the epidemic would have occurred. It is evident, that control of a COVID-19 epidemic is very dependent on an early implementation and a high level of restrictions on public functions. We varied the fraction of unreported cases involved in the transmission dynamics. We showed that if this fraction is higher, then a significant increase in the number of total cases results. If it is lower, then a significant reduction occurs. It is evident, that control of a COVID-19 epidemic is very dependent on identifying and isolating symptomatic unreported infectious cases. We also decreased the parameter ν (the reciprocal of the average period of asymptomatic infectiousness), and showed that the total number of cases in smaller. It is also possible to decrease η (the reciprocal of the average period of unreported symptomatic infectiousness), to obtain a similar result. It is evident that understanding of these periods of infectiousness is important in understanding the total number of epidemic cases. Our model was specified to the COVID-19 outbreak in China, but it is applicable to any outbreak location for a COVID-19 epidemic. Identifying the number of unreported cases in SIR epidemic models Parameter identification in epidemic models Parameter estimation in epidemic models: simplified formulas The continuing 2019-nCoV epidemic threat of novel corona viruses to global health -The latest 2019 novel corona virus outbreak in Wuhan, China Understanding unreported cases in the 2019-nCov epidemic outbreak in Wuhan, China, and the importance of major public health interventions The parameter identification problem for SIR epidemic models: Identifying Unreported Cases Initial cluster of novel coronavirus (2019-nCoV) infections in Wuhan, China Is consistent with substantial human-to-human transmission The Rate of Under ascertainment of Novel Coronavirus (2019-nCoV) Infection: Estimation Using Japanese Passengers Data on Evacuation Flights Real-time forecasts of the COVID-19 epidemic in China from An updated estimation of the risk of transmission of the novel coronavirus (2019-nCov) Estimation of the transmission risk of the 2019-nCoV and its implication for public health interventions Novel coronavirus outbreak in Wuhan, China, 2020: Intense surveillance Is vital for preventing sustained transmission in new locations Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study Estimating the unreported number of novel Coronavirus (2019-nCoV) cases in China in the first half of January 2020: A data-driven modelling analysis of the early outbreak