key: cord-0665969-w09i7ct2 authors: Le'on, Ugo Avila-Ponce de; P'erez, 'Angel G. C.; Avila-Vales, Eric title: A data driven analysis and forecast of an SEIARD epidemic model for COVID-19 in Mexico date: 2020-04-16 journal: nan DOI: nan sha: d5254d0436c5b59e2e0d733b75de4b53f9550e4e doc_id: 665969 cord_uid: w09i7ct2 We propose an SEIARD mathematical model to investigate the current outbreak of coronavirus disease (COVID-19) in Mexico. We conduct a detailed analysis of this model and demonstrate its application using publicly reported data. We calculate the basic reproduction number ($R_0$) via the next-generation matrix method, and we estimate the per day infection, death and recovery rates. We calibrate the parameters of the SEIARD model to the reported data by minimizing the sum of squared errors and attempt to forecast the evolution of the outbreak until June 2020. Our results estimate that the peak of the epidemic in Mexico will be around May 2, 2020. Our model incorporates the importance of considering the aysmptomatic infected individuals, because they represent the majority of the infected population (with symptoms or not) and they could play a huge role in spreading the virus without any knowledge. The COVID-19 pandemic originated in Wuhan, China in December 2019. Since then, the number of cases has accelerated in China and subsequently all over the world. The causative agent is a new betacoronavirus related to In this work, we will use a compartmental differential equation model for the spread of COVID-19 in Mexico. The model monitors the dynamics of six subpopulations, which are: susceptible (S(t)), exposed (E(t)), infected (I(t)), asymptomatic (A(t)), recovered (R(t)) and dead (D(t)). The model simulations will be carried out with the following assumptions: (c) At first, no interventions were applied to stop the spread of COVID-19. (d) The population is constant; no births are allowed, and we only take into account the fatalities associated to COVID-19. Figure 1 shows a diagram of the flow through the compartmental subpopulations. Susceptible population S(t): This subpopulation will remain constant because recruiting individuals is not allowed in our model. The susceptible population will decrease after an infection, an acquired characteristic due to the interaction with an infected person or asymptomatic one. The transmission coefficients will be βI and βA. The rate of change of the susceptible population is expressed in the following equation: (1) Figure 1 : Flow diagram of our mathematical model to evaluate the behavior of the spread of nCoV-2019 in Mexico. S: susceptible, E: exposed, I: infected, A: infected but without symptoms (asymptomatic), R I : recovered from symptomatic infection, R A : recovered from asymptomatic infection, D: dead. Exposed population E(t): This subpopulation consists of individuals that are infected but cannot infect others. The population decreases at a rate w to become infected or asymptomatic. Consequently, Infected population I(t): Infected (symptomatic) individuals are generated at a proportion p from the exposed class. They recofer at a rate γ and die at a rate δ. This is the only population that acknowledges death. Thus, Asymptomatic population A(t): This population is considered an infected population, but the individuals do not develop the common symptoms of COVID-19. Asymptomatic individuals are important to model because they have the ability to spread the virus without knowing; they are produced at a rate 1 − p and recover at a rate γ. Consequently, Recovered populations R I (t) and R A (t): All individuals infected with symptoms or not will recover at a rate γ. We subdivide the recovered population in two compartments: individuals who recover after having symptoms (R I ) and individuals who recover from asymptomatic infection (R A ). Hence Dead population D(t): Infected individuals with symptoms die at a rate δ, that is, dD dt = δI. Hence, the system of differential equations that will model the dynamics of coronavirus spread in Mexico is: We also observe that N : The rest of this work is organized as follows. In Section 3, we calculate the basic reproduction number of the model. We present in Section 4 the assumptions and procedure we used for the implementation of our model, and we describe in Section 5 the results obtained when performing simulations with the best fit parameters. There exists a disease-free equilibrium, which is given by S = N , E = I = A = R I = R A = D = 0, and we will denote it by x 0 . We calculate the basic reproduction number R 0 based on this steady state. We use the nextgeneration matrix method proposed by Diekmann et al. [5] . To find R 0 , we must solve the equation R 0 = ρ(F V −1 ), where F and V are the derivatives of the new infections matrix F and the transition matrix V, respectively, evaluated at the disease-free equilibrium. Then The derivative of F at x 0 is: The transition matrix is The inverse of V is We need to find the eigenvalues of F V −1 , which are λ 1 = βp δ+γ + β(1−p) γ , λ 2 = 0 and λ 3 = 0. Then, the basic reproduction number is given by the dominant eigenvalue, that is, To describe the evolution of the epidemic in Mexico taking into account the social distancing measures taken by the government, we will assume that the infection rate, recovery rate and death rate are time-dependent functions, similar to those used in [3] . To model the effect of epidemic control measures, which cause the number of contacts per person per unit time to decrease as the epidemic progresses, we describe the infection rate by the function where β 0 + β 1 is the initial infection rate. This rate decreases exponentially to the value β 1 with a characteristic time of decrease τ β . The time of recovery for patients may also vary with time due to the medical staff improving their therapeutic procedures. Hence, we will assume that the recovery rate is modeled by the function where γ 0 is the recovery rate at time zero, and γ 0 + γ 1 is the recovery rate at a later time, which is reached after τ γ days of adaptation. Lastly, the death rate may decrease with time due to the adaptation of the pathogen or the development of more advanced treatments. Hence, we can model this with the function where δ 0 + δ 1 is the initial death rate, which decreases to the value δ 1 with a characteristic time τ δ . If we replace the constant parameters β, δ and γ in equation (8) with the aforementioned time-dependent functions, we can define as the effective daily reproduction ratio, which measures the number of new infections produced by a single infected individual per day, taking into account the evolving public health interventions and available resources [11] . The set of differential equations was solved using Matlab 2016b with the ode45 solver, which is based on an explicit Runge-Kutta (4,5) formula. Our model was calibrated using the cases of COVID-19 in Mexico. The data were collected in the period since the first reported case of COVID-19 in Mexico (February 28) until April 14 from the open source repository of Johns Hopkins University [7] . The optimization of parameters to describe the outbreak of COVID-19 in Mexico were fitted by minimizing the Sum of Squared Errors (SSE), in such a way that the solutions for D(t) and R I (t) obtained by the model approximate the reported values for deaths and recovered cases, respectively, while the sum I(t) + R I (t) + D(t) approximates the cumulative number of infected cases with symptoms. Since the Mexican government does not keep a record of the number of asymptomatic cases, we assume that the asymptomatic infected population is about nine times larger than the population with symptoms, based on government estimations. We applied three searches to minimize the SSE function: a gradient-based method, a gradient-free algorithm, and finally, a gradient-based method. This method was necessary to obtain the local minimum. We adapted the code from Caccavo [3] for our mathematical model. The predicted evolution of the outbreak for COVID-19 in Mexico can be seen in Figure 2 . The parameters of the mathematical model were fitted with the experimental data provided by a daily update from the Mexican Ministry of Health. By adjusting the data from the period from March 12, 2020 to April 14, 2020, we simulated the daily new COVID-19 cases in Mexico until June 4, 2020. The peak of the infection modeled will be around the first week of May, with 4400 infected individuals with the known developed symptoms and roughly 37 000 infected individuals that will not develop any kind of symptoms. The values of the best fit parameters are given in Table 1 . Figure 3 shows the variation of the infection rate β(t), recovery rate γ(t) and death rate δ(t) with respect to time. Using these values for the parameters, we can calculate the effective daily reproductive ratio R d (t) for each day (see Figure 4 ). Our simulation shows that R d (t) will become less than 1 around April 29, 2020. In Figures 5-8 , we carried out the simulation with the best fit parameters. We show a comparison of the cumulative number of infections ( Figure 5 ) and deaths ( Figure 6 ) with the reported data. We also plot the number of active symptomatic infections and asymptomatic infections in Figures 7 and 8 . As we can see, the population of asymptomatic individuals represents roughly 90% of all possible infections. The behavior of the asymptomatic population is important because they have the capacity to spread the virus without developing any symptoms, and public health interventions should focus on these individuals. By this date, Mexico is located in phase 2 (community transmission) and has declared a national emergency and asked the citizenship to stay in their houses. By declaring self-quarantine to all individuals, we could separate the healthy population from the asymptomatic population and prevent the spread of the virus more rapidly. This action will only gain time in the number of infections and help avoiding the saturation of hospitals. If we control the population of infected individuals, we will prevent the death of many individuals, and the hospitals will have sufficient supplies to mitigate the severity of symptoms in patients with any type of chronic degenerative disease. It is of great importance to explain some of the parameters estimated in the optimization section. τ β = 27.86 represents the number of contacts per person per time. Even though at this time this value is high, by declaring social distancing measures the Mexican government intends to decrease the value of this parameter. We will view this decrease once we reach the peak of infection. τ γ gives us information of the average recovery rate between the first day of the appearance of symptoms and death, which is τ γ = 11.28. This means that some infected individuals can develop more complicated symptoms, eventually they will take much more time to recover than others. As well, it represents the average time of supplies a patient will cost if they develop chronic symptoms like pneumonia or other pulmonary complications. Modeling COVID-19 epidemics in an Excel spreadsheet: Democratizing the access to first-hand accurate predictions of epidemic outbreaks Modeling and forecasting the Covid-19 pandemic in Brazil Chinese and Italian COVID-19 outbreaks can be correctly described by a modified SIRD model Dispersion of a new coronavirus SARS-CoV-2 by airlines in 2020: Temporal estimates of the outbreak in Mexico The construction of next-generation matrices for compartmental epidemic models Analysis and forecast of COVID-19 spreading in China, Italy and France Novel Coronavirus COVID-19 (2019-nCoV) Data Repository Mathematical modelling of the dynamics of the Coronavirus COVID-19 epidemic development in China A conceptual model for the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China with individual reaction and governmental action Age-structured impact of social distancing on the COVID-19 epidemic in India An updated estimation of the risk of transmission of the novel coronavirus (2019-nCov) Predicting COVID-19 distribution in Mexico through a discrete and time-dependent Markov chain and an SIR-like model The SARS-CoV-2 epidemic outbreak: a review of plausible scenarios of containment and mitigation for Mexico