key: cord-0454502-c642jind authors: Afonso, Suzete; Azevedo, Juarez; Pinheiro, Mariana title: Epidemic analysis of COVID-19 in Brazil by a generalized SEIR model date: 2020-05-22 journal: nan DOI: nan sha: cc8d595f44df00864cd7f6018e919d86a92be5c3 doc_id: 454502 cord_uid: c642jind We shall apply a generalized SEIR model to study the outbreak of COVID-19 in Brazil. In particular, we would like to explain the projections of the increase in the level of infection over a long period of time, overlapping large local outbreaks in the most populous states in the country. A time-dependent dynamic SEIR model inspired in a model previously used during the outbreak in China was used to analyses the time trajectories of infected, recovered, and deaths. The model has parameters that vary with time and are fitted considering a nonlinear least-squares method. The simulations starting from April 8, 2020, concluded that the time for a peak in Brazil will be in July 21, 2020 with total cumulative infected cases around 982K people; in addition, an estimated total death case will reach to 192K in the end. Besides that, Brazil will reach a peak in terms of daily new infected cases and death cases around the middle of July with 50K cases of infected and almost 6.0K daily deaths. The SEIR model derives from the SIR model and require a large amount of data over time to execute. Here, we follow the adapted SEIR model developed by Tan and Chen in [13] , which is the most reliable response to emergency public health actions and takes into account the population exposed to the virus. More specifically, we use the generalized version of the classical SEIR known as SEIRDP (Susceptible, Exposed, Infected, Recovered, Death, Insusceptible (P)) which is a particular case of the model proposed by [10] . In this model it is possible to include key epidemic parameters for COVID-19, such as the latent time, infected time, protection rate and time evolution of the recovery. This allows us to estimate the inflection point, ending time and total infected cases in Brazil and its states according to the Ministry of Health of Brazil data. The data are estimated considering a time window for collecting cure and mortality rates, which provides more accurate information on the evolution of local cases. The search for a reliable estimate of the number of infected and deaths is crucial so that public health systems can act to respond quickly to the needs presented. For this reason, many studies aim to estimate the contagion curve, the spreading speed and the prediction of deaths. Recently, Imperial College [12] released a study on the current situation of COVID-19 in Brazil and estimate that human-to-human transmission of Sars-CoV-2, virus of the coronavirus family causing the disease COVID-19, in Brazil averages from 1 to 2.8 people. They use a semi-deterministic Bayesian hierarchical model that analyzes the impacts of interventions aimed at restricting the transmission of the virus but it is not clear how effective these measures were. The model adopted follows a function of patterns in urban mobility to estimate deaths, infections, and transmissions. Some models can also predict the evolution of contaminants in real-time, such as the IHME [1] with some important variables such as the transmission rate and the recovery rate, neither static nor variable over time, which makes the result very broad and somewhat vague. Local health policies and actions can also be taken to mitigate the virus. Some authors have also studied the impact of lockdown on the spread of the virus [11] . Others, who used these models, consider the study of public policies to mitigate the disease in specific locations [16] . Our simulations are based on general data from Brazil and where different policies have been applied in each state according to hospital capacity and speed of contamination. In this way, we focus on responses to general data from Brazil and in some Brazilian states with a higher rate of cases in order to assist in medical and political actions. In this work, we analyzed the case data of COVID-19 from the states of São Paulo, Rio de Janeiro, Ceará, Espírito Santo, Bahia, Amazonas, Pará, Pernambuco, and Maranhão. To validate our results through the SEIRDP model, we carefully collect the epidemic data from the Ministry of Health of Brazil [2] from April 08th till May 21th, 2020 and studied the approximation of the observed and computed data considering the infected, recovered and death cases. Then, we were able to estimate the parameters that characterize the natural history of this disease and allows us to investigate the projection of the epidemic in certain periods. The SEIRDP (Susceptible, Exposed, Infected, Recovered, Death, Insusceptible (P)) model used to analyze the coronavirus epidemic in Brazil has the following variables: • S(t): Susceptible population; • E(t): Population who are exposed to the virus, but not yet infectious in latent period; • I(t): Population who get laboratory positive confirmation and with infectious capacity; • R(t): Recovery cases; • D(t): Death number; • P (t): Insusceptible cases; • N = S + P + E + I + R + D is the total population; • α: Protection rate (include people exposed to the infectious patients and people exposed to the asymptomatic patients); • β: Infection rate (include exposed people catch COVID-19, people get COVID-19 with diagnosed confirmation and people; without symptoms but can transmit it to others); • γ −1 : Average latent time; • λ(t): Coefficient used in the time-dependent cure rate; • κ(t): Coefficient used in the time-dependent mortality rate. The variable described above are related through the following ODE system: (1) The model is time-dependent, because the cure rate λ(t) and mortality rate κ(t) are analytical functions of the time defined by (3) respectively, which are fitted according to the number of cases for t ∈ [t 0 , t f ], where t 0 and t f are the initial and final periods, respectively. Equations (2) and (3) indicate that cure rate or recovery rate λ(t) goes up exponentially toward a threshold value and the mortality rate κ(t) should be zero after an infinite time. Moreover, the parameters {α, β, γ −1 , λ(t), κ(t)} were fitted in the least square sense. The parameters {α, β, γ −1 , λ 0 , λ 1 ,λ 2 , κ 0 (t),κ 1 (t)} are computed simultaneously by a nonlinear least-squares solver [8] . We can replace them in the model (1) and calculate the time-histories of the different states {S(t), P (t), E(t), I(t), R(t), D(t)}. In this case, we use the standard fourth-order Runge-Kutta process [6] . For this, the system of equations is written as a single ODE Anchored by the information from the public data of the Ministry of Health of Brazil [2], we reliably estimate key epidemic parameters and make projections on the inflection point considering fitted and actual data. Experiments show the spatial distribution of COVID-19 cases in Brazil and in some states of the federation, namely, São Paulo, Ceará, and Rio de Janeiro. These states stand out for the highest incidence of COVID-19 in Brazil. We also projected cases of COVID-19 in other states in the country: Espírito Santo, Bahia, Amazonas, Pará, Pernambuco and Maranhão that had a large number of confirmed cases in relation to other states in the federation, about I > 8.000 each one, according in the last epidemiological bulletin of Ministry of Health of Brazil in May 18, 2020. Fig. 1(a) shows the first modeling and the projection for Brazil from late early April to mid-May 2020 for the currently total, infected, recovery and death cases, while Fig. 1 (b) presents its respective relative error solutions. According to these results we can say that the model has a good correspondence with the data provided. This means that the state parameters found by the least-squares solution should be suitable for projecting future dates. A test was done with the traditional SEIR and the results were overestimated (see Fig. 1(c) ). In Fig. 2(a) , through the SEIRDP model, we made the projections in the period from April to August. According to this model, the infected cases in Brazil will reach to the peak at middle of July 2020 with about 982K cases. In this sense, the period also represents the peak time for medical resources get ready. According to the Ministry of Health, Brazil has a total of 441,811 hospital beds distributed in the country's numerous regions. From the number of infected, we can see that the public health system in Brazil may collapse in the middle of June. The number of victims is also expected to grow in the same period. It should be noted that these results are only predictions based on data and population mitigation policies. These results may change as the population's isolation patterns change, as studies in this direction indicate, [11, 16] . Modeling of the differential SEIR model from April 8 to May 21. Fig. 2(b) and Fig. 2(c) presents daily newly confirmed infected cases and daily death cases for Brazil over the same period respectively. The red lines indicate the actual data and the blue lines are predicted data from May to early of August 2020. As we can see that the daily infected cases seem reaching the peak around July 21 with about 55K per day and then start to decrease. From Fig. 2(c) , we can see that the daily death is converging to approximately 6000 daily deaths without a projection of decrease in this period. São Paulo is the second metropolis in Latin America and the main financial, commercial and corporate center in South America and where the first case of contamination in Brazil was detected. Currently, São Paulo concentrates the largest number of infected with the new Coronavirus. For São Paulo state, the modeling and relative errors from the same period of Brazil are shown in Figs. 3 . We notice that the modeled curves present a good approximation with the collected data. In addition, a large part of the relative errors in infected cases, recovery, and death cases is below 20%. In Fig. 4(a) we present an estimate of accumulative infected cases, recovery, and death cases. Projections show that the currently infected cases will reach a peak around July 20 close to Brazil, with about 114K cases. Fig. 4(b) and Fig. 4(c) show the daily confirmed infected cases and the death cases for São Paulo over the mentioned period respectively. Estimates show that the peak will reach in July 20 (about 5K) for daily infected and converging to 436 daily deaths till August 19, 2020. The daily and cumulative peak has shown to be very close to the results obtained in the projection of Brazil. Another important study presented is on the state of Ceará, which recently showed a great increase in the number of cases. For Ceará state, the modeling and relative errors from the same period of Brazil are shown in Figs. 5. Looking at the curves modeled for Ceará state, we see that the error curves of infected, recovered and deceased cases also are oscillating around less than 20% in the period from early April to mid-May. Therefore, it is possible to infer that the SEIRDP model provides a good approximation of the results with respect to the collected data. In Fig. 6(a) we provide an estimate of accumulative infected cases, recovery and death cases. Projections show that the currently infected cases will reach a peak around July 20 with about 80K cases. Fig. 6(b) and Fig. 6 (c) present the daily confirmed infected cases and the death cases for Ceará over mentioned period, respectively. By the estimates, we can conjecture that the peak will reach in July 07 (about 1665) for daily infected, converging to 1000 daily deaths till August 19, 2020. (a) (b) Figure 5 : (a) Predictions of the differential SEIRDP model from April 08 to May 21, 2020. (b) The relative mean errors of projection for Ceará state from April 08 to May 21, 2020 between actual data and SEIRDP model involving cases of infected, recovered and death. We also present a more detailed study of the second largest state in population of Brazil and the second state to record cases of contaminated by the new Coronavirus which is Rio de Janeiro. The modeling and projection results from the same period of Brazil are shown in Fig. 7(a) for infected, recovered and deaths. The results from April to middle May bring consistent response. The respective relative errors are shown in Fig. 7(b) ). According to Fig. 8(a) , the state of Rio de Janeiro projects that infected cases will reach to the peak around August 02 with about 191K cases, which is close to period of the peak estimation of Brazil. This plot also suggests that the number of deaths will continue with the same growth trend in the cumulative total if the population is not subject to new rules of protection. Fig. 8 (b) and Fig. 8(c) show the daily confirmed infected cases, the death cases and the total accumulative infected cases for Rio de Janeiro state over mentioned period, respectively. Rio de Janeiro state will reach the peak around July 21 for both the daily infected increase (about 9K per day). But for daily death cases, it will reach the peak of 800 per day around August 19. In this study, a time-dependent SEIRDP model was used to analyze the evolution of currently active, recovered and deaths cases in states in Brazil whose infected toll exceeds 8.000 cases actually. The states Espírito Santo, Bahia, Amazonas, Pará, Pernambuco and Maranhão marked together with São Paulo, Ceará, and Rio de Janeiro have more than 90 percent of the total fatalities in the country. As in previous experiments, theses projections take into account mitigation and self-protective measures implemented by the government and the population, in the periods given previously. Fig.9 shows the prediction of infected, recovered, and death from April to early August. According to this figure, the state of Pernambuco is close to reaching the peak of patients infected by COVID-19 in mid-June, giving the authorities time to plan control and mitigation measures during this period. On the other hand, the number of deaths maintains a projected growth in all states considered in this experiment. We consider an extension of the classical SEIR model known as SEIRDP with the aim of modeling COVID-19 epidemics and introduce an estimation algorithm for Brazil. These parameters are found by applying the least-squares estimate in order to fit the SEIR differential equations to the daily and cumulative cases observed by symptoms onset. The results revealed that the model (1) provides a good approximation with respect to the collected data, since a large part of the relative errors in infected cases, recovery, and death is below 20%. It should be noted that we need a reasonable set of data so that the estimates are more reliable. Fig. 10 illustrates how bad can be the prediction if the data set is too small. The case of Brazil is used in the following display. Note that the discrepancies are significant when few points are used to measure the prediction of results, but these discrepancies tend to reduce as the number of points increases. According to the estimates obtained through the SEIRDP model, the states of São Paulo, Ceará, Amazonas, Maranhão, Espírito Santo, Pará and Bahia will reach to the peak of patients infected in mid-July, while Rio de Janeiro in early august. In addition, the daily number of deaths projects growth until August 19, 2020. It is worth mentioning that the first cases were diagnosed in the country, suspected cases, and those who had contacts with confirmed cases were tested. Due to the large scale of contamination and low testing capacity, the Ministry of Health started to recommend since mid-March that in places with community transmission only serious cases should be tested. Consequently, many cases of coronavirus may have been underreported, underestimating the results of the model. Another intrinsic problem raised by this study was the delay between the occurrences of deaths and their recording in official data released by the Ministry of Health. Unfortunately, in some states, much information arrived late, this can compromise the projection results. Despite these problems, we hope that the estimates obtained will minimize the influence of Coronavirus in Brazil, especially in large urban centers whose probability of accumulating serious cases in the short term is high. Infectious Diseases of Humans: Dynamics and Control. Dynamics and Control Dynamics of measles epidemics: estimating scaling of transmission rates using a time series SIR model Numerical Analysis. Cengage Learning A mathematical model for simulating the transmission of Wuhan novel Coronavirus. bioRxiv Generalized SEIR Epidemic Model (fitting and computation Threshold behaviour of a stochastic SIR model Epidemic analysis of COVID-19 in China by dynamical modeling Lockdown may partially halt the spread of 2019 novel coronavirus in Hubei province Estimating COVID-19 cases and reproduction number in Brazil Real-time differential epidemic analysis and prediction for COVID-19 pandemic Susceptible-infected-recovered (SIR) dynamics of COVID-19 and economic impact The sir model and the foundations of public health The lockdown of Hubei province causing different transmission dynamics of the novel coronavirus (2019-ncov) in Wuhan and Beijing Simulating the infected population and spread trend of 2019-ncov under different policy by EIR model. medRxiv Stochastic SIR model with jumps