key: cord-1044949-6tz7sn8q authors: Valle, Jemy A. Mandujano title: Predicting the number of total COVID-19 cases and deaths in Brazil by the Gompertz model date: 2020-11-03 journal: Nonlinear Dyn DOI: 10.1007/s11071-020-06056-w sha: 67726a4c80437d75430ff14d4500d14a360f6b6e doc_id: 1044949 cord_uid: 6tz7sn8q In this work, we estimate the total number of infected and deaths by COVID-19 in Brazil and two Brazilian States (Rio de Janeiro and Sao Paulo). To obtain the unknown data, we use an iterative method in the Gompertz model, whose formulation is well known in the field of biology. Based on data collected from the Ministry of Health from February 26, 2020, to July 2, 2020, we predict, from July 3 to 9 and at the end of the epidemic, the number of infected and killed for the whole country and for the Brazilian states of Sao Paulo and Rio de Janeiro. We estimate, until July 9, 2020, a total of 1,709,755 cases and 65,384 deaths in Brazil, 331,718 cases and 15,621 deaths in Sao Paulo, 134,454 cases and 11,574 deaths in Rio de Janeiro. We also estimate the basic reproduction number [Formula: see text] for Brazil and its two states. The estimated values [Formula: see text] were 1.3, 1.3, and 1.4 for Brazil, Sao Paulo, and Rio de Janeiro, respectively. The results show a good fit between the observed data and those obtained by the Gompertz. The proposed methodology can also be applied to other countries and Brazilian states, and we provide an executable as well as the source code for a straightforward application of the method on such data. The World Health Organization declared the coronavirus disease 2019 a pandemic on March 11, pointing to the over 118,000 cases in over 110 countries and territories around the world at that time [18] . The COVID-19 pandemic in Brazil began on February 26, 2020, when a man from Sao Paulo who returned from Italy tested positive for the virus. The first case of COVID-19 in Rio de Janeiro was confirmed on March 5, 2020. Nowadays, Brazil is considered the epidemic center of Latin America, occupying the second place in the total number of cases and, more recently, in the total number of deaths. Currently (July 9, 2020), Sao Paulo and Rio de Janeiro are the states with the highest number of deaths by the new coronavirus in Brazil, according to data from the Brazilian Health Ministry. In the past few months, a considerable number of studies related to the evolution of COVID-19 in the world have been submitted and published. In the following, we describe some of these works. Ahmadi et al. [1] developed mathematical models to predict the number of COVID-19 cases in Iran from April 3, 2020, to May 13, 2020. The unknown parameters in these models were estimated by running the fminsearch, a MATLAB function, which is a least-squares algorithm. Torrealba et al. [22] analyzed the modeling and prediction of COVID-19 in Mexico, from an initial approximation, and using the Gauss-Newton algorithm, the authors estimated parameters in the Gompertz and Logistic models. Articles [12, 15, 16] estimated the number of total COVID-19 cases and deaths in world using the Gompertz model. In [11] , they analyzed the number of deaths by COVID-19 with social distancing in Brazil, Sao Paulo. Using the SEIR model, the authors recommend temporary lockdowns, however, great the economic and social costs. In [2] , the authors studied the impact on the evolution of cases in Rio de Janeiro, Brazil, considering a Susceptible-Infectious-Quarantined-Recovered (SIQR) model with containment. In this work, we model the near-future trajectory of the cumulative number of infections and deaths by COVID-19 for Brazil and two Brazilian states, given the cumulative number of infected cases and deaths from February 26, 2020 to July 2, 2020. Data were obtained from the Brazilian Ministry of Health until July 2 (https://covid.saude.gov.br), and we considered the cumulative reports from the date on which the first case was notified in Brazil and in each analyzed state. We also estimate the basic reproduction number (R 0 ), which represents the average number of secondary infections generated by each infected person. To estimate the unknown parameters, we use the Gompertz model and propose a gradient type iterative method (Minimal error method). The Gompertz model is one of the particular cases of the Richards model, as well as Brody, negative exponential, logistic and von Bertalanffy models, see [20, 21] . The growth functions can be grouped in three main categories: those without inflection point (Brody and negative models), those with sigmoidal shape and a fixed inflection point (Gompertz, logistic, and von Bertalanffy models), and those with a flexible inflection point (Richards model). The logistic, Gompertz, and von Bertalanffy models exhibit inflection points at about 50, 37, and 30% of the upper asymptote, respectively, which means that the Gompertz and von Bertalanffy processes are asymmetric, whereas the logistic is a symmetric process, see [19] . The Gompertz and the logistic models are the most frequently used sigmoid functions, and the literature on these models is extensive [3, 21] . In general, the cumulative number of deaths and cases by COVID-19 presents an asymmetrical sigmoidal growth curve. Therefore, using an inappropriate growth curve can have a substantial impact on forecasting [3] . The Gompertz model [5] was proposed by Benjamin Gompertz in 1825. Since then, this exponential model has been used to describe growth in plants, animals, bacteria, and cancer cells [22] . The Gompertz differential equation has the following form, where t represents time; N (t) is the cumulative population size at time t; r the intrinsic growth rate of model; K is the maximum value of model (N ) when t goes to infinity; t 0 represents the initial time; and N 1 is the initial population or condition. The analytic solution from (1) is where lim t→∞ N (t) = K . The turning point is the time at which the rate of accumulation changes from increasing to decreasing or vice versa and can be easily located by finding the inflection point of the epidemic curve, that is, the moment at which the trajectory begins to decline. Clearly, this quantity is of epidemiological importance, indicating either the beginning (i.e., the moment of acceleration after deceleration) or end (i.e., the moment of deceleration after acceleration) of a phase [6] . From Eq. (2), it is trivial to show that the inflection point is given by At the inflection point, the number of infected or killed is given by N (t i. p ) = K /e. In this paper, we consider the discrete function, from Eq. (2) and for t 0 = 1, where i represents time in days and N i is the cumulative number of infected cases (or deaths) at day i. In this paper, we consider that equality (3) is the inflection point of Eq. (4). model. The MEM is a variant of the conjugate gradient method [4, 10, 17] , and this approach was used to estimate unknown parameters in a computational neuroscience models [24, 25] . Also, [14] solves the Cauchy problem in linear elasticity with MEM. Knowing the cumulative number of infections or deaths from COVID-19 (M), we want to determine x assuming that Eq. (4) holds. The transpose of a vector y is denoted by y T . Let the nonlinear operator F : . . . where f 1 , f 2 , · · · , f m : R 2 → R are multivariable functions with real value and M solves (4). To obtain an approximation for x, given M and x 1 , we used the minimal error iteration where . It is possible to show that, under certain conditions (we assume that is the case), x k converges to a solution of F(x) = M; see [9, Theorem 3.21] . From Eq. (5), we obtain an approximation x k for x, but the adjoint F (x k ) is not known. In this paper, we obtain the unknown operator (see Appendix) together with the following iteration where The regression coefficient (R 2 ) is used to evaluate the fitting ability of various methods and can be obtained by the following equation, wherē N = m i=1 N i m is the average of the cumulative confirmed COVID-19 cases (or deaths). Remark 1 From differential Eq. (1) and since M is known, we consider the following initial approximation for x . The numerical scheme for our method poposed is in Algorithm 1. The basic reproduction number R 0 is the most important parameter to analyze any epidemic model for any disease. In a practical sense, R 0 is period. An epidemic occurs only if a single individual can spread its illness to more than one individual (R 0 > 1). If R 0 < 1, on average an infectious individual infects less than one person and the contagion is expected to stop spreading. Our result can be used to compute the basic reproduction number see [6] [7] [8] 13] , where r denotes the intrinsic growth rate in the Gompertz model and T is the generation time of In this section, we will provide the results obtained for the whole country (Brazil) and for the Brazilian states of Sao Paulo and Rio de Janeiro. Table 1 describes the estimated values for the Gompertz model and also presents the regression coefficient, the inflection point, and the basic reproduction number. The estimated daily values for the cumulative number of confirmed cases and deaths, from July 3 to 9, are illustrated in Table 2 . Figures 1, 2, 3 illustrate the relationship between observed values and those predicted In this article, we propose an iterative method to estimate the unknown parameters in the Gompertz model. The results of the modeling show a good fit between the estimated and the observed data. We obtained an estimate of 99.9% (R 2 ) for the cumulative number of infected and killed in Brazil. For Sao Paulo, we obtained an approximation of 99.7% and 99.4% for the cumulative number of infected and deaths, respec- (7); Compute M k+1 from Eq. (4), replacing x by x k+1 ; k+1 , x = x k+1 and count = 0; end k ← k + 1; count = count + 1; end Algorithm 1: Iteration to estimate R 2 , K and r . The source code is available at [23] . tively. For Rio de Janeiro, the estimate was 99.8% for the cumulative number of infected and deaths. The basic reproduction number R 0 ( or the average number of infections caused by one typically infectious individual ) depends on the growth rate (r ), which was estimated in this work, and the duration of infectiousness (T ). Considering T = 14, the basic reproduction number was 1.3, 1.3, and 1.4 for Brazil, Sao Paulo, and Rio de Janeiro, respectively. Applying our proposed methodology for China, we obtained R 0 = 3.2, given the number of infected cases accumulated from February 2 to July 2, based on data collected from the world health organization. We did the same for Italy and obtained R 0 = 1.8. Many works obtained these values for China and Italy. Therefore, our estimates for the basic reproduction number are acceptable. The inflection point of the curve provides vital information about changing trends in the epidemic and may possibly indicate changes in intervention and control. In this paper, the inflection points for the cumulative number of infected cases were July 9, July 4, and June 21 for Brazil, Sao Paulo, and Rio de Janeiro, respectively. Regarding the methodology used, there are several methods for estimating or determining parameters in a mathematical model. We can divide these methods into two groups, iterative and non-iterative methods. In this paper, we used a gradient type iterative method. This method can recover unknown parameters with a non-uniform distribution (non-constant functions) given noisy data. On the other hand, like all iterative methods, it depends on an initial estimate. If this initial approximation is far from the solution, the iteration will diverge. Although it is difficult to determine an initial estimate for some problems, in this work we found an appropriate initial guess, see Remark 1. by definition of Euclidean inner product on R 2 , we have Since θ ∈ R 2 is arbitrary, we gather that the following iteration holds: Modeling and forecasting trend of covid-19 epidemic in Iran until May 13, 2020 Covid-19 spreading in Rio de Janeiro, Brazil: do the policies of social isolation really work? A method to select between Gompertz and Logistic trend curves Convergence rate results for steepest descent type method for nonlinear ill-posed equations On the nature of the function expressive of the law of human mortality, and on a new mode of determining the value of life contingencies. In a letter to Francis Baily Richards model: a simple procedure for realtime prediction of outbreak severity Intervention measures, turning point, and reproduction number for dengue Early outbreak of 2009 influenza a (H1N1) in Mexico prior to identification of ph1n1 virus Iterative Regularization Methods for Nonlinear Ill-posed Problems A minimal error conjugate gradient method for ill-posed problems Simulando os efeitos de políticas de distanciamento social em são paulo usando um modelo seir. Insper, Centro de Políticas Públicas Estimation of covid-19 spread curves integrating global data and borrowing information Model selection and evaluation based on emerging infectious disease data sets including A/H1N1 and ebola The minimal error method for the Cauchy problem in linear elasticity. Numerical implementation for twodimensional homogeneous isotropic linear elasticity Predicting the number of total covid-19 cases and deaths in the USA by the gompertz curve Forecasting the number of covid-19 cases and deaths in the world, uk, russia and turkey by the gompertz curve A convergence rate result for a steepest descent method and a minimal error method for the solution of nonlinear ill-posed problems Forecasting covid-19 Comparing non-linear mathematical models to describe growth of different animals A unified approach to the Richardsmodel family for use in growth analyses: why we need only two model forms The use of Gompertz models in growth analyses, and new Gompertz-model approach: an addition to the unified-Richards family Modeling and prediction of covid-19 in Mexico applying mathematical and computational models Code to estimate the number of infected and killed by covid-19: Program Parameter identification problem in the Hodgkin and Huxley model A computational approach for the inverse problem of neuronal conductances determination The author would like to thank the National Council for Scientific and Technological Development of Brazil -CNPq (Grant No. 301330/2020-4) and PCI/LNCC (Programa de Capacitação Institucional do Laboratório Nacional de Computação Cientifica) for the financial support of this work. The authors declare that they have no conflict of interest. Theorem 1 Consider the iteration in Eq. (5) . Then, Eqs. (6) and (7) holds.Proof Given x k = (K k , r k ) ∈ R 2 and unit vector θ ∈ R 2 , the directional derivative of F at x k in the direction θ is given bywhererepresents the gradient of function f i at x k , and J F (x k ) · θ denotes the multiplication between matrix J F (x k ) and unit vector θ. From the minimal error iteration in Eq. (5), we gather thatBy the definition of adjoint operator,From previous equation and by the definition of Euclidean inner product on R m , we obtain