key: cord-0132850-sahy3tu4 authors: Xu, Conghui; Yu, Yongguang; Yang, QuanChen; Lu, Zhenzhen title: Forecast analysis of the epidemics trend of COVID-19 in the United States by a generalized fractional-order SEIR model date: 2020-04-27 journal: nan DOI: nan sha: 68b1aaf54cdad74b9b4095f253802a7a191a2ee6 doc_id: 132850 cord_uid: sahy3tu4 In this paper, a generalized fractional-order SEIR model is proposed, denoted by SEIQRP model, which has a basic guiding significance for the prediction of the possible outbreak of infectious diseases like COVID-19 and other insect diseases in the future. Firstly, some qualitative properties of the model are analyzed. The basic reproduction number $R_{0}$ is derived. When $R_{0}<1$, the disease-free equilibrium point is unique and locally asymptotically stable. When $R_{0}>1$, the endemic equilibrium point is also unique. Furthermore, some conditions are established to ensure the local asymptotic stability of disease-free and endemic equilibrium points. The trend of COVID-19 spread in the United States is predicted. Considering the influence of the individual behavior and government mitigation measurement, a modified SEIQRP model is proposed, defined as SEIQRPD model. According to the real data of the United States, it is found that our improved model has a better prediction ability for the epidemic trend in the next two weeks. Hence, the epidemic trend of the United States in the next two weeks is investigated, and the peak of isolated cases are predicted. The modified SEIQRP model successfully capture the development process of COVID-19, which provides an important reference for understanding the trend of the outbreak. The outbreak of the coronavirus disease (COVID- 19) in 2019 occurred in Wuhan, China, at the end of 2019. This is a severe respiratory syndrome caused by the novel coronavirus of zoonotic origin [1] . The Chinese government has implemented many measures, including the establishment of specialized hospitals and restrictions on travel, to reduce the spread. By April 20, 2020, the outbreak in China has been basically controlled. However, the outbreak is still rampant all over the world. At present, the United States, Italy, Spain and other countries are still in the rising stage of the outbreak. It has posed a great threat to the public health and safety of the world. At present, many countries have adopted mitigation measures to restrict travel and public gatherings, which have a serious impact on the economy. Therefore, it is very important to predict the development trend of this epidemic and estimate the peak of the isolated cases. The epidemic model is a basic tool to research the dynamic behaviours of disease and predict the spreading trend of disease. Establishing a reasonable epidemic model can effectively characterize the development process of the disease. Therefore, a generalized SEIR epidemic model is proposed in this paper, and is defined as SEIQRP model. Some qualitative properties of this model are first analyzed, including the existence and uniqueness of the disease-free and the endemic equilibrium points. Then, conditions are also established to ensure the local asymptotic stability of both disease-free and endemic equilibrium points. So far, many scholars have researched COVID-19 from different perspectives [1] [2] [3] [4] [5] 23] . In [6] , the epidemics trend of COVID-19 in China was predicted under public health interventions. In [7] , the basic reproduction number of COVID-19 in China was estimated and the data-driven analysis was performed in the early phase of the outbreak. In order to predict the cumulative number of confirmed cases and combine the actual measures taken by the government on the outbreak of COVID-19, we further put forward an improved SEIQRP model, denoted by SEIQRPD model. At present, we can obtain the epidemic data of COVID-19 outbreak in the United States before April 20, 2020. We use these data and the improved model to predict the epidemic trend of the United States in the next two weeks, and estimate the peak of isolated cases. Firstly, the data before April 5 was selected to identify the model parameters, and the prediction ability of the improved model with the epidemiological data from April 6 to April 20 is verified. Then, the cumulative number of confirmed cases and isolated cases are predicted in the next two weeks. The peak of isolated cases is thus predicted. In recent years, with the continuous development of fractional calculus theory, fractional order systems modeling approaches have been applied in various engineering and nonengineering fields [19] [20] [21] . Compared with the short memory of the integer derivative, the fractional derivative has the information of the whole time interval or long memory. It is more accurate to describe the biological behavior of population by using fractional differential equation. The rest of this manuscript is structured as below. In Section 2, the fractional SEIQRP model and the modified SEIQRP model is proposed. Some qualitative properties of the SEIQRP model are discussed in Section 3. In Section 4, the prediction ability of the modified SEIQRP model is verified by using real data. The disease development in the United States in the next two weeks after April 21, 2020 is predicted. Finally, conclusion and some future works are discussed in Section 5. In this section, some useful lemmas and definitions will be given to analyze some results of this paper. Definition 1 [8] The Caputo fractional order derivative is given below where n − 1 ≤ α ≤ n. In order to simplify the symbolic representation, the Caputo fractional differential operator C 0 D α t in this paper are represented by D α . Definition 2 [8, 22] The Mittag-Leffler function is given below where n − 1 ≤ α ≤ n. Consider the following n-dimensional fractional order differential equation system where α ∈ (0, 1), X(t) = (x 1 (t), x 2 (t), . . . , x n (t)) T is an n dimensional state vector and A is an n × n constant matrix. The outbreak of COVID-19 has had a great impact on the economic growth of any country and daily life of any human. In order to control and prevent the possible outbreak of infectious diseases like the COVID-19 or other insect diseases in the future, it is very important to establish an appropriate model. The transmission diagram of the generalized SEIR model proposed in this paper is shown in Figure 1 . We divide the total population into six distinct epidemic classes: susceptible, exposed, infectious, quarantined, recovered and insusceptible. We will represent the number of individuals at time t in the above classes by S(t), E(t), I(t), Q(t), R(t) and P(t), respectively. The specific explanations of the above six categories are as follows: • Susceptible S(t): the number of uninfected individuals at the time t. • Exposed E(t): the number of infected individuals at the time t but still in incubation period (without clinical symptoms and low infectivity). who are not exposed to the external environment at the time t. The incidence rate plays a very important role in the epidemic, which can describe the evolution of infectious disease. According to the spread of different diseases in different regions, there are many forms of incidence rate. [10] [11] [12] . We expect to establish a generalized incidence rate, which can contain most of these specific forms. Therefore, the SEIQRP model with general incidence rate is given by where Λ is the inflow number of susceptible individuals, β 1 and β 2 denote the infection rates of the infected individuals and the exposed individuals, respectively, ρ is the protection rate, ε represent the incubation rate. δ is the rate at which symptomatic infections are diagnosed and quarantined. λ and κ represent the cure rate of isolated individuals and the death rate caused by the disease, respectively, µ is the natural mortality, the incidence rate β 1 f 1 (S)g 1 (I), β 2 f 2 (S)g 2 (E) are used to describe the transmission of diseases, which satisfy the following conditions [16] : In the following, we will combine the actual situation and government mitigation policy to improve the SEIQRP model (2) . Firstly, during the outbreak of COVID-19, we need to make reasonable assumptions according to the actual mitigation policies and circumstances. • During the COVID-19 outbreak, population mobility is strictly controlled by many countries. Especially, the policy of city closure was implemented in Hubei province, China. Therefore, the impact of migration will not be considered in the improved model. • For the prediction of this short-term virus transmission, the impact of natural mortality will not be considered. It is assumed that novel coronavirus is the only cause of death during the outbreak. • In order to predict the trend of cumulative confirmed cases, it is necessary to simulate the number of death cases. Therefore, a new class D(t) to the SEIQRP model will be added in the modified model, which denotes the number of death cases at time t. • The time-varying cure rate, mortality rate and infection rate will be applied to the improved model. This can better simulate the impact of the improvement of medical conditions and government control on individuals in reality. Based on the above assumptions and analysis, the following improved model for COVID-19 is proposed. where β 1 (t) = σ 1 exp (−σ 2 t), λ (t) = λ 1 (1 − exp(λ 2 t)) and κ(t) = κ 1 exp(−κ 2 t). The parameters σ 1 , σ 2 , λ 1 , λ 2 , κ 1 and κ 2 are all positive, where σ is used to simulate the intensity of government control. It should be emphasized that the protection rate ρ for susceptible individuals also reflects the intensity of government control. Obviously, the right side of system (2) satisfies the local Lipschitz condition, then there exists a unique solution of system (2) [8, 15] . Theorem 1 The solutions of system (2) are non-negative, and the closed set Ω = {(S, E, I, Q, R, P) ∈ R 6 (2) . Proof In order to investigate the non-negativity of solutions of the system (2), we consider the following system where the initial conditions are S 10 = 0, E 10 = 0, I 10 = 0, Q 10 = 0, R 10 = 0 and P 10 = 0. (0, 0, 0, 0, 0, 0) is the unique solution of the above system. According to the fractionalorder comparison theorem [17] , one can deduce that the so- By adding six equations of system (2), one can deduce By applying the fractional-order comparison theorem, one has Thus we can draw the result of Theorem 1. System (2) has an obvious disease-free equilibrium point In order to obtain the endemic equilibrium point of the system (2), we set: which implies Combining the above equations and the second equation of (5), one has Define Note that ϕ(0) = 0 and ϕ( Λ µ+ε ) = −Λ < 0. In order to show that ϕ(E) = 0 has at least one positive root in the interval (0, Λ µ+ε ], we need to prove that ϕ ′ (0) > 0. Hence Therefore, we have where the basic reproduction number is given by If R 0 > 1, the system (2) has at least one endemic equilibrium point M * = (S * , E * , I * , Q * , R * , P * ), where In the following section, we show that endemic equilibrium point M * is unique. According to the above analysis and hypothesis (i)-(iii), one has By the hypothesis (iv), this implies ϕ ′ (E * ) < 0. If there is another endemic equilibrium point M * * , then ϕ ′ (E * * ) ≥ 0 holds, which contradicts the previous discussion. Hence, system (2) has a unique endemic equilibrium point M * when R 0 > 1. Based on the above analysis, the following result can be obtained. Theorem 2 System (2) has a unique disease-free equilibrium point M 0 , if R 0 < 1. System (2) has a unique endemic equilibrium point M * , if R 0 > 1. In this section, the local asymptotic stability of diseasefree equilibrium point M 0 and endemic equilibrium point M * for system (2) is discussed. Proof The Jacobian matrix of the system (2) at the diseasefree equilibrium point M 0 is The corresponding characteristic equation is where The characteristic equation H(s) = 0 has four obvious negative characteristic roots, which are denoted by s 1 = s 2 = −µ, This implies that the other two eigenvalues s 5 and s 6 of characteristic equation H(s) = 0 are real roots. Hence If R 0 < 1, then one can obtain s 5 + s 6 < 0 and s 5 s 6 > 0, which imply s 5 < 0 and s 6 < 0. If R 0 > 1, one has s 5 s 6 < 0, which imply s 5 > 0 or s 6 > 0. It follows from Lemma 1 that the proof is completed. Further, we will show the locally asymptotic stability of the endemic equilibrium point M * of system (2) . Similarly, the corresponding Jacobian matrix of the system (2) at M * is where By the corresponding results in [13, 14] , let Then, the following result can be obtained. Theorem 4 With regard to system (2) , assume that R 0 > 1, (i) If D 1 (L 1 (s)) > 0, a 1 > 0, a 3 > 0 and a 1 a 2 > a 3 , then the endemic equilibrium point M * is locally asymptotically stable. (ii) If D 1 (L 1 (s)) < 0, a 1 > 0, a 2 > 0 and a 3 > 0, then the endemic equilibrium point M * is locally asymptotically stable for α ∈ (0, 2 3 ). (iii) If D 1 (L 1 (s)) < 0, a 1 < 0 and a 2 < 0, then the endemic equilibrium point M * is unstable for α ∈ ( 2 3 , 1). (iv) If D 1 (L 1 (s)) < 0, a 1 > 0, a 2 > 0 and a 1 a 2 = a 3 , then for α ∈ (0, 1), the endemic equilibrium point M * is locally asymptotically stable. Proof Based on the previous discussion, the characteristic equation L(s) = 0 has three obvious negative roots s 1 = s 2 = −µ and s 3 = −µ − λ − κ. In order to investigate the stability of equilibrium point M * , we only need to discuss the range of the root of L 1 (s) = 0, denoted by s 4 , s 5 and s 6 . (i) By the results in [14] , if D 1 (L 1 (s)) > 0, then s 4 , s 5 and s 6 are real roots. Further, by Routh-Hurwitz criterion, the necessary and sufficient conditions for s i (i = 4, 5, 6) to lie in the left half plane are 0, a 3 > 0, a 1 a 2 > a 3 . That is to say, under the above conditions, the roots of L 1 (s) = 0 satisfy Therefore, M * is locally asymptotically stable and (i) holds (ii) By the results in [14] , if D 1 (L 1 (s)) < 0, then L 1 (s) = 0 has a real root and a pair of conjugate complex roots, denoted by s 4 , m + ni and m − ni, respectively. Thus, one has By calculation, The conditions a 1 > 0, a 2 > 0 and a 3 > 0 imply that −s 1 > 2m, m 2 + n 2 > −2s 4 m, s 4 < 0. Further, one has That is to say tan 2 θ > 3, which implies Therefore, in order to ensure the establishment of | arg(s)| > απ 2 , we must have α < 2 3 . Thus (ii) holds. The proof of conclusions (iii) and (iv) is similar to that of conclusion (ii), hence we omit it. The data used in this paper are from the Johns Hopkins University Center for Systems Science and Engineering (https://github.com/CSSEGISandData/COVID-19). the Johns Hopkins University publishes data of accumulated and newly confirmed cases, recovered cases and death cases of COVID-19 from January 22, 2020. In the following discussion, the standard incidence rate [18] is used to describe the transmission of COVID-19, and is given by where N represent the total population of the region at the initial time. Hence, The effectiveness of the model (15) in describing the spread of COVID-19 is illustrated by selecting the confirmed, cured and dead cases in the United States. According to the real data provided by the Johns Hopkins University, the outbreak in the United States has not been brought under full control. The data of confirmed cases, cured cases and death cases are selected from January 22, 2020, to April 20, 2020. Assuming that the confirmed individuals will be isolated, then This hypothesis is in line with the actual situation. Hence, we can obtain the real data of isolated cases. Through the fractional predictor corrector method and the least squares fitting [24] , we can identify the parameters of the model (15) through the real data, which can be found in Table 1 . Based on the parameters in Table 1 , we can make a simple prediction of isolated cases and recovered cases in the United States, which can be found in Figure 1 . We need to emphasize that the peak here represents the number of isolated cases rather than the cumulative number of confirmed cases. Under the parameters in Table 1 , it can be calculated by using (9) that R 0 = 0.2041 < 1. By the conclusion in Theorem 3, the disease-free equilibrium point is local asymptotic stable. The SEIQRP model has a basic guiding significance for predicting and fitting spreading dynamics of COVID-19. However, the prediction of this model for COVID-19 is relatively rough, we still need to improve model (15) according to actual mitigation policies and research objectives. According to the analysis in Section 2, we choose the SEIQRPD model to predict the trend of the epidemic in the United States under reasonable assumptions. Similarly, the standard incidence rate is used to describe the transmission of COVID-19, the fractional SEIQRPD model can be expressed as When α = 1, the fractional-order SEIQRPD model is similar to the integer-order model used in [24] . According to the data provided by Johns Hopkins University, by April 20, 2020, the outbreak in China has been basically controlled. In many provinces of China, the number of new cases per day is increasing in single digits. This means that the data in China contains more information about the spreading dynamics of COVID-19. Therefore, the data in Hubei, Guangdong, Hunan and Zhejiang are selected to research the fitting effect of the model (17) . According to the real data of these four regions in China, the parameters of the model (17) are identified, and the results are shown in Table 2 . The model (17) successfully capture the trend of the outbreak, which can be seen in Figure 3 . Number of cases 10 4 Hubei (China) At present, we can obtain the epidemic data of the United States from January 22, 2020 to April 20, 2020. We need to preprocess the data to remove the data smaller than 0.5% of the current highest number of confirmed cases. In order to test the prediction ability of the SEIQRPD model (17) for the development process of the epidemic in the United States, we select the data before April 5 to identify the parameters of the SEIQRPD model (17) . Furthermore, in order to illustrate the ability of the SEIQRPD model (17) in predicting the outbreak, we compared the real data and fitted data after April 5, which can be seen in Table 3, Table 4 and Figure 4 . According to the results in Table 3 and Table 4, it can be found that the real values of current isolated cases and cumulative confirmed cases fall within the range of 95% -105% of the predicted values. This shows that the SEIQRPD model (17) can effectively predict the data in the next two weeks. The data before April 20, 2020 are selected to identify the parameters of the improved model (17) , and the results are shown in Table 5 . The cumulative number of confirmed cases and the number of isolated cases after two weeks are predicted, which can be seen in Table 6 . The isolated cases in the United States will peak on June 18, 2020, with the peak of 1.0431 × 10 6 , which can be seen in Figure 5 . We first propose the fractional SEIQRP model with generalized incidence rates. Some qualitative properties of the SEIQRP model are discussed. In order to predict COVID-19 effectively, we propose an improved SEIQRPD model according to the actual mitigation situation. According to the data of the United States before April 5, 2020, the trend of the outbreak in the United States from April 6 to April 20 is successfully predicted as compared to the real records. Then, using the data before April 20, 2020, we forecast the trend of the outbreak in the United States in the next two weeks, and estimate the peak of isolated cases and the date of the peak. The improved SEIQRP model proposed in this paper successfully captures the trend of COVID-19. The long-term prediction needs to adjust the model appropriately according to the change of policy and medical level. We will discuss in the future work. A conceptual model for the coronavirus disease 2019 (COVID-19) outbreak in Wuhan, China with individual reaction and governmental action Epidemic analysis of COVID-19 in China by dynamical modeling A Mathematical model of COVID-19 using fractional derivative: outbreak in India with dynamics of transmission and control The reconstruction and prediction algorithm of the fractional TDD for the local outbreak of COVID-19 Coronavirus: Where We are and What We Know Modified SEIR and AI prediction of the epidemics trend of COVID-19 in China under public health interventions Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak Fractional differential equations The analysis of fractional differential equations: an application-oriented exposition using differential operators of Caputo type Hopf bifurcation in an age-structured SIR epidemic model An SIRS epidemic model with pulse vaccination and nonmonotonic incidence rate A stochastic SIRS epidemic model with nonlinear incidence rate. Applied Mathematics and Computation On fractional order differential equations model for nonlocal epidemics Fractional calculus and the evolution of fractal phenomena Existence of solution for delay fractional differential equations Stability of a fractional order SEIR model with general incidence Global stability analysis of fractional-order Hopfield neural networks with time delay Backward bifurcation of an epidemic model with standard incidence rate and treatment rate A new collection of real world applications of fractional calculus in science and engineering Fractional order crowd dynamics: Cyber-Human systems modeling and control Fractional Calculus View of Complexity: Tomorrow's Science Mittag-Leffler stability of fractional order nonlinear dynamic systems A fractional-order SEIHDR model for COVID-19 with inter-city networked coupling effects. Nonlinear Dynamics (Special Issue on 'Nonlinear dynamics of COVID-19 pandemic: modeling, control, and future perspectives Generalized SEIR epidemic model (fitting and computation) The plots in this paper were plotted using the plot code adapted from [24] . We declare that we have no conflict of interest.