key: cord-0727676-6m84kbl1 authors: Sioofy Khoojine, Arash; Mahsuli, Mojtaba; Shadabfar, Mahdi; Hosseini, Vahid Reza; Kordestani, Hadi title: A proposed fractional dynamic system and Monte Carlo-based back analysis for simulating the spreading profile of COVID-19 date: 2022-03-30 journal: Eur Phys J Spec Top DOI: 10.1140/epjs/s11734-022-00538-1 sha: 96ae6aa50659706bc5c9c8af4b6161031da2ef66 doc_id: 727676 cord_uid: 6m84kbl1 This paper presents a dynamic system for estimating the spreading profile of COVID-19 in Thailand, taking into account the effects of vaccination and social distancing. For this purpose, a compartmental network is built in which the population is divided into nine mutually exclusive nodes, including susceptible, insusceptible, exposed, infected, vaccinated, recovered, quarantined, hospitalized, and dead. The weight of edges denotes the interaction between the nodes, modeled by a series of conversion rates. Next, the compartmental network and corresponding rates are incorporated into a system of fractional partial differential equations to define the model governing the problem concerned. The fractional degree corresponding to each compartment is considered the node weight in the proposed network. Next, a Monte Carlo-based optimization method is proposed to fit the fractional compartmental network to the actual COVID-19 data of Thailand collected from the World Health Organization. Further, a sensitivity analysis is conducted on the node weights, i.e., fractional orders, to reveal their effect on the accuracy of the fit and model predictions. The results show that the flexibility of the model to adapt to the observed data is markedly improved by lowering the order of the differential equations from unity to a fractional order. The final results show that, assuming the current pandemic situation, the number of infected, recovered, and dead cases in Thailand will, respectively, reach 4300, [Formula: see text] , and 36,000 by the end of 2021. The mathematical modeling of infectious diseases has been the foundation of infectious disease epidemiology for more than a century [1] . In recent years, detailed scrutiny of such diseases has become extensive because of improvements in data analysis, computing methods, diagnostic tests, and genome sequencing. Advances in such methods enable researchers to deploy mathematical models to design practical schemes for disease control, and also develop and test mathematical and statistical methods [2] [3] [4] [5] [6] . Mathematical models that describe the spread of diseases are continually being developed and are playing a fundamental role in promoting public health strategies in many countries [7] [8] [9] [10] . a e-mail: arashsioofy@yibinu.edu.cn b e-mail: mahsuli@sharif.edu (corresponding author) c e-mail: mahdishadabfar@yahoo.com (corresponding author) d e-mail: v.r.hosseini@gmail.com e e-mail: eng.kordestani@gmail.com In December 2019, a novel coronavirus, officially called coronavirus disease 2019 (COVID-19) by the World Health Organization (WHO), led to an outbreak of atypical pneumonia, first in Wuhan, the capital city of Hubei Province in China, and then expeditiously flared out in the entire world [11] . As of October 01, 2021, there have been more than 230 million confirmed cases, together with nearly 4.8 million deaths reports in the world [12] . Throughout anti-epidemic struggles, apart from medical, biological, and scientific research, theoretical studies based on mathematical and statistical modeling may thus significantly contribute to the conception of outbreak attributes [13] [14] [15] . Forecasting the trajectory of the spread accordingly helps countries make informed decisions on the required actions to mitigate the spread of the disease. Different models have thus far been proposed to analyze COVID-19, but compartmental methods that divide the population into compartments have dominated epidemiological modeling area [16] [17] [18] [19] . These models assist in interpreting the way COVID-19 spreads and in forecasting regional pinnacles of the pandemic. Compartmental models are broadly based on studying the systems of differential equations. In this respect, differential equations concentrate on the rate of changes in a variable or a group of variables as time proceeds. The most common model is the SIR, which divides a population into susceptible (S), infected (I), and recovered (R). The popularity of this model stems from its simplicity in predicting a small number of parameters. There are a number of studies that predict the spread of COVID-19 by this method, e.g., see [20, 21] , and [22] . Various models have been similarly extracted from the basic SIR model, which have thus far supplemented additional compartments to the SIR model [23] . One of the valuable models is the SEIR, namely, the susceptible-exposed-infected-recovered model, which has added the exposed compartment to the SIR structure [24] . The exposed group refers to the phase between the susceptible and the infected individuals. It comprises the population exposed to an infection but not yet infected. The SEIR model is thus capable of analyzing the spread of diseases more accurately than the basic SIR model [25] . Of note, a host of studies have analyzed the COVID-19 pandemic through modified versions of the SEIR model, e.g., see [26, 27] and [28] . Some studies have proposed the fractional SIR model, where one or, feasibly, both sub-compartments are raised by exponents that are usually less than unity [29] . The reason is that the infection transmits outwards from an infected population to the whole population through the early stages of an epidemic. Within this framework, where susceptible is much greater than infected it is better to scale them as a fractional power. Given the decreasing number of infected people over the general population in real world, the exponent is anticipated to be larger than 1/2. Furthermore, the power of the susceptible population, which is notably higher than that of the infected population, may be negligible at least in the initial stages of an epidemic. Deploying fractional exponents stemmed from a growth model, called the Norton-Simon-Massagué (NSM) model [30] . This model was created to explain the growth of biological organisms through applying determined energy principles. The governing differential equation reads where p 1 and p 2 measure anabolism growth and defuse, respectively. Equation 1 may be simplified as declaring the net growth rate of a biological organism outcome of the balance between synthetic and degradative mechanisms. Meanwhile, the rate is proportional to the growth volume G(t) with a power function and the rate of the latter process relates linearly with G(t). It is thus vital to mention that the two exceptional cases of Eq. 1 are power-law, where p 1 = 0, and second-type growth, where p 1 = 2/3, to explain the tumor growth [31] . Different types of fractional compartmental models have been applied to a number of infectious diseases, e.g., see [32] [33] [34] [35] [36] [37] [38] . Despite the fact that these models are convincing instruments and typically allow practical intuition into grasping the growth and the spread of diseases within designated populations, they suffer from one major flaw; that is, with the increase of the number of compartments, the number of model parameters increases, making it difficult for the model to properly fit the data. In other words, there is a trade-off between increasing the computational node and the regression accuracy. To cover new aspects of the problem, such as quarantine, hospitalization, and vaccination, it is necessary to add new components to the problem to make model more realistic [39, 40] . This, on the other hand, increases the number of model parameters, greatly adds to the complexity of the problem, and in turn reduces the accuracy of the fit. To address this issue, this paper puts forward a timevariant SEIVR-QH network, comprised of nine nodes: (1) susceptible, (2) insusceptible, (3) vaccinated, (4) exposed, (5) infected, (6) quarantined, (7) recovered, (8) hospitalized and (9) dead. In a novel development, the differential equation is expressed in fractional form, described shortly. The conventional form of compartmental networks employs first-order differential equations. In contrast, introducing the fractional form here in this paper facilitates the formulation of differential equations in non-integer, fractional orders, which leads to significant flexibility for the model to describe the observed data. This model has several unknown parameters that must be estimated to fit the model to the actual data. It requires a multi-objective optimization to select the optimal parameters by minimizing the error between the actual data and the model outputs. In addition, this optimization process requires solving the system of fractional differential equations in each iteration which is a complex task. To this end, a two-step back analysis is proposed in this paper based on Monte Carlo sampling. A comparison is made here between the capability of the proposed method and that of other optimization techniques, such as the particle swarm optimization (PSO) used in [41] . First, PSO is a deterministic optimization algorithm and Monte Carlo sampling is an inherently probabilistic method. In other words, the system parameters in the latter are modeled as random variables. The back analysis proposed in this study is a two-step algorithm. That is, first, the Monte Carlo sampling method is used to identify the samples that lead to the closest realizations to the target time histories. The results are then introduced into the second stage of the back-analysis algorithm to obtain the optimal parameters. This leads to the probabilistic characteristics of the optimal model parameters, such as the mean and standard deviation, using the output of the first step. In addition, the deterministic optimal parameters can be obtained as the mean values. Therefore, the proposed back analysis can provide both probabilistic and deterministic parameters, while the PSO method only yields optimal deterministic results. Second, the modeling approach in the present paper is based on the fractional form of the system of differential equations. The performance of PSO in the optimization of fractional equations has not yet been evaluated. In contrast, Monte Carlo sampling method is robust in handling such complex systems of fractional networks proposed in this study. Finally, the fractional order of the proposed differential equations also needs to be optimized along with other model parameters. Hence, the problem here involves differential equations of varying order, which is intractable in the classical PSO. That is, the classical PSO does not allow the optimization of the order of differential equations as the change in the order will fundamentally change the objective function and affect the process of calculating the cost function. In contrast, Monte Carlo sampling is based on the generation of random samples and the fractional order is yet another random variable in this analysis, which can be readily handled. The content of the paper is organized as follows. In Sect. 2, the proposed time-variant fractional SEIVR-QH network is presented, and the deterministic form of the problem is formulated. In Sect. 3, the main framework of the Monte Carlo-based back analysis is introduced and applied to the actual data of Thailand. In Sect. 4, a sensitivity analysis [42] is conducted to investigate the influence of the fractional order of the differential equation on the regression accuracy. Finally, the research is summarized and concluded in Sect. 5. In this paper, the modeling formulation is derived by the Caputo derivative in fractional calculus. Several researchers have applied fractional derivatives in science and engineering, e.g., see [43] . Definition 1 Given a positive real number η and an integrable function f : [a, b] → R, the fractional integral of f of order η is defined as [44] : where Γ denotes the Gamma function. The Caputo fractional derivative, where D = d/dt, is defined as [44] : If f is a function of class C n , its fractional derivative is represented by the following expression: where Γ represents the Gamma function. In this paper, an extended fractional SEIR network, called the SEIVR-QH, is applied to develop a deterministic model for the spread of COVID-19. In the definition of this network, the population is divided into nine nodes. First, the whole population is considered susceptible (S), which can become exposed (E) or can be immunized by observing social distancing (P ) or being vaccinated (V ) against the disease. Depending on the social exposure, exposed cases can be transmitted to the infected stage (I). If an individual is diagnosed with the disease, they will be quarantined (Q) based on the disease severity and infection spread. If the infection is severe, the patient will then be hospitalized (H). Consequent to treatment via hospital-based medical care or self-medication, nearly all quarantined cases recover (R) and some die (D) because of the disease severity. The relationship between these nine states is depicted in the network shown in Fig. 1 . The mathematical relationships among the nine states in the model are expressed by a system of fractional differential equations, as follows: where α is the protection rate, β indicates the infection rate, ρ stands for the vaccination rate, γ is the average latency time, δ denotes the rate at which infected cases enter into quarantine, ζ refers to hospitalization rate, λ is the time-variant quarantine recovery rate, κ is the time-variant mortality quarantine rate, and ψ and φ are hospitalization recovery and mortality rates, Theorem 1 There is a unique solution for the fractional SEIVR-QH model. Moreover, the solution is nonnegative. then the function f is non-decreasing, and if D η f (x) ≤ 0, ∀x ∈ (0, b), then the function f is non-increasing for all x ∈ [0, b]. Afterwards, all the conditions in Theorem 3.1 from [45] are met and there is a unique and non-negative function in (0, ∞), which is the solution of Eq. (1). There is also a need to show that the domain R +9 is positively invariant. Since the following holds: on each hyper-plane bounding the non-negative orthant, the vector field points into R +9 . Eqs. (5)- (14), with η satisfying 0 < η ≤ ∞, to evaluate its equilibrium points. Let Then, the equilibrium points are 1. A disease-free equilibrium P F = (1, 0, 0, 0, 0, 0, 0, 0) 2. An endemic equilibrium point P E = (S * , E * , I * , Q * , The system of fractional differential equations is written in the matrix form, as follows: To solve the matrix form of the given system, the Adams-Bashforth-Moulton predictor-corrector method is used in two steps. First, the prediction step calculates a rough approximation of the desired quantity, typically employing an explicit method. Second, the corrector step refines the initial approximation recruiting another means, typically an implicit method. For more details, see [46] . At this point, the formulation of SEIVR-QH network, the fractional form of the differential equations, and its solution are established. The problem at hand is to estimate the unknown weights of the network, namely β, α, γ, δ, λ 0 , λ 1 , κ 0 , κ 1 , ζ, ρ, ψ, and φ, such that the proposed SEIVR-QH network best describes the observed data. To this end, a novel two-step optimization algorithm is established in this section using Monte Carlobased back analysis to obtain the optimal weights when η is assumed as unity [47] [48] [49] [50] . A sensitivity analysis is then used in the next section to investigate the influence of fractional order in the regression accuracy. To implement the back-analysis approach, the parameters involved in the problem, i.e., weights of the network, are defined as random variables. This makes it possible to consider a range of variation for these parameters and evaluate the predictions obtained by their various combinations. To better understand the parameters' combination, suppose the parameters are placed in a lattice, in which the x-axis and y-axis are the parameters within the arranged set of ranges, then the variety of these parameters with different values is supposed as a random walk on this lattice. The range of variations defined for each random variable is shown in Table 1 . Next, 10,000 realizations are generated for each random variable. This process takes 30 min on a PC equipped with a 64 GB of random access memory (RAM) and an Intel Core i7-5820K central processing unit (CPU). Then, by introducing each set of realizations into the system of differential equations, a new SEIVR-QH problem is established that is solved using the method describe previously. This leads to 10,000 distinct predictions of the spread of the disease over time. The resulting database of I, R, D, and V is then used as the input of the back-analysis algorithm in the next step. The process described for implementing the Monte Carlo sampling is schematically shown in Fig. 2 . To implement the back-analysis method, the actual observations of COVID-19, i.e., I, R, and D in Thailand, are compiled from the World Health Organization (WHO) online database [12] . In addition, the number of daily vaccinations is obtained from the Johns Hopkins University database [51] . The whole dataset is available on "Mendeley Data" [52] . The results are plotted in Fig. 3 from 2021-02-28 to 2021-08-30. This figure also demonstrates intervals of ±15% above and below the observed data. This is here considered as an allowable interval, and used as a filter to select the closest cases to the observed data among the 10,000 realizations of I, R, D, and V . This means that the allowable area is defined by a sample-selection criterion that leads the algorithm towards finding the best available fit. The process described for the selection/rejection of samples is schematically shown in Fig. 4 . By applying the proposed filter to the data, 40 out of 10,000 samples fall within the allowable interval. Next, a new criterion is defined for a second-round selection to find the best sample from the previously selected 40 samples. For this purpose, the root mean square error where θ 1,i , θ 2,i , θ 3,i , and θ 4,i are the RMSE of the ith sample for the infected, recovered, dead, and vaccinated, respectively; I(t), R(t), D(t) and V (t) are the actual measurements of infected, recovered, dead, and vaccinated cases, respectively; and , and V S i (t) are the time series of infected, recovered, dead, and vaccinated cases for the i th sample, respectively, computed using the proposed SEIVR-QH model. The results for θ 1 , θ 2 , θ 3 , and θ 4 are shown in columns 2-5 and 8-11 of Table 2 . Next, by combining the RMSEs of I, R, D, and V using a square root of the sum of squares, the final selection criterion, denoted by θ t , is defined: The results of θ t are shown in columns 6 and 12 of sorted according to θ t and the sample corresponding to the lowest θ t , i.e., sample 4 in Table 2 , is selected as the best fit. The optimal values of the parameters correspond to this sample are reported in Table 3 . The time series of I, R, D, and V corresponding to the optimal parameters are also shown in Fig. 5 . As described in Sect. 2, the fractional form of the SEIVR-QH model, i.e., η ≤ 1, provides a new degree of freedom to the compartmental system and provides the algorithm with further flexibility compared to the conventional differential equations of the first order, i.e., η = 1. This subsequently enables the system to better fit the actual data. Since the back analysis presented in this paper is implemented by four objective functions that must be optimized simultaneously to ensure that the number of infected, recovered, dead, and vaccinated cases matches the actual data, this flexibility is key in achieving accurate results. To make use of the fractional differential equation system to fit the SEIVR-QH network to the collected data of COVID-19 in this paper, the order of the equations is defined as a decision variable that assumes values less than unity. Thereafter, the new fractional problem is solved and the resulting fit is assessed for improvement in the prediction accuracy. To make use of the fractional differential equation system to fit the SEIVR-QH network to the collected data of COVID-19, the order of the equations is defined as a decision variable that assumes values less than unity. Then, the new fractional problem is solved and the resulting fit is assessed for improvement in the prediction accuracy. For this purpose, assuming the optimal parameters obtained in Sect. 3, 11 different values of the decision variable that represent the fractional order are considered, including η = 0.5, 0.55, 0.6, 0.65, 0.7, 0.75, 0.8, 0.85, 0.9, 0.95 and 1. Next, a fractional form of SEIVR-QH problem is defined corresponding to each η and analyzed separately by the method presented in Sect. 2. The results of each case are then stored in the form of time series I, R, and D. Table 4 shows the θ 1 , θ 2 , and θ 3 , as well as θ t for different values of η. In addition, the corresponding time series of I, R, and D are plotted against different η in Fig. 6 . As seen, the best result is obtained for η = 0.9. In other words, assuming η = 0.9, a better fit to data is obtained and θ t is minimized. Compared to the assumption of η = 1, the value of θ t has changed from 3.52×10 6 to 2.58 × 10 6 , i.e., a reduction of 27% in the error measured. According to the new results, the number of infected, recovered, and dead at the beginning of 2022 in the Thailand will, respectively, reach about 4300, 4.65 × 10 6 and 36,000. In addition, by November 2022, the number of infected will fall below 1500, indicating that the disease will be under control in the Thailand. It is noteworthy that these results are based on the fit obtained by the fractional model, which is more accurate and reliable than the results reported in Sect. 3. In this paper, a compartmental network is established using fractional SEIVR-QH to predict the spread profile of COVID-19 in Thailand. To this end, the proposed network is employed in a Monte Carlo-based back analysis in which network weights, including β, α, γ, δ, λ 0 , λ 1 , κ 0 , κ 1 , ζ, ρ, ψ, and φ, are defined as random variables. By generating a set of random samples for each random variable, the Monte Carloback analysis is used to find realizations of parameters whose predictions fall within a desired interval around the observed data. Thereafter, the RMSE criterion is applied to the selected samples to find the best fit to the Thailand data. Finally, a sensitivity analysis is conducted to investigate the effect of the order of the fractional operator on the accuracy of the fit. The primary results of this study are as follows: • Monte Carlo-back analysis identified 40 samples in close proximity of the observed data, having RMSE 6 The resulting fit on the data for different η values between 3.52×10 6 and 4.73×10 6 . The best fit between them provides the optimal network weights as infection rate, β = 5.1769, protection rate, α = 0.1599, infectious to quarantine rate, δ = 0.3246, average latent time, γ = 0.0294, time-dependent recovery rate, λ 0 = 0.1773 and λ 1 = 0.3034, timedependent mortality rate, κ 0 = 0.0360 and κ 1 = 0.1239, hospitalization rate, ζ = 0.0061, vaccination rate, ρ = 0.0837, hospitalization to recovery rate, ψ = 0.0010, and hospitalization to mortality rate, φ = 0.0015. • The results of sensitivity analysis showed that the accuracy of data fitting is greatly improved by changing the fractional order, so that the θ t of the best fractional fit, i.e., η = 0.9, reaches about 2.58 × 10 6 , while that for the ordinary case, i.e., η = 1, is bound to 3.52 × 10 6 . This means that the fractional form of the SEIVR-QH network improves the accuracy by about 27%. • The optimal output of the fractional SEIVR-QH network showed that by the end of 2021, the number of people infected (active), recovered and dead in Thailand will reach 4300, 4.65 × 10 6 and 36,000, respectively. • The resulting number of infected cases also shows that the number of active cases fall below 1500 by November 2022, indicating that the disease will be under control in Thailand. Although the compartmental networks benefit from simple configuration and straightforward implementation, they suffer from a common problem; that is, if compartments and computational nodes are added, their complexity is greatly increased, making it difficult to optimize the network to adapt to the data. The method proposed in this paper, however, enables the compartmental network to be expanded to the required size without concerning the complexity of the network and the size of the differential equation system. In other words, one can simply add the required nodes, such as reinfection after recovery and vaccine side effects, and then use Monte Carlo sampling method to optimize the network. This will be studied by authors in the future. Moreover, SEIVR-QH parameters are assumed as variables in the back analysis and incorporated in matching of the time histories. However, in addition to the model parameters, the fractional order of the differential equations is also defined as random variables and employed in the formulation of the Monte Carlo sampling method. This allows for a higher flexibility to fit the model to the data and thus, increases the model accuracy. Secondly, the probabilistic characteristics of the order of the differential equation, namely the mean and the standard deviation, are obtained from the optimal results of the first stage of the back analysis. This research is a practical step in defining the randomized fractional problem in solving the differential equations of disease spread. This is the subject of ongoing research by the authors on modeling the spread of COVID-19. This manuscript has associated data in a data repository. [Authors' comment: All data used in this paper are uploaded in Mendeley Data and are accessible via [51] .] Ecological Paradigms Lost Routes of Theory Change Daniel Bernoulli's epidemiological model revisited Viral load kinetics of SARS-COV-2 in hospitalized individuals with Covid-19, Open Forum Infectious Diseases ofab153 The role of health preconditions on Covid-19 deaths in Portugal: evidence from surveillance data of the first 20293 infection cases Exact properties of SIQR model for COVID-19 Mathematical models of infectious disease transmission Nonlinear dynamics of Covid-19 pandemic: modeling, control, and future perspectives Understanding Covid-19 nonlinear multi-scale dynamic spreading in Italy Rare and extreme events: the case of Covid-19 pandemic Epidemic analysis of Covid-19 in China by dynamical modeling. medRxiv Worldometer Covid-19 data Analysis of lockdown perception in the United States during the COVID-19 pandemic A review of mathematical modeling, artificial intelligence and datasets used in the study, prediction and management of COVID-19 Network autoregressive model for the prediction of Covid-19 considering the disease interaction in neighboring countries Compartmental models of the Covid-19 pandemic for physicians and physician-scientists Mathematical modeling of Covid-19 transmission dynamics with a case study of Wuhan Epidemic model of Covid-19 outbreak by inducing behavioural response in population Modeling and forecasting the Covid-19 pandemic in India Measuring and preventing Covid-19 using the sir model and machine learning in smart health care Simulating the progression of the Covid-19 disease in Cameroon using sir models A sir model assumption for the spread of Covid-19 in different com Time-variant reliability-based prediction of COVID-19 spread using extended SEIVR model and Monte Carlo sampling A Seir model for control of infectious diseases with constraints A modified Seir model for the spread of Ebola in western Africa and metrics for resource allocation Prediction of the Covid-19 epidemic trends based on Seir and AI models A simulation of a Covid-19 epidemic based on a deterministic Seir model SEIR model for COVID-19 dynamics incorporating the environment and social distancing Quantitative laws in metabolism and growth The model muddle: in search of tumor growth laws A fractional order epidemicmodel for the simulation of outbreaks of influenza A(H1N1) A method for solving differential equations of fractional order A latency fractional order model for HIV dynamics The dynamics of Covid-19 in the UAE based on fractional derivative modeling using Riesz wavelets simulation Numerical simulation and stability analysis for the fractional-order dynamics of Covid-19 A fractional-order model describing the dynamics of the novel coronavirus (Covid-19) with nonsingular kernel Fractional model of Covid-19 applied to Galicia, Spain and Portugal A mathematical model to examine the effect of quarantine on the spread of coronavirus On the necessity of proper quarantine without lock down for 2019-nCov in the absence of vaccine SEIR modeling of the COVID-19 and its dynamics Simplified algorithm for reliability sensitivity analysis of structures: a spreadsheet implementation Fractional-Order Nonlinear Systems: Modeling Introduction to Fractional Differential Equations Global existence theory and chaos control of fractional differential equations A predictorcorrector approach for the numerical solution of fractional differential equations Approximation of the Monte Carlo sampling method for reliability analysis of structures Probabilistic approach for optimal portfolio selection using a hybrid Monte Carlo simulation and Markowitz model Mathematical analysis of a stochastic model for spread of coronavirus A stochastic mathematical model for Covid-19 according to different age groups John Hopkins university Covid-19 data COVID-19 data of Thailand includes infected, recovered, dead, and vaccinated cases from 2021/02/28 to 2021/08/30. Mendeley Data