key: cord-1024467-ixfco3p3 authors: Zandavi, Seid Miad; Rashidi, Taha Hossein; Vafaee, Fatemeh title: Forecasting the Spread of Covid-19 Under Control Scenarios Using LSTM and Dynamic Behavioral Models date: 2020-05-24 journal: IEEE trans. cybern. (Online) DOI: 10.1109/tcyb.2021.3120967 sha: fbf5b7eafe28fbebbbba4b61600921f5b5525691 doc_id: 1024467 cord_uid: ixfco3p3 To accurately predict the regional spread of Covid-19 infection, this study proposes a novel hybrid model which combines a Long short-term memory (LSTM) artificial recurrent neural network with dynamic behavioral models. Several factors and control strategies affect the virus spread, and the uncertainty arisen from confounding variables underlying the spread of the Covid-19 infection is substantial. The proposed model considers the effect of multiple factors to enhance the accuracy in predicting the number of cases and deaths across the top ten most-affected countries and Australia. The results show that the proposed model closely replicates test data. It not only provides accurate predictions but also estimates the daily behavior of the system under uncertainty. The hybrid model outperforms the LSTM model accounting for limited available data. The parameters of the hybrid models were optimized using a genetic algorithm for each country to improve the prediction power while considering regional properties. Since the proposed model can accurately predict Covid-19 spread under consideration of containment policies, is capable of being used for policy assessment, planning and decision-making. T THE outbreak of coronavirus disease 2019 has exposed the world to great challenges and is a serious concern for public health. The outbreak started in Wuhan, China, in December 2019 [1] , [2] and within a few weeks it spread across the globe. This caused policy changes regarding the control of the spread. There is a lack of information and uncertainty about this outbreak, making it important to understand its dynamic behavior. Forecasting the outbreaks behavior over time can provide useful insights into the epidemiological situation [3] and determine whether the pandemic has been brought under control by mitigation measures [4] , [5] . Research is currently forecasting changes in infectious diseases [6] , predicting the international spread of outbreaks [7] , and assessing the impacts of alternative interventions during pandemics [8] . However, research is faced with many challenges, particularly related to time. There may be delays in the presentation of symptoms due to the incubation cycle and delays in verifying Seid M. Zandavi detection and testing events. Delays and uncertainties can be taken into account by models, especially those stemming from normal infection histories and reporting processes [9] . Besides, some aspects of outbreak dynamics can be biased, incomplete, or only reported by individual data sources. There is evidence that synthesis approaches will permit a more robust estimate of the dynamics underlying the transmission based on noisy data [10] , [11] . In order to determine the potential trajectory of the disease in accordance with the evidence, a dynamic model can be used. These predict issues such as how an infection progresses, how the number of cases/deaths is affected, or how long an outbreak lasts. In traditional approaches, SEIR models (representing the population groups susceptible, exposed, infectious and recovered) have been used to analyze the spread of Covid-19 [2] , [12] . Such models include feedback that regulates endogenous changes in contact rate, testing, diagnostics, and reporting in response to risk perception and other relevant factors. In these compartmental models [2] , populations are divided into compartments and people are modelled by (ordinary) differential equations within the four SEIR groups. Generative models reflect another broad variety of models with causal effects (using hidden states and parameters). For example, a generative model simulates the dynamics of effects in a group or population (i.e., new COVID-19 cases over time) [13] . These methods can measure the impact of policies (e.g. social distancing) and demographic variations (e.g., public immunity) in order to anticipate what might happen in a particular area under various conditions [14] . Accordingly, SEIR-like models have been used to imitate disease outbreaks by, for example, estimating the parameters of Bayesian Markov-Chain Monte Carlo (MCMC) models [15] , [16] or comprehensively discussing scenarios [17] , [18] . This family of models also has recently played a dominant role in studying the overall outbreak of the coronavirus, from inference [2] , [19] to scenario prediction [1] under control strategies [20] . Here, we propose a model of time-dependent spreading movement. The time-dependence is determined through potential changes representing human and system behaviors. Therefore, a hybrid dynamic model is proposed to acquire a robust estimation of the exclusive use of region properties, which includes three main sections: A task model, facility model, and dynamic motion model. The models estimate the behavior of the Covid-19 outbreak in a particular region. The first model represents general behavior in public, such as public knowledge and how to follow the rules and knowledge. The facility model considers facilities such as hospitals, including emergency and inpatient departments, and medical staff and their knowledge, rules, and skills. The last model predicts the dynamic behavior of time-varying cases and deaths in the nominated country. The motion model uses an artificial neural network using Long Short Term Memory (LSTM) to update itself in line with stochastic behaviors and uncertainties in the proposed framework. Our framework is structured to assess past intervention efficacy and to analyze future possibilities that spread uncertainty. This model can be easily adapted to any country or region. The top-ten most-affected countries, including Australia, are studied with data collected between 31 December19 April 2020 obtained from the European Centre for Disease Prevention and Control (ECDC). Furthermore, 90 % of this data is used to train the hybrid dynamic model and 10 % is used to analyze the performance of the model in the proposed framework. This paper is organized as follows. Section II describes the methods. The problem formulation is explained and introduced in Section III. The results and discussion are presented in Section IV. Finally, the paper ends with a conclusion. Recently, deep learning has garnered the attention of many researchers in different areas. Deep learning usually defines multiple layers considering its architecture and uses a stochastic optimization algorithm to calculate the weight and bias parameters for each layer. As designed its architecture to perform the machine learning tasks, the number of depths (i.e., number of hidden layers) is directly correlated with the learning ability [21] . In particular, LSTM, a form of recurrent neural network (RNN), is able to update during the sequence of learning as it has feedback connections, unlike a feedforward neural network. This is the key point in using LSTM to forecast with time series data. For example, it has been applied to many areas such as image captioning [22] , [23] , [24] , natural language translation [25] , and speech recognition [26] . Many of these active areas focus on classification and applications to forecast/regression models are relatively limited. This paper aims to make a hybrid model based on LSTM and a dynamic behavioral model to forecast the spread of Covid-19 across the world from a dynamic modelling perspective. The dynamic behavioral model is introduced to describe and predict the interactions between multiple components of a phenomenon that are viewed as a system, which includes many inputs and outputs interacting over time. The dynamic model focuses on the mechanism of how the components and system evolve across time. Therefore, dynamic modeling allows us to bridge the gap between conceptualizing the phenomena of dynamic behavior and particular phenomena. Dynamic system modeling is used in many academic fields, originating in mathematics and physics before being adopted in the life, social, and behavioral sciences. It is clear that dynamic system models combined with machine learning techniques can play an essential role in data analysis and the way theories are conceived and developed. In order to forecast Covid-19, the proposed hybrid model consists of three main modules: 1) time series/sequencing learning (i.e., LSTM) with time-variant dynamics, 2) public behavior and 3) a behavioral model of the system. Figure. 1 shows that Covid-19 forecasting includes three main sections. First, the task model describes external conditions and reference inputs during covid-19 epidemics. This module can describe factors such as social distancing, social knowledge, self-isolation, etc. People decide whether to conform to tasks or not, observing both external conditions and the current situation. Additionally, the reference inputs are the ideal states of public behavior during the covid-19 epidemic, so the people should control this epidemic. Second, the facility model describes covid-19 control behavior during the outbreak. This model focuses on technologies, supplies, professional personnel, etc. Such modelling can reduce the number of deaths and increase the number of discharged patients. In this model, personnel are modeled based on human features such as workload, fatigue, and conditions that affect the performance of medical staff. Lastly, dynamic motion is modelled using the artificial neural network based on LSTM to predict numbers of cases and deaths. The details of each model are introduced in the following sections. LSTM is a particular form of RNN capable of learning long-term dependency, and has fundamental differences to a conventional feedforward neural network. They are sequencebased models that are able to establish the temporal correlations between previous information and current circumstances. In times series problems, like forecasting the spread of Covid-19, using a sequence-based model in an LSTM means that the decision an LSTM made at time t − 1 affects the decision it will make at the next time, t. The feature (i.e., feedback connections) plays an important role in imitating the systems dynamic motion, since it takes daily information into account when the subsequent information is entered. According to back-propagation through time, RNNs suffer from long-range dependencies because of gradient vanishing and exploding [27] . Gradient vanishing in RNN refers to problems where the norm of the gradient for long-term components decreases exponentially fast to zero, limiting the models ability to learn long-term temporal correlations, while gradient exploding refers to the opposite event. Although LSTM has been introduced to address the issue [28] , the forget gate in the LSTM architecture boosts the performance of the model [29] . This feature opens a new avenue for many sequencelearning applications. Here, the general structure of LSTM is described with naming similar to that of Ref [30] . Let {x 1 , x 2 , ..., x T } present a sequencing input for an LSTM model (i.e., the general structure is illustrated in Fig. 2 ). In Fig. 2 , x t is a kdimensional real vector at the t-th time step. In order to establish temporal connections, the LSTM defines and maintains an internal memory cell state throughout the whole life cycle, which is the most important element of the LSTM structure. The memory cell state s t−1 interacts with and the subsequent input x t to determine which elements of the internal state vector should be updated, maintained or erased based on the outputs of the previous time step and the inputs of the present time step. In addition to the internal state, the LSTM structure also defines an input node g t , input gate i t , forget gate f t , and output gate o t . Equation 1 -Eq. 6 gives the formulations for all nodes in an LSTM structure. weight parameters for the corresponding input of the network activation function; σ and φ are a sigmoid function and tanh(.), respectively. The sigmoid function with an output range of [0, 1] works as a soft switch for the forget gate (f t ), input gate (i t ), input node (g t ), and output gate (o t ). This means that it is a decision-making point determining whether the signal/sequencing data should pass the gate or not. For example, if the output of the sigmoid function is zero, there is no signal for the prediction. Thus, all gates (forget gate, input gates, input node and output gates), are directly depended on the current x t and previous output H t−1 . The input gate decides what to maintain in the internal state while the forget signal is carried out from the previous state (s t−1 ) by the forget gate. In order to update the internal state, the output gate points out which internal state s t should pass as the LSTM output h t . This process then continues to repeat for the next time step. All the weights and biases are learned by minimizing the differences between the LSTM outputs and actual training samples. Besides, information on the current time step can be stored and maintained to affect the LSTM output of future time steps. Here, LSTM is designed to estimate the movement of Covid-19s spread with consideration of uncertainties. The stochastic behavior of the system is modeled as a dynamic behavioral model to update the LSTM states over time. The dynamic behavioral model describes the behavior of a system on how its elements interact with each other. The system interactions provide the functionality of the system and are used for systems and subsystems. A behavioral model represents the temporal behavior between different subsystems, while an entity interacts over time. Thus, modeling of the interactions among subsystems, the functionality of the entity over time, and the setting of roles, etc. is introduced as dynamic behavior in the system because the system performance is determined through the engagement of each module over time. Generally, systems and subsystems interact to accomplish a purpose by exchanging information to submit roles with expectations, from which functions are presented to ensure that action is taken. Also, an interaction over time involves communication (i.e., signals in time-series interactions), transfer of knowledge, receiving and collecting data. Thus, data manipulation/changing system behavior over time is introduced as a dynamic role. A dynamic behavioral model is constructed to demonstrate one or more interactions within the system that are responsible for task accomplishment. Accordingly, general public behavior, skilled medical staff, and high-quality hospitals cooperate as a dynamic system to manage the spread of Covid-19 in an area. The problem is to forecast the number of cases and deaths during Covid-19 pandemics. The hybrid model using LSTM and the dynamic behavioral model are introduced to achieve good predictions. Covid-19 forecasts help to take more attention into account when the number of cases is increasing. In this section, the structure of the proposed framework (see Fig. 1 ) is introduced. Also, the formulation of the modules involved, such as the task model, medical staff model, hospital as facility model, and dynamic motion model, are presented in detail. In the proposed framework, the task model describes the behavior of the public in the Covid-19 time. It is categorized into two groups: an external condition and reference input. The external condition provides all the environmental factors related to people deciding whether to maintain social distancing or not. The reference input describes the ideal state in the country, such as public knowledge, keeping updated with the latest news, and government rules. It is obvious that the task model is very uncertain due to its dependency on public behavior. Thus, the uncertainties are modeled by stochastic colored noise. The colored noise was generated by white noise that was Gaussian-distributed with a zero mean. Therefore, the dynamic is presented in Eq. 7. where x is the task model and is the white noise. Colored noise is the spectral density calculated by the Fouriertransform of the auto-correlation of white-noise ( (t)). The auto-correlation is formulated in Eq. 8 where D and τ are the noise intensity and correlation time, respectively. The Laplace transform of the auto-correlation of introduced colored noise is expressed as Eq. 9. where A = D τ and B = 1 τ . The transfer function used to model the task model is G ColoredN oise . It is obvious that the colored noise determines the uncertain behavior in terms of the physical phenomena of the motional system. According to McRuer's crossover model [31] , humans in a dynamic system behave in a way that results in an open-loop transfer function, which is formulated as a Laplace form in Eq.10. where Y h and Y p represent human and plant transfer functions, respectively. ω c is the crossover frequency that describes human operations and adaptation during a compensatory situation. Figure 3 shows that the human behavior model is formulated with three elements: delay in performance, equalization form, and medical staff (according to their knowledge, rules, and skills). Thus, the model is generalized and adjusted by using a describing function form and a mitigating set of rules of human characteristics, such as knowledge, rules, and skills. The generalized/adjusted human function, according to McRuer's crossover model, is formulated in Eq. 11 . where k p , T L and T I are the gain, time-lead and time-lag constant in the equalization from of the model shown in Fig. 3 . Here, e −T s represents the delay in staff response, which is described by the time-constant (T ). H KRS is the transfer function of medical staff according to their knowledge, rules, and skills. Having generalized and adjusted the human model, knowledge, rules, and skill play important roles in staff performance. In this regard, the model consists of four modules: human senses such as visual and audio recognition, a cognitive model, plans and rules for different forms of tasks, and medical staffs physical actions and performance efficiency. Figure 4 illustrates the general model of medical staff. Human behavior and performance are highly correlated with visual and auditory recognition. These human senses are heavily dependent on workload, fatigue, and working hours. In the time-domain, the function of the human senses is modeled as an exponential function (see Eq. 12). f hs (t) = a + be −ct (12) where a, b and c are constant parameters in the proposed model. Laplace transform is applied to fully describe the In order to familiar form in Eq. 13, some replacements in the formula are made (see Eqs. 14 -16 ). T Alternatively, the familiar form of the human sense transfer function can be simplified as in Eq 17. F hs (s) = k p hs T L hs s + 1 s(T I hs s + 1) where k p hs is the gain in the system, which describes the quality of human senses while submitting jobs. T L hs and T I hs represents the time-lead and time-lag in the proposed module. The signal output from the human sense module passes through the cognitive model, bridging the gap between understanding what to preserve and what to perform under some circumstances. As per the system defined in Fig. 4 , an expression of the cognitive module is related to system outputs and inputs representing the conservation-of-mass principle. Besides, the cognitive model is formulated as per Eq. 18. where P cog is the probability of understanding the task correctly in uncertain circumstances. The term InputF low cog (t) − OutputF low cog (t) represents the length of the cognitive process in the human mind, which can be formulated as the input coming from the human senses with the delay in output considering the length of T cog . Therefore, the formulation and transfer function of the cognitive model are as presented in Eq. 19 and Eq. 20. The Laplace transform is as below. The transfer function is simplified in Eq. 21. After the cognitive model has been defined, medical staff need to make a plan for their decision. Here, they need to plan and obey the rules, somehow make the limitation in jobs. In this matter, a saturation function in Laplace form is considered. A saturation function is introduced to define a threshold in the systems response when the input exceeds the limit. At this time, the output becomes constant at the highest level of the threshold. Thus, the plans and rules module in the proposed system can be mathematically expressed as in Eq. 22 . where F pr (s) is the transfer function of the plans and rules module represented in Fig. 4 . Finally, the performance and physical actions of medical staff are like a second-order dynamic system. According to the McRuer crossover theorem, the proposed model can be formulated by Eq. 23. where A, B, C, D are constant parameters related to the nature of the system. f (t) is the performance response when the plans and rules are followed by the staff. The Laplace transform of this proposed model is calculated as per Eq. 24. F a (s) = d As 2 + Bs + c U pr (s) The transfer function is replaced with a similar form as that in Eq. 25. where ω n and ξ are the natural frequency and damping ratio, respectively. These parameters define how the dynamic model can behave in the system. When viewing a hospital as a system, it can be divided into two main departmentsemergency and inpatientswhich are the key departments that affect the quality and timeliness of patient care. Emergency capacity must be flexible throughout the day as patients arrive according to a non-homogeneous arrival pattern. Also, the inpatient sector usually focuses on maintaining bed occupancy levels to improve efficiency in terms of utilizing resources. The proposed model describes the maximum occupancy level and planned capacity. The proposed hospital model consists of three patient arrival sources: medical direct admissions, emergency walkin patients, and emergency ambulance arrivals. While the emergency department is crowded with a considerable number of walk-in and ambulance arrival patients, diversion might carry out in the ambulance arrival. Under emergency diversion, the number of patient arrivals is reduced as ambulances are rerouted to nearby alternative hospitals. At this point, emergency walk-in patients must be registered. Besides, medical direct admission patients are sent directly to the inpatient department upon arrival to the hospital. Similar to ambulance arrivals, accepting patients to the inpatient department is heavily dependent on the availability of beds. In this regard, the diversion might be performed when crowding becomes a problem. Note that the inpatient department includes many individual sectors/units, such as the intensive care unit (ICU), telemetry units, medical/surgical units, etc. However, this work focuses on the performance of hospitals with high-demand units under conditions of a Covid-19 outbreak. Figure 5 presents the general model of a hospital during a Covid-19 pandemic with consideration of high-demand departments such as emergency and inpatient departments. According to the conservation-of-mass principle, continuous-time patient flow in the proposed hospital model can be formulated using differential equations according to disturbances and manipulated variables to measured outputs. Equations 26 to 31 are formulated to describe changes and hospital tasks. Ambulance arrivals are correlated with hospital decisions of accepting or diverting some or all patients to nearby hospitals. This matter is formulated as Eq. 26. where f h1 (t) is ambulance diversion and u 1 (t) describes the decision between ambulance arrival or ambulance diversion. Additionally, the emergency queue is a module that makes decisions about transferring patients emergency queue to delivering services to each individual patient. The mathematical expression is formulated in Eq. 27. where f h2 (t) is the emergency queue and d 1 (t) and d 2 (t) are the time-varying rates of ambulance and walk-in patient arrivals, respectively. d 1 (t) and d 2 (2) are modeled as colored noise due to a lack of information about the rate of arrivals. u 2 (t) is similar to u 1 (t) and represents the decision to transfer patients in emergency sector to visiting patients or delivering services. After completing emergency arrival, it is time to perform services in the emergency department. Here, the length of treatment is a crucial consideration. Equation 28 shows the duration of progress in emergency treatment. where f h3 (t) and θ emg are the emergency services and length of treatment, respectively. Emergency holding is a decision about whether to hold patients in the emergency department or transfer them to inpatient beds. At this point, medical staff decide to transfer emergency patients directly to the inpatient department for further care. Equation 29 describes the emergency holding decision. where f h4 (t) represents emergency holding and u 3 (t) is the decision to transfer a patient in the holding sector to the inpatient department. k inp is the probability that emergency patients are admitted directly to the inpatient sector. Also, the rate of medical admission is modeled as a decision to deliver services directly to the patient (see Eq. 30). where f h5 (t) and u 4 are the medical diversion and direct admission decisions, respectively. Finally, the hospital model describes the number of patients being treated in the inpatient department. The length of treatment in this sector is of primary importance. Therefore, controlling inpatient stays and outpatient services directly influences the number of Covid-19 cases. In this regard, the model is formulated as per Eq. 31. where f h6 (t) represent inpatient services. d 3 and θ inp are the time-varying rates of direct patient admissions (i.e., modeled as colored noise) to the inpatient section and the length of treatment in the inpatient department, respectively. The Laplace transform of each module is represented in Eq. 32 to Eq. 37. The dynamic motion model is introduced to describe timevarying cases and deaths. After complete modeling of the public tasks and facilities, such as hospitals and medical staff behavior, movement is of importance in the proposed model. The time-varying movement is obviously dependent on the proposed models such as the tasks and facilities models (see Fig. 1 ). The proposed framework is determined by how public tasks, facilities, and medical staff behavior affect the numbers of cases and deaths during a Covid-19 outbreak. In this dynamic motion module, LSTM is utilized as a dynamic model in the proposed system. LSTM is a time-series model, which is able to estimate the temporal correlations acquired by the simulation model and real data from previous and current circumstances. Therefore, the decision the LSTM makes at time t − 1 affects the decision it will be make at the next time step, t. The LSTM's feedback connections imitate the systems dynamic motion, since it takes daily information into account when the subsequent information is entered. Here, the proposed LSTM architecture is shown in Fig. 6 . In this section, the performance of the proposed hybrid model is evaluated using the latest available public data on Covid-19 1 . From the worldwide distribution of Covid-19 cases, the top-ten most-affected countries, which include Table I , can be measured by two approaches. First, by extracting the impact of each property using a heuristic optimization algorithm [32] . Second, by utilization of actual data and the observed behavior of the whole system if it is available. In this paper, a genetic algorithm (GA) was applied to actual Covid-19 distribution data to determine the impacts of the different modules in the proposed hybrid model. Dynamic motion is represented by the LSTM, which is highly correlated with the sequence length in training the model. The sequence length plays an essential role in the prediction of Covid-19s spread because mitigation measures and policies are being improved daily. In this regard, sequence lengths of 110 days are used as the primary parameters in training the LSTM. The boxplot (see Fig. 7 ) shows the distribution of RMSEs across the different countries for each sequence length. A seven-day sequence length was found to provide lower variability and shorter outliers than other sequence lengths, while both cases and deaths are matter. Thus, a seven-day sequence length was selected to analyze the impacts of the different modules in the proposed hybrid model. Table II lists actions, and the hospital, which is modeled in terms of direct medical registration, ambulance arrivals, emergency walkins, and emergency services. The DM represents dynamic movement over time as modelled by LSTM. Here, the USA, the most-affected country, was chosen to investigate the importance of the modules in the hybrid model. Genetic Algorithm (GA) was used to find best parameters for each model. Accordingly, GAs with 200 population size and 500 performed to minimize the root mean square error (RMSE) between the predicted and observed data. The RMSE is calculated as follows: wherex t and x t are the observed and predicted values, respectively. To assess the statistical significance of the reduction in RMSE in hybrid models compared to LSTM, each module was evaluated 500 times after hype-parameter tuning, and the corresponding RMSE distribution was used to estimate 95% confidence interval (CI) and t-test p-values comparing significant differences between stage 11 and other stages. Table II and Fig. 8 ) clearly show the cumulative effect. As seen in Fig. 7 , adding TM (stage 2) significantly improved the accuracy (i.e., 250% better than LSTM). Then, adding more modules (i.e., greater similarity to the real environment) on top of TM (i.e., stages 3 to 11) was able to improve the models performance in comparison with the previous stage. The cumulative results show that having greater similarity to the real environment helps to obtain a more accurate model. Figure 8 demonstrates that not only can the whole modeled system, encompassing everything from public behavior to hospital performance, reach significant accuracy, it also imitates changes in Covid-19 time. For instance, Fig. 9 shows a comparison of LSTM and the proposed model, from which it presents that the hybrid model can more accurately estimate real ambient behavior. The parameters of the hybrid model were tuned by GA Having tuned the hybrid model's parameters, the hybrid model was evaluated using test data. In Table IV , the performance of the hybrid model is compared with that of LSTM. In this table, RMSEs were obtained for cases and deaths in different countries. The results show that the performance of the hybrid model is significantly better than that of LSTM. Figure 10 to Fig. 13 illustrates that the hybrid model was capable of more accurate prediction than the LSTM model with all case studies. Not only can the model predict the behavior/trend, it also provides accurate estimates of the numbers of cases and deaths under considerable uncertainty. In these figures, forecasts for the next 20 days are provided. As seen, Spain, France, Germany, China, and Australia are able to control the number of cases at zero cases per day. However, the numbers of cases in the USA and Turkey in the next 20 days are around 25,000 and 2,000 per day, respectively. Therefore, these countries need greater restrictions and improved public knowledge and behavior if they are to decrease the number of cases. During this period, there is insufficient control in Iran and the United Kingdom. As the model is sensitive to large fluctuations in cases and deaths, care should be taken to maintain a stable trend. Besides, Fig. 12 and Fig. 13 show that LSTM crashes when estimating cases in Australia and France because of negative values of cases. A novel hybrid model using LSTM and dynamic behavior models was proposed to forecast the spread of Covid-19 in the most-affected countries. Many factors affect the spread of this virus, so it is very difficult to make the right predictions of cases and deaths. In this regard, the LSTM and dynamic behavioral models were used to model a dynamic system with a high level of fidelity. The results show that the hybrid model can accurately predict the spread of Covid-19 based on real data. The proposed hybrid model provides robust estimates with the exclusive use of regional properties. Adding more modules and using real data for different modules can substantially improve the models performance. In this paper, a TM, FM (including medical staff and hospitals), and DM were constructed to forecast the spread of Covid-19 in the top ten most-affected countries. Public knowledge and behavior can directly impact the spread of Covid-19. Additionally, skilled medical staff and high-quality hospitals can control the outbreak. How will country-based mitigation measures influence the course of the covid-19 epidemic? Early dynamics of transmission and control of covid-19: a mathematical modelling study Temporal changes in ebola transmission in sierra leone and implications for control requirements: a real-time modelling study Transmission dynamics of the etiological agent of sars in hong kong: impact of public health interventions The impact of control strategies and behavioural changes on the elimination of ebola from lofa county, liberia The rapidd ebola forecasting challenge: Synthesis and lessons learnt Delaying the international spread of pandemic influenza Evaluation of the benefits and risks of introducing ebola community care centers, sierra leone Early epidemiological assessment of the virulence of emerging infectious diseases: a case study of an influenza pandemic Evidence synthesis for stochastic epidemic models Assessing optimal target populations for influenza vaccination programmes: an evidence synthesis and modelling study Simulation-based estimation of the spread of covid-19 in iran Dynamic causal modelling of covid-19 Inferring covid-19 spreading rates and potential change points for case number forecasts Bayesian inference for stochastic epidemics in populations with random social structure Epidemiological and ecological determinants of zika virus transmission in an urban setting Pulse vaccination strategy in the sir epidemic model Strategies for containing ebola in west africa Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (sars-cov2) Bistability of sudr+ k model of epidemics and test kits applied to covid-19 Representation learning: A review and new perspectives Show and tell: A neural image caption generator Deep visual-semantic alignments for generating image descriptions Deep captioning with multimodal recurrent neural networks (m-rnn) Sequence to sequence learning with neural networks Towards end-to-end speech recognition with recurrent neural networks Short-term residential load forecasting based on lstm recurrent neural network Long short-term memory A critical review of recurrent neural networks for sequence learning Learning to forget: Continual prediction with lstm Mathematical models of human pilot behavior Stochastic dual simplex algorithm: A novel heuristic optimization algorithm