key: cord-1049287-p2h4h8co authors: Gao, Shan; Wang, Hanyi title: Scenario prediction of public health emergencies using infectious disease dynamics model and dynamic Bayes date: 2021-09-21 journal: Future Gener Comput Syst DOI: 10.1016/j.future.2021.09.028 sha: e960fdc1c4e601e4b352ec9454d29de4e241f515 doc_id: 1049287 cord_uid: p2h4h8co This study was aimed to discuss the predictive value of infectious disease dynamics model (IDD model) and dynamic Bayesian network (DBN) for scenario deduction of public health emergencies (PHEs). Based on the evolution law of PHEs and the meta-scenario representation of basic knowledge, this study established a DBN scenario deduction model for scenario deduction and evolution path analysis of PHEs. At the same time, based on the average field dynamics model of the SIR network, the dimensionality reduction process was performed to calculate the epidemic scale and epidemic time based on the IDD model, so as to determine the calculation methods of threshold value and epidemic time under emergency measures (quarantine). The Corona Virus Disease (COVID) epidemic was undertaken as an example to analyze the results of DBN scenario deduction, and the infectious disease dynamics model was used to analyze the number of reproductive numbers, peak arrival time, epidemic time, and latency time of the COVID epidemic. It was found that after the M1 measure was used to process the S1 state, the state probability and the probability of being true (T) were the highest, which were 91.05 and 90.21, respectively. In the sixth stage of the development of the epidemic, the epidemic had developed to level 5, the number of infected people was about 26, and the estimated loss was about 220 million yuan. The comprehensive cumulative foreground (CF) values of O1 [Formula: see text] O3 schemes were -1.34, -1.21, and -0.77, respectively, and the final CF values were -1.35, 0.01, and -0.08, respectively. The final CF value of O2 was significantly higher than the other two options. The household infection probability was the highest, which was 0.37 and 0.35 in Wuhan and China, respectively. Under the measures of home quarantine, the numbers of confirmed cases of COVID in China and Wuhan were 1.503 (95% confidential interval (CI) = 1.328 [Formula: see text] 1.518) and 1.729 (95% CI = 1.107 [Formula: see text] 1.264), respectively, showing good fits with the real data. On the 21st day after the quarantine measures were taken, the number of COVID across the country had an obvious peak, with the confirmed cases of 24495, and the model prediction value was 24085 (95% CI = 23988 [Formula: see text] 25056). The incubation period 1/q was shortened from 8 days to 3 days, and the number of confirmed cases showed an upward trend. The peak period of confirmed cases was advanced, shortening the overall epidemic time. It showed that the prediction results of scenario deduction based on DBN were basically consistent with the actual development scenario and development status of the epidemic. It could provide corresponding decisions for the prevention and control of COVID based on the relevant parameters of the infectious disease dynamic model, which verified the rationality and feasibility of the scenario deduction method proposed in this study. methods commonly used in the performance of unconventional emergencies, and constructed the methods and steps for the selection and application of the performance of the scenarios in the various stages of the disposal process. The infectious disease dynamics model (IDD model) is an effective tool for the study of infectious diseases. It can effectively simulate the spreading mechanism and law of infectious diseases, and can provide a powerful reference value for formulating effective preventive measures. The traditional dynamic warehouse model, continuous infectious disease dynamic model, and random IDD model have played an important role in the prevention and control of infectiousness, but their assumptions about contact among individuals are equal to probability [8] , which does not match the actual condition. Bayesian network is an acyclic graph composed of directed edges representing variable nodes and connection points, and has been widely used in knowledge expression and reasoning fields [9] . But it still has the uncertainty of the knowledge meta-model [10] . The dynamic Bayesian network (DBN) dds a time factor on the basis of a static Bayesian network, which can effectively maintain the continuity of events [11] , but currently there are few studies on the application of DBN to PHEs scenario prediction. In summary, there are some shortcomings for the traditional IDD model in the prediction of public health events. The DBN algorithm shows obvious advantages in the process of knowledge element expression, but there are few studies on its application in the prediction of PHEs. Therefore, based on the uncertain characteristics of PHEs, an IDD model and DBN scenario deduction model, were constructed in this study, and the scenario deduction of IDD model and DBN to PHEs was discussed, aiming to provide a reference basis for the establishment of emergency decision-making for PHEs. Knowledge element is the smallest entity unit in which each element of an event cannot be separated. Using common knowledge elements to represent public health emergencies helps to deeply understand the evolution process of PHEs and the interrelationships among various elements [12] . For knowledge object A, the situational common knowledge can be expressed as follows: If the constraint relationship among the internal attributes of a thing is r R  , the knowledge meta model can be expressed as follows: The knowledge representation of the situation was the premise and basis for the evolution of the situation model and reasoning. Through the analysis of the evolution path of the situation, the development trend and influencing factors of public health events can be analyzed, and scientific and effective decision-making can be established within a limited time [13] . The situational knowledge representation model is generally composed of three parts: situational status, emergency treatment methods, and emergency goals [14] . For a PHE P, its environmental (E) and control (C) conditions were analyzed. Based on its current situational state (S), the event state (I), the event life cycle state (Elc), and the disaster-bearing body state (S) were analyzed. In addition, corresponding emergency treatment plans M1, M2, and M3 were established to achieve the corresponding emergency goals G1, G2, and G3. The specific structure of the situational knowledge representation model was shown in Bayesian network is one of the most effective models in the field of uncertain knowledge expression and reasoning [15] . For the cause set H of causality in the Bayesian network, the result is expressed as I, then H I  and i H ,i 1, 2, , n H   . In the Bayesian network reasoning process, the reason set for a given node is combined with its result independently [16] . Therefore, the joint probability of all nodes represented by the Bayesian network can be expressed as equation (3) DBN was added with a time factor, which made event reasoning and development time had a high degree of continuity and consistency [17] . For an event with L event fragments, the joint probability of all nodes can be expressed as follows: k 11 12 1 Ln 11 12 1 Lm 11 12 1 Lm , ,k , , , The scenario deduction model of PHEs based on DBN firstly determined the network node variables in the process of handling public health emergencies based on the key elements forming node variables. Then, it established the causal relationship among the node variables according to the sudden child labor and health incidents, and formed an acyclic network structure diagram. Finally, it incubated the corresponding condition probability for each node according to the Bayesian network structure. The specific PHEs processing flow based on DBN was shown in Figure 2 . There is obvious heterogeneity in contact among individuals [18] , and IDD model based on network control measures is of great significance in the scenario prediction of public health events. [19] . Due to the limitation of calculation skills and process, it is difficult to describe the epidemic time of an epidemic in a clear numerical expression [20] . For a degree-uncorrelated network whose distribution is A(k), its probability generating function can be expressed as below equation: The first and second moments of the degree distribution are expressed as below equations: Then, the average field dynamics model of the SIR network can be expressed as below equations: In the above equations,   k C t referred to the relative density of restored nodes at time t and degree k;  was the probability of contact between an infected node and a susceptible node, α represents the recovery rate; and represented the probability of any edge connecting to the infected node at time t. In addition, It was assumed that the probability of an optional edge connected to a susceptible node was   The calculation method of the basic reproductive numbers of the mean field dynamics model of the SIR network was shown in equation (12) below:  , it meant that the infectious disease was a small-scale outbreak; and 1  indicated that the infectious disease was a pandemic or a final outbreak. In order to quickly obtain the outbreak scale and epidemic time of the average field dynamics model of the SIR network, it had to be reduced in dimensionality. The mean field dynamics model of the SIR network after dimensionality reduction can be expressed as below equation: In the equation (13) (14): According to the algorithm restraint of the above-mentioned dimensionality reduction SIR network average field dynamics model, when represented the probability that after the end of the disease, there was still no infection on any of the edges. Then the final density of susceptible nodes after the end of the infectious disease can be expressed as follows: The scale of the epidemic outbreak based on the mean field dynamics model of the reduced-dimensional SIR network can be expressed as follows: It was assumed that there was an initial infected node in the network as the initial moment, and the end time when the last infected node in the network was restored after an epidemic, this period was approximated as the epidemic time [21] . Then at the initial time t = 0, there was only one infected node in the network. Then, the epidemic time of infectious diseases based on the mean field dynamics model of the reduced-dimensional SIR network can be expressed as below equation: In the equation (17) above, 1*  represented the recovery of the last node of the infectious disease after the epidemic had spread. In the spread of infectious diseases, at a certain moment  , nodes with large quarantine degrees or edges of nodes with large cutting degrees were an effective intervention to reduce the risk of transmission and reduce the final scale [22] . Then, under the full quarantine measure, the infectious disease   + t k E based on the dimensionality reduction SIR network average field dynamics model initially could meet the following conditions: In the equation (18) represented the relative density of susceptible nodes with degree k before and after quarantine time t, respectively; and c k indicated the maximum degree of the remaining nodes after the complete quarantine measure was taken. Under the incomplete quarantine measure, the infectious disease based on the dimensionality reduction SIR network average field dynamics model initially could meet the following condition: Under the incomplete quarantine measure, the final scale and epidemic time of the infectious disease based on the average field dynamics model of the dimensionality reduction SIR network can be given as below equation: In equation above, * 2  represented the probability that the infection did not occur on any one side at the end of the infectious disease under the incomplete quarantine measure. Then, the calculation method of the quarantine time threshold with the final scale can be written as below equation: In the PHEs decision-making process, the set of factors that affect the decision-making and the degree of mutual influence between the factors and the corresponding matrix transformation have a certain impact on the formulation of the emergency plan [23] . It was assumed that the plan set of the polymorphic and multi-attribute risk PHEs emergency decision-making was   In the equation (23) Cost type and benefit type are one of the common attributes of samples [24] . In order to increase comparability in the decision-making model process of dealing with public health events, the decision-making process matrix should be standardized. The related attributes can be divided to form an attribute set, and then according to the law of the development of each event within the attribute range, the correlation between the elements is greatly affected, and the sudden point of the format file is expressed in the form of a set. In the PHEs multi-attribute emergency decision-making problem, the traditional method of maximizing dispersion is most commonly used to solve the attribute weights [25] . (25) In the actual response to PHEs, the timelier the emergency response is, the better the loss can be prevented from spreading. How to reasonably and accurately determine the time weights of different stages of emergency response based on the existing public health event information is of vital importance to the evaluation of various emergency options and the priority of the alternatives. Therefore, the time degree can be reasonably used to find the time weight vector [27] . For the state S of PHEs with attribute H, it should calculate the probability in different states, establish the corresponding processing scheme O according to the state S and probability P, and process the scheme information into a curved interval as the amount of data in the decision-making process. The specific process of establishing multi-attribute emergency decision-making for PHEs based on IDD model was shown in The topological structure of the network and the basic reproductive numbers and final scale of the outbreak and epidemic of heterogeneous infectious diseases have a great influence on the characteristics of infectious diseases. When a uniform Poisson degree distribution (P) or a heterogeneous power rate distribution (L) is assigned to each sub-network, 4 coupling networks (PP, LP, PL, and LL) are obtained. According to the cumulative foreground (CF) theory, the foreground value of the emergency decision plan for PHEs is correlated with the value function [28] . The calculation method of the value function was given in below equation: In the equation (25) above, y  was the state of loss or gain of the decision-making plan relative to the decision maker at the reference point, 0 y  meant gain, and 0 y  meant loss; α, β, and χ referred to the risk aversion coefficient, the risk preference coefficient, and the comparative gain, respectively. The COVID data used in this study came from the specific data reported by the National Health Commission of China (NHCC). As of September 2020, it has reported a total of 85,022 confirmed cases, a total of 80,126 cured and discharged cases, and a total of 4,634 deaths in China. The number of J o u r n a l P r e -p r o o f national statistical cases is used as the data of the PHEs scenario deduction study. According to the development characteristics of COVID, the relevant processing methods were adopted to provide a basis for judgment, the scene of the epidemic was divided to extract the key elements that affected the development of the epidemic to facilitate the deduction of its development trend, which is more conducive to analyzing and judging the correlation between the elements. According to the development process of the epidemic situation, the situation status is mainly divided into five scenarios: citizen health, fever, positive nucleic acid test, citizen and family quarantine observation, and infection in the associated area of the citizen. In addition, corresponding emergency measures will be given according to different scenarios. The specific scenario elements of the epidemic situation were shown in Figure 4 . . Three decision-making methods were initially established to solve the emergency decision-making problem. Option 1 (O1) is to dispatch a medical team and an emergency ambulance; Option 2 (O2) is to add a medical expert and corresponding emergency medical equipment on the basis of O1; and Option 3 (O3) is to send a medical emergency protection team on the basis of O2. The effectiveness of the options was evaluated based on the health status of the patients, the number of rescuers, the cost of rescue equipment and manpower. The data was analyzed using Wuhan COVID data as an emergency decision plan. The clustering coefficient O was used to characterize the quarantine intensity of household quarantine measures in the COVID outbreak. The greater the intensity of the quarantine measure, the greater the O value. According to national statistics, domestic quarantine is mainly divided into 10 categories under the [29] . When each family was completely isolated, the clustering O of the network can be expressed as equation (25) below:     10 10 3 3 k k 3 3 10 10 3 3 k k 3 3 3 0.5 2 In the equation above, k F was the number of households with k family members, O referred to the number of combinations, and M represented the number of permutations. After corresponding emergency measures were adopted in each situation, it had to judge whether the corresponding result was true or not. The probability in different states was calculated according to the probability calculation equations of the Bayesian scenario deduction model. At the same time, according to the correlation between the knowledge element structure of the scenario mode, the probability distribution diagram of the DBN scenario state was established ( Figure 5 ). The target values obtained by taking corresponding emergency measures under different conditions showed great differences. After the M1 measure was used to process the S1 state, the state where the result was true (T) showed the highest probability and over probability, which were 91.05 and 90.21, respectively. The statistical analysis was performed on the level of the epidemic, the number of infections, and the loss of the plan at different stages of the development of the epidemic, and the results were given in Figure 6 . In the face of the continuous development of the epidemic, the level of the epidemic had become more and more serious, and the number of infections and the value of the meta loss had also shown a clear upward trend. In the sixth stage of the development of the epidemic, the epidemic had reached level 5, and the number of infections per person was about 26. The maximum loss value was estimated to be around 220 million yuan. The same scheme (O1 ~ O3) was adopted for each stage of the epidemic development, and the CF values of different schemes were calculated and analyzed according to the CF decision algorithm (Figure 7) . The comprehensive CF values of O1 ~ O3 were -1.34, -1.21, and -0.77, respectively, and the corresponding final CF values were -1.35, 0.01, and -0.08, respectively. The final CF value of O2 was higher obviously than that of the other two options. The number and distribution of patient cases in the first 28 days of the COVID outbreak were statistically analyzed, and the statistical results were shown in Figure 8A . As time went by, the number of COVID pathological patients in Wuhan and across the country had shown an obvious upward trend. At the 28 th day of the outbreak, the number of confirmed cases nationwide was 5895, of which number of patients in Wuhan accounted for more than 95%. Under the quarantine measure, statistical analysis was performed on the infection probability of 10 different types of patients according to the family distribution of the National Bureau of Statistics [30] , and the results ( Figure 8B ) suggested that the household infection probability was the highest, which was 0.37 and 0.35 in Wuhan and China, respectively. Wuhan after quarantine measures were adopted) The model established in this study was used to iterate 10,000 times to perform a fitting analysis on the number of confirmed COVID cases. As illustrated in Figure 9 , under the measures of home quarantine, the number of confirmed COVID cases in China and Wuhan showed a good fitting result with the real data. According to the fitting results, the reproductive numbers of Wuhan and China were 1.503 (95% confidential interval (CI) = 1.328 ~ 1.518) and 1.729 (95% CI = 1.107 ~ 1.264), respectively. The data of confirmed COVID cases was analyzed based on the model, and the results were shown in Figure 10 . The model predicts that on the 4th day after one cycle (10 days) after the quarantine measure was taken, there was a clear peak in Wuhan COVID outbreak cases. The peak value of confirmed cases was 7996 cases, and the model predicted a value of 7822 cases (95% CI = 7167 ~ 8255). Across the country, there was a clear peak of COVID outbreak cases on the 11 th day 10 days after the adoption of quarantine measures. The peak value of confirmed cases was 24495, and the model prediction value was 24085 (95% CI = 23988 ~ 25056). To verify the effectiveness of emergency measures, the key parameter incubation period 1/q that reflected the detection intensity was analyzed, as revealed in Figure 11 . After effective quarantine measures were taken, when the incubation period 1/q was shortened from 8 days to 3 days, the number of confirmed cases showed an upward trend, and the peak period of confirmed cases was advanced, shortening the overall epidemic time. The relationship between the cumulative confirmed cases in Wuhan and the incubation period) PHEs have had a serious impact on the global public health and economy, and have seriously hindered the development of human society. Scenarios are the fundamental basis for establishing public health emergency decision-making [30] . Predicting PHEs in a complex environment and establishing corresponding emergency decision-making are of great significance to the management and development of public health incidents [31] . In this study, a DBN scenario prediction model was established based on the evolution of public health event scenarios and their main factors. The development of the COVID epidemic was undertaken as an example to analyze the process and key technologies of the scenario deduction of the COVID epidemic. The suddenness of PHEs leads to the complicated and changeable evolution process, and it is impossible to establish an effective case library in a short period of time. Therefore, the expert evaluation scoring method should be the main method, and the average value of the evaluation results of multiple experts should be used as the final result [32] . Based on historical data and expert knowledge, the conditional probability of each node variable was determined, and the probability distribution diagram of the state deduction in different states was analyzed. It was found that the target value obtained by taking corresponding emergency measures in different states showed great differences. After the M1 measure was used to process the S1 state, the state where the result was true (T) showed the highest probability and over probability, which were 91.05 and 90.21, respectively. Deduction of the probability distribution map based on the state can A B J o u r n a l P r e -p r o o f predict the development trend of the epidemic, and make reasonable adjustments and optimizations, which can make the situation and state emergency measures more ideal [33] , and can effectively block the development and spread of the epidemic. COVID can infect humans across populations, and its clinical symptoms mainly include cough, fever, and dyspnea, and its respiratory tract infection lasts in the range of 2 -14 [34] . Under normal circumstances, the establishment of a public emergency response strategy model requires the analysis of the characteristics, mechanism, dynamics, and other parameters and variables of the spread of infectious diseases; the parameters can be estimated based on actual data and model fitting, so as to determine the basic reproductive numbers of infectious diseases, peak scale, peak time, final scale, and epidemic time. Factors such as the number of medical staff, the number of infected people, and the scope of the area all exert great impacts on the emergency decision-making of public health events [35] , so it is necessary to collect statistics on epidemic-related data, and analyze the related data sets based on these data. The results in the article showed that with the continuous development of the epidemic, the level of the epidemic had become more and more serious, and the number of infected persons and the value of yuan loss had also shown a significant upward trend; in the sixth stage of the development of the epidemic, the epidemic reached level 5. The number of infections per person was about 26, and the estimated loss was about 220 million yuan. The comprehensive CF values of O1 ~ O3 were -1.34, -1.21, and -0.77, respectively, and the final CF values were -1.35, 0.01, and -0.08, respectively. The final CF value of O2 was significantly higher than the other two solutions. Such results suggest that the O2 shows obvious advantages compared with the other two options. The results of this study proved that after effective quarantine measures were taken, when the incubation period 1/q was shortened from 8 days to 3 days, the number of confirmed cases showed an upward trend, and the peak period of confirmed cases was advanced, shortening the overall epidemic time. Therefore, effective suggestions can be provided for the prevention and control of COVID based on latency time. In this study, a scenario deduction model of DBN was established based on DBN and PHEs scenario evolution law, and an IDD model was established based on SIR network; some parameters such as epidemic scale, epidemic time, and thresholds were determined and applied to the development and scenario prediction of the COVID epidemic. It was found that the prediction result of scenario deduction based on DBN was basically consistent with the actual development scenario and development status of the epidemic, and it could provide corresponding decisions for COVID prevention and control based on the relevant parameters of the IDD model. However, there were some shortcomings in this study. It failed to further analyze the stability of the established model and compare it with other models. In future work, it will continue to analyze the stability of the two established models and compare them with other public health event scenario deduction models. In short, the PHEs scenario deduction model established in this study was reasonable and feasible, which provided a reference basis for the establishment of PHEs decision-making programs. COVID-19, a worldwide public health emergency Public health emergency preparedness: a framework to promote resilience Responding to Covid-19: How to Navigate a Public Health Status and Challenges of Public Health Emergency Management in China Related to COVID-19. Front Public Health Public health emergency or opportunity to profit? The two faces of the COVID-19 pandemic Obesity is a public health emergency Emerging novel coronavirus (2019-nCoV)-current scenario, evolutionary perspective based on genome analysis and recent developments Viboud C; Isaac Newton Institute IDD Collaboration. Modeling infectious disease dynamics in the complex landscape of global health The SEIRS model for infectious disease dynamics Perceived infection transmission routes, infection control practices, psychosocial changes, and management of COVID-19 infected healthcare workers in a tertiary acute care hospital in Wuhan: a cross-sectional survey Epi-GTBN: an approach of epistasis mining based on genetic Tabu algorithm and Bayesian network Situational awareness and information flow in prehospital emergency medical care from the perspective of paramedic field supervisors: a scenario-based study Spatial Lifecourse Epidemiology and Infectious Disease Research Addition of an Emotionally Stable Node in the SOSa-SPSa Model for Group Emotional Contagion of Panic in Public Health Emergency: Implications for Epidemic Emergency Responses Current Status of Epidemiology, Diagnosis, Therapeutics, and Vaccines for Novel Coronavirus Disease 2019 (COVID-19) Bayesian Algorithm for Retrosynthesis Dynamic Human Error Assessment in Emergency Using Fuzzy Bayesian CREAM Tooling-up for infectious disease transmission modelling Responding to Covid-19: How to Navigate a Public Health Emergency Legally and Ethically An open-data-driven agent-based model to simulate infectious disease outbreaks Dynamic population flow based risk analysis of infectious disease propagation in a metropolis Spatial infectious disease epidemiology: on the cusp Dynamic Models of Infectious Disease Transmission in Prisons and the General Population Dynamic analysis of the recurrent epidemic model The research and development process for multiscale models of infectious disease systems Multiscale model for the optimal design of pedestrian queues to mitigate infectious disease spread Spread of Infectious Disease Modeling and Analysis of Different Factors on Spread of Infectious Disease Based on Cellular Automata Dynamic modelling of the impact of public health education on the control of emerging infectious disease The Use and Underuse of Model Systems in Infectious Disease Ecology and Evolutionary Biology Model-informed COVID-19 vaccine prioritization strategies by age and serostatus Centre for Mathematical Modelling of Infectious Diseases COVID-19 working group. Early dynamics of transmission and control of COVID-19: a mathematical modelling study Modeling the effect of literacy and social media advertisements on the dynamics of infectious diseases Pathophysiology, Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review Coronavirus disease 2019 (COVID-19) pandemic and pregnancy Coronavirus disease 2019 (COVID-19): current status and future perspectives There is no conflict of interest.