key: cord-0039255-kwikwxge authors: Chen, Qi; Rui, Jia; Hu, Qingqing; Peng, Ying; Zhang, Hao; Zhao, Zeyu; Tong, Yeqing; Wu, Yang; Su, Yanhua; Zhao, Benhua; Guan, Xuhua; Chen, Tianmu title: Epidemiological characteristics and transmissibility of shigellosis in Hubei Province, China, 2005 – 2017 date: 2020-04-07 journal: BMC Infect Dis DOI: 10.1186/s12879-020-04976-x sha: 699071766da4f82ab66f8c132275b021c46cac72 doc_id: 39255 cord_uid: kwikwxge BACKGROUND: Shigellosis is one of the main diarrhea diseases in developing countries. However, the transmissibility of shigellosis remains unclear. METHODS: We used the dataset of shigellosis cases reported between January 2005 and December 2017, from Hubei Province, China. A mathematical model was developed based on the natural history and the transmission mechanism of the disease. By fitting the data using the model, transmission relative rate from person to person (b) and from reservoir to person (b(w)), and the effective reproduction number (R(eff)) were estimated. To simulate the contribution of b and b(w) during the transmission, we performed a “knock-out” simulation in four scenarios: A) b = 0 and b(w) = 0; B) b = 0; C) b(w) = 0; D) control (no intervention). RESULTS: A total of 130,770 shigellosis cases were reported in Hubei province, among which 13 cases were dead. The median annual incidence was 19.96 per 100,000 persons (range: 5.99 per 100,000 persons – 29.47 per 100,000 persons) with a decreased trend (trend χ(2) = 25,470.27, P < 0.001). The mean values of b and b(w) were 0.0898 (95% confidence interval [CI]: 0.0851–0.0946) and 1.1264 × 10(− 9) (95% CI: 4.1123 × 10(− 10)–1.8416 × 10(− 9)), respectively. The “knock-out” simulation showed that the number of cases simulated by scenario A was almost the same as scenario B, and scenario C was almost the same as scenario D. The mean value of R(eff) of shigellosis was 1.19 (95% CI: 1.13–1.25) and decreased slightly with a Linear model until it decreased to an epidemic threshold of 0.99 (95% CI: 0.65–1.34) in 2029. CONCLUSIONS: The incidence of shigellosis is still in high level. The transmissibility of the disease is low in Hubei Province. The transmission would be interrupted in the year of 2029. Shigellosis, caused by Shigella spp which are Gramnegative bacteria, is a fecal-oral transmission disease with a symptom of severe colitis and induced by a very low infectious dose [1, 2] . In developing countries, Shigella spp are one of the top four attributable causes of moderate-severe diarrhea in children younger than 5 years. Shigellosis is an illness that kills roughly 750,000 people per year [3] [4] [5] . Although the incidence of shigellosis has declined remarkably due to the rapid improvements in water supply and sanitation in China [6] , a considerable disease burden still exists and is unevenly distributed across the country [7] [8] [9] . Because of the different transmission conditions resulted from socialeconomic factors such as water supply and hygiene practices in different areas, the transmission of shigellosis is heterogeneous and is still a major public health problem in many areas in China [7, 10, 11] . The transmission interruption has a public importance in China as well as many developing countries. Generally, whether an infectious disease spreads or not depends on the size of transmissibility or "transmission force" behind the transmission. However, to our knowledge, there is no study investigating the transmissibility of shigellosis. Therefore, an understanding the transmissibility of shigellosis is of great significance to ensure a better disease prevention and control. The transmissibility of an infectious disease is commonly quantified by basic reproduction number (R 0 ), which is defined as the expected number of secondary infections that result from introducing a single infected individual into an otherwise susceptible population [12] [13] [14] [15] . However, because of the disease intervention or the decreasing proportion of susceptible individuals due to the spread of the pathogen, it is hard to estimate R 0 . Therefore, effective reproduction number (R eff ) is commonly employed instead, with the definition of the number of new cases which was produced by a typical case during the period of infection [16, 17] . From the definition, it is clear that when R eff > 1, the disease is able to spread in the population. If R eff < 1, the infection will be cleared from the population. Therefore, R eff = 1 is considered as the "epidemic threshold". Although statistical model, like Bayesian model, was employed to explore the epidemiological characteristics [18] , there is few research to simulate and forecast the population-based transmission [19] . To explore the transmissibility, we have performed a study to estimate the relative transmissibility of shigellosis among male and female individuals in Hubei Province, China [20] . And we found that the transmissibility of shigellosis was different in females was higher than that of males [20] . However, we still do not know the values of R eff and their trends. In this study, we used the dataset of reported shigellosis cases collected in Hubei Province, China, developed a mathematical model to fit the data, estimated the values and trends of R eff , and forecasted the transmission interruption year of the disease. Hubei Province, located in the center of China and north of the Dongting Lake, has a population of more than 58 million. Hubei is known as the "province of thousands of lakes". Except the main stream of the Yangtze River and Han River, there are 4228 rivers and 755 lakes in the province. The large water areas provide an easy spread of Shigella spp. Therefore, it is of great value to study shigellosis transmissibility in Hubei Province, China. In this study, we used the dataset of reported shigellosis cases that was built from January 2005 to December 2017 in the province. The sex, age, illness onset date, and diagnosis date of each case was included in the data. Incidences of 12 urban areas (including 7 central districts in Wuhan City and 5 districts in Yichang City) and 14 rural areas (including 5 outer suburbs districts in Wuhan City, and 8 counties or county-level cities in Yichang City) from 2005 to 2017 were collected to compare the difference between urban and rural areas. All the cases were collected from the China Information System for Disease Control and Prevention. All the cases were identified following the case definitions from "Diagnostic criteria for bacterial and amoebic dysentery (WS287-2008)" announced by the National Health Commission of the People's Republic of China: In this study, we only included clinically diagnosed cases and confirmed cases for the analysis. In this study, two models were developed according to the transmission routes of the disease. According to our previous research [14, 19] , we firstly built a susceptible-exposed-symptomatic-asymptomatic-recovered-water/food (SEIARW) model (denoted as Model 1) in which two routes (person-to-person and reservoir-to-person) were considered. For the route of person-to-person, two routes (symptomatic-to-susceptible and asymptomatic-to-susceptible) were both considered. In the model, people were divided into susceptible (S), exposed (E), symptomatic (I), asymptomatic (A), and recovered (R) individuals. The reservoir, including water or food, was denoted as W. In order to normalize the dimensions of people and Shigella spp population, we set s = S/N, e = E/N, i = I/N, a = I/A, r = R/N, w = εW/μN, b = βN, and b w = μβ W N/ε. In the model, N is assumed to denote the total population. The parameter β is the transmission relative rate from person to person, β W is the transmission relative rate from reservoir to person, k is the relative transmissibility of asymptomatic to symptomatic individuals, ω is the incubation relative rate, p is the proportion of asymptomatic individuals, γ is the infectious period relative rate of symptomatic individuals, γ' is the infectious period relative rate of asymptomatic individuals, ε is the relative rate of the pathogen's lifetime, c is the shedding rate of the asymptomatic comparing to the infectious, and μ is the pathogen shedding coefficient of infectious individuals. The equations of the model are as follows: We also developed a susceptible-exposed-symptomaticasymptomatic-recovered (SEIAR) model (denoted as Model 2) which only includes the transmission route of personto-person [21] [22] [23] . The equations of the model are as follows: In the model, the effective reproduction number (R eff ) was calculated by the equation as follows: There are eight parameters (b, b w , k, ω, p, γ, γ', c, and ε) in the above models. According to our previous research [19] , k, ω, p, γ, γ', c, and ε are disease-specific parameters which could be estimated from literatures. The incubation period of shigellosis is 1-4 days [24, 25] , and most commonly 1 day, therefore, ω = 1.0. The proportion of asymptomatic infection ranges from 0.0037 to 0.27 [26] [27] [28] , and it can be set p = 0.1. The infectious period of symptomatic infection is 13.5 days [19] , therefore, γ = 0.0741. According to our previous research [19] , the infectious period of asymptomatic infection could be simulated 5 weeks in our model, thus γ' = 0.0286. Because of that the die-off rate of Shigella spp is about 24.5 h [29] , ε = 0.6931 was consequently simulated [19] . In our previous research [19] , a typical case has diarrhea about 3.2 times (range 3-12 times) per day but an asymptomatic individual only sheds stool once per day, thus c = 0.3125. Due to reduction of shedding frequency, the relative transmissibility of asymptomatic individual (k) was modeled to be a reduced quantity as 0.3125 [19] . However, b and b w are scenario-or area-specific parameters which depend on different outbreaks and periods. Therefore, these two parameters are confirmed by curve fitting by Models 1 and 2 to the collected data. To simulate the contribution of b and b w during the transmission, we performed a "knock-out" simulation which was based on the following four scenarios: A) b = 0 and b w = 0; B) b = 0; C) b w = 0; D) control (no intervention). was employed for model simulation and least root mean square was adopted to assess the goodness of fit. The simulation methods were the same as the previously publications [12, 14, 19, 30, 31] . The chi-square and trend chi-square tests and t test were performed by SPSS 13.0 (IBM Corp., Armonk, NY, USA). Eleven equations (Linear, Logarithmic, Inverse, Quadratic, Cubic, Compound, Power, S, Growth, Exponential, Logistic) were employed to fit the data of R eff which was calculated by the reported data. The equations of the 11 models were shown as follows: Fig. 1 Epidemiological characteristics of Shigellosis cases from 2005 to 2017 in Hubei Province, China. a, reported cases and incidence of Shigellosis cases; b, sex distribution; c, age distribution In the equations, x and f(x) refer to time (year) and dependent variables (R eff ), respectively; b 0 , b 1 , b 2 , b 3 , and u refer to the coefficients of the models which were estimated by curve fitting with the data. Determination coefficient (R 2 ) was employed to evaluate the curve fitting. From January 1, 2005 to December 31, 2017, 130,770 Shigellosis cases were reported in the province. Among them 13 cases were dead with a case fatality rate of 0.01%. The median yearly reported incidence was 19.96 per 100,000 persons (range: 5.99 per 100, 000 persons -29.47 per 100,000 persons). Figure 1a showed that the number of reported cases and reported incidence yearly were both decreased significantly (trend χ 2 = 25,470.27, P < 0.001). Of all the cases, male cases were account for 56.57% (73,981/130770) which was significantly higher than that of female cases (χ 2 = 2255.19, P < 0.001). This incidence difference between male and female was observed in each year (Fig. 1b) . Children cases with an age of 10 years and younger had the highest case proportion of 39.30%, with the "0 -1" years old children or new-born cases were account for 61.33% (Table 1 ). The second age group with highest case proportion was "11 -20" years (15.20%) followed by "61 -" years (14.51%). The distribution of incidence of age group was similar in different years (Fig. 1c) . The median duration from illness onset date to diagnosed date (D ID ) of all the cases was 1.7 days (inter-quartile range [IQR]: 2.3 days). Each year had a similar distribution of D ID (Fig. 2) . The D ID of 71.24% cases were lower than 4 days, especially lower 1 days (24.43%) and 2 days (30.87%). Among the 17 sub-regions of Hubei Province, Wuhan City had the most reported cases (39.88%) and the highest cumulative incidence during the past 13 years. The second highest number of cases was reported in Yichang City (8.66%) and Jingzhou City (8.24%). However, the rank of reported incidence was very different. Xiantao City and Yichang City had the second and third highest incidence, respectively. Shennongjia, which is a forest region, had the least reported cases (0.05%) and the lowest reported incidence ( Table 2) . The mean reported incidence of shigellosis was 58.53 per 100,000 in urban areas (95%CI: 51.91 per 100,000-65.15 per 100,000) and 14.10 per 100,000 in rural areas (95% CI: 11.34 per 100,000-16.86 per 100,000) (Fig. 3) . The difference of the incidences was statistically significant between urban and rural areas (t = 12.1345, P < 0.001). The results of curve fitting (Fig. 4) showed that Model 1 fitted well to the data (χ 2 = 0.00015, P > 0.999). Calculated by the model, the mean value of b was 0.0898 (95% CI: 0.0851-0.0946) and b w was 1.1264 × 10 − 9 (95% CI: 4.1123 × 10 − 10 -1.8416 × 10 − 9 ) from 2005 to 2017 in the province. The difference between the two parameters was as much as more than seven orders of magnitude in each year ( Table 3) . Results of the "knock-out" simulation (Fig. 5) showed that the number of cases simulated by scenario A (b = 0 and b w = 0) was almost the same as that simulated by scenario B (b = 0), and that the number of cases simulated by scenario C (b w = 0) was the almost same as that simulated by scenario D (control), (Fig. 6) . The mean value of R eff was 1.08 (95% CI: 0.83-1.34) in the year of 2005. It reached a peak value of 1.39 (95% CI: 1.14-1.65) in 2006. Although it had a lowest value in 2010, it decreased slightly yearly (Table 4 ). By fitting the 11 equations with R eff calculated by the SEIAR model with the reported data, the three most fitting models were Cubic, Linear, and Quadratic ( Table 5 ). The fitted results were shown in Fig. 7 . The Linear and Quadratic models forecasted that the mean value of R eff would decrease yearly. To our knowledge, this is the first study to estimate the long-term transmissibility and forecast the R eff trend of Shigellosis in China. Located in the lake region, Hubei Province has a high burden of shigellosis and provides a large data for the modelling study. Our model developed in Hubei Province could also be used to study the transmission of the shigellosis in other places. Therefore, our study has an important significance to provide a reference for deeply understanding the transmission characteristics of the disease. In our study, the SEIARW model was employed to fit the epidemic curves of the reported data, the results of Chi-square test showed a high good-of-fitness of our model to the reported data, suggesting that the model is suitable for this study and can be used to estimate the transmissibility of the disease. In this study, we found that the incidence of the disease was still in a high level although with a decreasing trend. This finding is similar to the epidemiological characteristic in many areas in China [9] [10] [11] 18] . Our findings of sex and age distribution showed that: 1) male individuals were more likely to be infected by Shigella spp than females; 2) individuals who were infants or old were more likely to be infected by the pathogen. These findings are similar to the epidemiological characteristics of shigellosis [24, 32, 33] . The intervention focusing on male, children, newborns and elders might be more effective to prevent and control the disease. It is also important to monitor the disease among those populations. Our findings of area distribution showed that the incidence of the disease had a high heterogeneity in different sub-regions of Hubei Province. Shigellosis had a high transmission in large cities such as Wuhan City (the capital city of the province) and Yichang City (which has a population more than 400 million) or cities near Wuhan City such as Xiantao City. We also found that the incidence of shigellosis in urban areas was higher than that in rural areas. It might due to the higher population density in urban areas. However, the transmissibility remains unclear in urban and rural areas and further research is needed to explore the force of the transmission in different areas. About a half of the cases were diagnosed in 1.7 days from their illness onset date. However, this also meant that about 50% of the cases were diagnosed after 1.7 days from their illness onset date. There is a room to improve the surveillance system's ability of diagnosing the disease immediately and to shorten the D ID of Shigellosis cases. The "knock-out" simulation showed that parameter b w had almost no contribution during the transmission which meant that the transmission route of reservoir to person was interrupted in the area. This is mainly attributed to the government's progress of improving sanitation drinking water and lavatories, food safety, and people's health behaviors like drinking boiled water. Therefore, the remained major transmission route is person to person which will be the main public health focus to control the disease. This result also provided a mathematical evidence of the epidemiological characteristics of shigellosis that the pathogen was transmitted primarily through person-to-person [24] . Therefore, a combined strategy of case management would be strongly recommended including case isolation and treatment, environment disinfection, surveillance system improvement, and individuals' health behaviors improvement like hand hygiene. Our results showed that the mean value of R eff of Shigellosis was 1.19, meaning that the expected number of secondary infections that resulted from introducing a single infected individual into an otherwise susceptible population was only 1.19 on average. This revealed that the transmissibility of shigellosis is lower than many other infectious diseases such as influenza, Ebola virus disease, and Norovirus infection [12, 14, 15, 34] . In addition, the transmissibility of Shigellosis decreased slightly from 2005 to 2017. Although the Linear, Quadratic, and Cubic models had the best goodness of fit, the 95%CIs of Quadratic model and Cubic model were not stable after the year 2019 (Fig. 7) . Therefore, the Linear model was recommended to predict the trends of the transmissibility. The Linear model predicted that the R eff would down to an epidemic threshold (R eff = 1.00) in 2029. This was an interesting result which meant that the disease would probably be eliminated after the year of 2029. But this result was based on the assumption that the transmission condition would remain stable in the future years. Therefore, this trend might be different in other provinces. Hubei Province is located in a lake region with many cities suffered from floods frequently. The transmissibility of Shigellosis would be increased after the flood and provides a potential transmission route of water to person. On the other hand, the year of the transmission interrupted might be ahead of Limited by the availability of data, there are several limitations in this study. Firstly, some environment factors (temperature, humidity, and rainfall) which might affect the spread of the disease were not considered in the model. Secondly, due to the transmission heterogeneity, the features and trends of the transmissibility of the disease might be different in different areas in Hubei Province. Thirdly, our study only focused in Hubei Province, the characteristics and the trends of transmissibility of shigellosis might be different in other provinces. The applicability of the mathematical models remains unclear to different datasets of reported shigellosis. However, Hubei Province has a high burden of shigellosis and provides us a large data for the modelling. The ordinary differential equation model has a strong applicability. It could be used in different populations including school or community with small-scale outbreaks [14, 19] , and in different diseases such as shigellosis, norovirus infection, Ebola virus disease, and influenza [12, 14, 19, 31] . In addition, we did not simulate the sensitivity and specificity improvement of the surveillance system by the model. More researches are needed to perform the simulation of shorten D ID in different scenarios. The disease burden of Shigellosis is still in a high level in Hubei Province, China. Individuals who were Abbreviations CI: Confidence interval; D ID : Duration from illness onset date to diagnosed date; IQR: Inter-quartile range; SEIAR: Susceptible-exposed-symptomaticasymptomatic-recovered; SEIARW: Susceptible-exposed-symptomaticasymptomatic-recovered-water/food Intercontinental dissemination of azithromycin-resistant shigellosis through sexual transmission: a crosssectional study Global, regional, and national causes of child mortality: an updated systematic analysis for 2010 with time trends since Burden and aetiology of diarrhoeal disease in infants and young children in developing countries (the global enteric multicenter study, GEMS): a prospective, case-control study Shigellosis: epidemiology in India Regional disparities in the burden of disease attributable to unsafe water and poor sanitation in China Spatiotemporal pattern of bacillary dysentery in China from 1990 to 2009: what is the driver behind? Trend and disease burden of bacillary dysentery in China Patterns of bacillary dysentery in China An 11-year study of shigellosis and Shigella species in Taiyuan, China: active surveillance, epidemic characteristics, and molecular serotyping Spatiotemporal Characteristics of Bacillary Dysentery from Risk of imported Ebola virus disease in China Perspectives on the basic reproductive ratio Evidence-based interventions of Norovirus outbreaks in China The transmissibility estimation of influenza with early stage data of smallscale outbreaks in Changsha, China Hand, foot, and mouth disease in China: patterns of spread and transmissibility Seasonality of the transmissibility of hand, foot and mouth disease: a modelling study in Xiamen City Spatiotemporal Risk of Bacillary Dysentery and Sensitivity to Meteorological Factors in Hunan Province, China Investigation of key interventions for shigellosis outbreak control in China The relative transmissibility of shigellosis among male and female individuals in Hubei Province, China: a modelling study. bioRxiv Transmissibility of the influenza virus during influenza outbreaks and related asymptomatic infection in mainland China Evaluating the effects of common control measures for influenza a (H1N1) outbreak at school in China: a modeling study Outbreak detection and evaluation of a school-based influenza-like-illness syndromic surveillance in Tianjin, China. PloS one A school outbreak of Shigella sonnei infection in China: clinical features, antibiotic susceptibility and molecular epidemiology Risk factors for secondary transmission of Shigella infection within households: implications for current prevention policy Asymptomatic salmonella, Shigella and intestinal parasites among primary school children in the eastern province Detection of intra-familial transmission of shigella infection using conventional serotyping and pulsed-field gel electrophoresis Comparative survival of indicator bacteria and enteric pathogens in well water Simulation of key interventions for seasonal influenza outbreak control at school in Changsha The effectiveness of age-specific isolation policies on epidemics of influenza a (H1N1) in a large City in central South China Identification and management of Shigella infection in children with diarrhoea: a systematic review and meta-analysis Changing patient population in Dhaka Hospital and Matlab Hospital of icddr,b The transmissibility and control of pandemic influenza a (H1N1) virus Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations We thank the staff members at the hospitals, local health departments, and municipal-and county-level CDCs for their valuable assistance in coordinating data collection. Authors' contributions TC, QC, and JR designed the study. QC, YP, HZ, YT, YW, and XG collected data. TC, JR, QC, ZZ, YS, and BZ and performed the analysis. TC, QC, and QH wrote the first draft of this paper. All authors contributed to the writing of the manuscript. The author(s) read and approved the final manuscript. The datasets used and analyzed during the current study are available from Dr. Qi Chen (chenqi8700@qq.com) on reasonable request. This effort of outbreak control and investigation was part of CDC's routine responsibility in Hubei Province; therefore, institutional review and informed consent were waived by Medical Ethics Committee of Hubei Center for Disease Control and Prevention on the following grounds: (1) all data analyzed were anonymized; (2) neither medical intervention nor biological samples were involved; (3) study procedures and results would not affect clinical management of patients in any form. Not applicable. The authors declare that they have no competing interests.