key: cord-1017711-b7p92sb1 authors: Syed, Faiza; Sibgatullah, Syed title: Estimation of the Final Size of the COVID-19 Epidemic in Pakistan date: 2020-04-06 journal: nan DOI: 10.1101/2020.04.01.20050369 sha: 300183d862ef809c734535bb79928efa4cdcd208 doc_id: 1017711 cord_uid: b7p92sb1 The COVID-19 infections in Pakistan are spreading at an exponential rate and a point may soon be reached where rigorous prevention measures would need to be adopted. Mathematical models can help define the scale of an epidemic and the rate at which an infection can spread in a community. We used an SIR model to predict the magnitude of the COVID-19 epidemic in Pakistan and compared the numbers with the reported cases on the national database. Our results indicate that Pakistan could hit peak number of infectious cases on May 26th, 2020 and by June 24th, 2020, 90% of the population will have become infected with the virus if policy interventions seeking to curb this infection are not adopted aggressively. The first case of COVID-19 emerged in Pakistan on 26 th February 2020 in Karachi, the most populous city of Pakistan. The patient zero had a travel history to Iran and immediately quarantined upon testing positive for the virus. However, the patient was followed by hundreds of pilgrims returning from Iran which were likely carrying the virus that ultimately led to the spread of COVID-19 into the community. Since then the infections have been increasing exponentially and without proper intervention the situation may escalate enough to overwhelm the already struggling healthcare system in the country. recovered cases (R) are compartments and each individual of a given population will pass through the susceptible phase then to the infected phase and finally to the recovered phase. The SIR model is a steady state model, therefore the population that is analysed is static i.e. no one is being born or is dying. Additionally, the model assumes that once a person is infected, they are immune to the disease and therefore cannot contract it again. The SIR model is ideal for modelling the spread of diseases spread through person to person contact. In this study we aim to model the COVID-19 epidemic in Pakistan, which will indicate the peak infection day, rate of increase of infections per day and a 90-day forecast to assess the final size of the epidemic. The model is built on the following set of assumptions, based on the methodology described by Ronald Ross and William Hammer 1 , is expressed as the following differential equations: The basic assumption of the SIR model is that the total number of susceptible infected and recovered cases at any given time is equivalent to the test population, so the equations can be represented as: The basic reproduction number (R˳), is a ratio between the fraction of individuals susceptible per day (β) and the fraction of recoveries (γ); represented as: The value of R˳ plays a significant role in determining the infectiousness of a certain diseasecausing organism. Therefore, the rate of change in infected individuals is directly dependent on the R˳, given by: . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.04.01.20050369 doi: medRxiv preprint Further assumptions of the SIR model assume that if the R˳ is greater than the ratio of total population and the susceptible cases at time zero then it would imply that the outbreak will turn into a full-fledged epidemic. Similarly, if the R˳ is less than N/S (0), then it would imply that the outbreak will not cause an epidemic. Therefore, the R˳ plays a crucial role in determining the fate of an epidemic. The data was extracted from daily situation reports published on the National Institute of Health (NIH) Pakistan for a period of 33 days, (from 26 February 2020 to 28 th March 2020) and was corroborated with the simulation results. In the case of COVID-19, the value of R˳ is highly variable and varies from country to country. Several sources 2-7 report a range of R˳ values, between 1.4 -3.9, therefore we took an average value of 2.65 for our current analysis. The value of R˳ will continue to evolve as the epidemic progresses throughout the globe; till then, we do not have an exact figure 2 . The value of γ was considered based on the average infectious period for COVID-19, so γ=0.14. The value of β was calculated to be 0.378 from equation 2.2. The S0 was assumed to be 220,000,000 since the entire population of Pakistan is susceptible to COVID-19, as the disease is new and is spreading across all regions of the country. The initial time T was set to be day 1, when the first 2 cases of COVID-19 arrived in Karachi. The simulation was used to predict the number of susceptible, infected and recovered cases for a period of 120 days starting from the day the first cases arrived. . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10.1101/2020.04.01.20050369 doi: medRxiv preprint The NIH Pakistan data for the cumulative cases and daily reported cases was plotted to observe the increase in number of cases for a period of 33 days. The current trajectory suggests an exponential increase in the number of cases since the epidemic started in Pakistan on 26 th February 2020 (figure 5.1). The data on the number of daily reported cases also reflects a general increasing trend as testing for COVID-19 gathers pace (figure 5.2). is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10.1101/2020.04.01.20050369 doi: medRxiv preprint . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10.1101/2020.04.01.20050369 doi: medRxiv preprint (r=0.97, p<0.001) . The modelled data is represented as a Blue line and the reported data is represented as an Orange line. The x-axis represents the number of days, whereas the y-axis represents the number of cases. . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10.1101/2020.04.01.20050369 doi: medRxiv preprint Our study was focused on modelling the COVID-19 epidemic in Pakistan in order to estimate the number of infections, the peak infection day, the rate of increase of infections per day and the resolution of the end-point of the epidemic 1 . The simulation parameters were adjusted according to the population of Pakistan. Our model simulated the conditions where COVID-19 is spreading in a closed population of 220,000,000 people, without the effect of any extraneous variables such as social distancing, hand washing or travel restrictions. The values of R˳=2.65, β=0.378 and γ=0.14 generated data which was quite close to the actual reported cases in Pakistan ( figure 5.4) . Furthermore, the trends of the modelled I and the reported number of infections coincide significantly (r=0.97, p<0.001). Thus, we can safely assume that these results can be used as a predictor of a likely scenario of COVID-19 for Pakistan. According to the simulations the peak infection day will occur on 26 th May 2020, where 5,921,1209 persons could get infected with the virus in a single day. Previous reports from China 8 have indicated that COVID-19 initially follows an exponential growth pattern 9 coupled with asymptomatic carriers 10 , leads to a rapid increase in the number of infections. Correlating the reported data with our simulations, we can assume that if left unchecked, the disease can spread at an unprecedented rate in Pakistan, especially in closely knit communities and densely populated areas. The results of our study also predict that within a time span of 120 days, 90% of the population will have been infected with COVID-19. These include asymptomatic carriers, which could account for approximately 20% of the population 11 and will have the tendency to unknowingly infect others. However, the major concern for Pakistan would be the healthcare system which would not be able to cope with the overwhelming number of patients if the trajectory remains the same. Studies place the mortality rate of COVID-19 at 2.3%, severe cases at 14% and critical cases at 5% 12, 13 , which would imply that potentially 50,000 could die; 30,800,000 cases could become severe and 11,000,000 could become critical during the aftermath of epidemic in Pakistan. Therefore, there is an urgent need to implement effective measures to curb the rise . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.04.01.20050369 doi: medRxiv preprint Our study gives a well-balanced estimate of how the COVD-19 could spread in Pakistan. However, the study has a few limitations, such as the effect of extraneous variables has not been considered in the modelling. This could also serve as a future direction for us to model an epidemic where these factors come into play. In conclusion, the SIR modelling of the COVID-19 in Pakistan revealed that the infection could spread at an exponential rate if proper measures are not taken to reduce its transmission, through safe practices such social distancing, hand-washing and large scale testing of suspected cases in the region. Mathematics of infectious diseases Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Euro surveillance: bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin Time-varying transmission dynamics of Novel Coronavirus Pneumonia in China. bioRxiv Phase-adjusted estimation of the number of Coronavirus Disease 2019 cases in Wuhan Properties of AdeABC and AdeIJK efflux systems of Acinetobacter baumannii compared with those of the AcrAB-TolC system of Escherichia coli. Intergovernmental Panel on Climate Change Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72 314 cases from the Chinese. jamanetwork The reproductive number of COVID-19 is higher compared to SARS coronavirus Presumed asymptomatic carrier transmission of COVID-19. jamanetwork Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship Potential association between COVID-19 mortality and health-care resource availability. The Lancet Global Health Characteristics of and Important Lessons from the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72314 Cases from the Chinese Center for Disease Control and Prevention