key: cord-0835984-dshew6c8 authors: Li, Yi; Liang, Meng; Yin, Xianhong; Liu, Xiaoyu; Hao, Meng; Hu, Zixin; Wang, Yi; Jin, Li title: COVID-19 epidemic outside China: 34 founders and exponential growth date: 2020-10-06 journal: J Investig Med DOI: 10.1136/jim-2020-001491 sha: cd47531fe5f6f10529610076dec0663c26ea08a4 doc_id: 835984 cord_uid: dshew6c8 COVID-19 raised tension both within China and internationally. Here, we used mathematical modeling to predict the trend of patient diagnosis outside China in future, with the aim of easing anxiety regarding the emergent situation. According to all diagnosis number from WHO website and combining with the transmission mode of infectious diseases, the mathematical model was fitted to predict future trend of outbreak. Daily diagnosis numbers from countries outside China were downloaded from WHO situation reports. The data used for this analysis were collected from January 21, 2020 and currently end at February 28, 2020. A simple regression model was developed based on these numbers, as follows: [Formula: see text] , where [Formula: see text] is the total diagnosed patient till the i-th day and t=1 at February 1, 2020. Based on this model, we estimate that there were approximately 34 undetected founder patients at the beginning of the spread of COVID-19 outside China. The global trend was approximately exponential, with an increase rate of 10-fold every 19 days. Through establishment of this model, we call for worldwide strong public health actions, with reference to the experiences learned from China and Singapore. In early December of 2019, pneumonia cases of unknown cause emerged in Wuhan, the capital of Hubei province, China. 1 A novel coronavirus (now named SARS-CoV-2) was verified and identified as the seventh member of the enveloped RNA coronaviruses (subgenus, Sarbecovirus; subfamily, Orthocoronavirinae) using high-throughput sequencing [2] [3] [4] as the cause of the disease, which is referred to as COVID-19. Based on the evidence from early transmission dynamics, human-to-human transmission in hospital and family settings had been accumulating [5] [6] [7] and occurred among close contacts since the middle of December 2019. 8 According to WHO statistics, the accumulated number of diagnosed patients in China on August 08, 2020 was 89,057. 9 COVID-19 raised tension both within China and internationally. Since the first case of COVID-19 pneumonia was reported from Wuhan, COVID-19 was rapidly diagnosed in patients in other Chinese cities and in neighboring countries, including Thailand, South Korea, Japan, and even a few Western countries. [10] [11] [12] On January 13, 2020, the Ministry of Public Health of Thailand reported the first imported case of laboratory-confirmed novel coronavirus . 13 After that, surges in cases of COVID-19 in Italy, Japan, and Iran also heightened fears that the world is on the brink of a pandemic. Therefore, on February 28, the WHO increased the assessment of the risk of spread and impact of COVID- 19 How might these results change the focus of research or clinical practice? ► As a 10-fold increase in patient numbers of COVID-19 every 19 days has been estimated, we call for strong public health actions worldwide. 2020. 9 The USA, Brazil, and India are currently the three most affected countries. 14 Recently, considerable research resource has been devoted to conducting detailed analysis of the spread of the COVID-19 epidemic. 15 16 Several parallel studies have reported that the estimated reproductive number (R0) of COVID-19 is higher than that of SARS, based on different models. [17] [18] [19] Considering the superspreaders (P), hospitalized (H), and fatality class (F), an ad hoc compartmental mathematical model of the COVID-19 has been established to describe the reality of the Wuhan outbreak and predict the daily number of the confirmed cases. 20 Several studies used deep learning to forecast COVID-19 infections. 21 22 The disease transmission model predicted the gravity of COVID-19 in Canada using the long short-term memory (LSTM) networks. 23 Data-driven estimation methods like LSTM and curve fitting were also used to evaluate the number of COVID-19 cases in India for the next 30 days and the effect of preventive measures. 24 Given the limited number of data points and the complexity of the real-life situation, a simple model is expected to be more accurate for describing the spread of the virus (see Discussion section). In this study, we propose a "log-plus" model to predict the situation, which only requires daily number of total diagnoses outside China. This model assumes that there were some unobserved founder patients at the beginning of viral spread outside China and subsequent exponential growth. Despite the simplicity of our model, it fits the data well (R 2 =0.991). This prediction has potential practical and socially applicable significance and provides evidence that can enhance public health interventions to avoid severe outbreaks. Daily numbers of COVID-19 diagnoses in countries outside China were downloaded from WHO situation reports (https://www. who. int/ emergencies/ diseases/ novel-coronavirus-2019/ situation-reports). The data used in this analysis start on January 21, 2020 and end at February 28, 2020. Data were first explored by plotting log-transformed daily case numbers. A linear trend was observed in more recent data, while the fit was relatively poor for earlier time points. The presence of some undetected founder patients at the early time points were considered. Based on exploratory analysis and mathematical intuition, we proposed the following model: where N t is the number of patients diagnosed outside China, according to WHO, on the t-th day, t=1 on February 1; u is the number of unobserved founder patients at the beginning of spread outside China; and a and b are simple linear regression parameters. We enumerated u from 0 to 100, with step size 1. For each u, we calculated Pearson's correlation (R 2 ) between t and log10 ( N t + u ) , and selected the û that maximized R 2 and estimated corresponding â and b , using a simple linear regression between t and log10 ( N t +û ) . The source code of the model is available at: https:// github. com/ wangyi-fudan/ COVID-19_ Global_ Model The WHO daily count of numbers of diagnoses outside China and 'log-plus' transformed data, as well as model fit data, are presented in table 1. According to February 28 data, û , â , and b were estimated as 34, 0.0515, and 2.075, respectively (figure 1). against time to visualize model fitting ( figure 2 ). The R 2 value for the model was 0.991, indicating an excellent fit. The number of COVID-19 diagnoses as of February 28 was 4691. Our model predicts that the number of diagnoses outside China will expand exponentially at a rate of 10-fold every 19 days in the absence of strong public health interventions. In this report, only the total number of diagnoses outside China was analyzed. Country-scale data are also available, but is less complete than the total numbers; hence, we limited our analysis to capture the global trend. This model is a minimal extension of the "default" exponential growth model, using an estimate of 34 undetected founder patients outside China. An almost perfect model fit (R 2 =0.991) indicates that the spread of disease does follow our model. A simple and straightforward linear model has some advantages: (1) it works for small sample sizes, due to limited observation or somewhat imperfect data; (2) it is relatively robust in complex situations, and the virus spreading pattern is complex and varies across the world, hence a simple model can provide coarse-grained trend estimation; and (3) a linear model easier to extrapolate than more complex models (eg, neural networks). The existence of 34 undetected founder patients is not surprising. Actually, founder patients are those patients who are not reported at the beginning (January 22) of WHO reports. Thus, most of them are not under control and continually contribute to the pandemic. These individuals may have had mild symptoms and thus did not attend hospital; however, we do not preclude that they were already present before, or parallel with, the outbreak in Wuhan. Based on this model, we estimate that there were approximately 34 undetected founder patients at the beginning of the spread of COVID-19 outside China. This suggests that the disease stably followed an approximate exponential growth model at the very beginning. This situation is dangerous, as we expect a 10-fold increase in patient numbers every 19 days, in the absence of strong intervention. We call for strong public health actions worldwide, referring to the experiences learned from China and Singapore. The manuscript has been preprinted on the medRxiv (doi: https:// doi. org/ 10. 1101/ 2020. 03. 01. 20029819). It is our pleasure that many researchers and social media care more about the outbreak trend outside China through our manuscript. The results of this article have been read more than 9000 times, picked by seven news outlets, and cited more than 10 times. [25] [26] [27] [28] [29] We reproduced the disease's initial spread to the world, which would impose a positive impact on other countries to pay attention to the development of COVID-19 and take powerful measures in time. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding WHO. WHO Director-General's remarks at the media briefing on 2019-nCoV on A novel coronavirus from patients with pneumonia in China A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster Importation and human-to-human transmission of a novel coronavirus in Vietnam Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modeling study Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia WHO. Coronavirus disease (COVID-19) Situation Report -201 Outbreak of novel coronavirus (COVID-19): what is the role of radiologists? CoVID-19 in Japan: what could happen in the future The estimate of infected individuals of the 2019-Novel coronavirus in South Korea by incoming international students from the countries of risk of 2019-Novel coronavirus: a simulation study WHO. Coronavirus disease 2019 (COVID-19) situation report -38 Forecasting the spreading of COVID-19 across nine countries from Europe, Asia, and the American continents using the ARIMA models The reproductive number R0 of COVID-19 based on estimate of a statistical time delay dynamical system Analysis of the epidemic growth of the early 2019-nCoV outbreak using internationally confirmed cases Simulating the infected population and spread trend of 2019-nCov under different policy by EIR model Early epidemiological assessment of the transmission potential and virulence of 2019 novel coronavirus in Wuhan City: China Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: a data-driven analysis in the early phase of the outbreak Mathematical modeling of COVID-19 transmission dynamics with a case study of Wuhan Deep learning for fading channel prediction Multiple-input deep convolutional neural network model for covid-19 forecasting in China Time series forecasting of COVID-19 transmission in Canada using LSTM networks Prediction for the spread of COVID-19 in India and effectiveness of preventive measures The number of coronavirus cases outside China could jump tenfold every 19 days without 'strong intervention,' a study says The number of coronavirus cases outside China could jump tenfold every 19 days without 'strong intervention,' a study says The effect of Anti-COVID-19 policies on the evolution of the disease: a complex network analysis of the successful case of Greece What can we expect in April A framework for identifying regional outbreak and spread of COVID-19 from one-minute population-wide surveys