key: cord-0860029-ty48b3o2 authors: Li, Menghui; Liu, Kai; Song, Yukun; Wang, Ming; Wu, Jinshan title: Serial interval and generation interval for respectively the imported and local infectors estimated using reported contact-tracing data of COVID-19 in China date: 2020-04-19 journal: nan DOI: 10.1101/2020.04.15.20065946 sha: cdd254cac826bd3f5b82d01f53730c21748820bf doc_id: 860029 cord_uid: ty48b3o2 Backgrounds: The emerging virus, COVID-19, has caused a massive outbreak worldwide. Based on the publicly available contact-tracing data, we identified 337 transmission chains from 10 provinces in China and estimated the serial interval (SI) and generation interval (GI) of COVID-19 in China. Methods: Inspired by possibly different values of the time-varying reproduction number for the imported cases and the local cases in China, we divided all transmission events into three subsets: imported (the zeroth generation) infecting 1st-generation locals, 1st-generation locals infecting 2nd-generation locals, and others transmissions among 2+ generations. The corresponding SI (GI) is respectively denoted as $mbox{SI}^{0}_{1}$ ($mbox{GI}^{0}_{1}$), $mbox{SI}^{1}_{2}$ ($mbox{GI}^{1}_{2}$), and $mbox{SI}^{2+}_{3+}$ ($mbox{GI}^{2+}_{3+}$). A Bayesian approach with doubly interval-censored likelihood is employed to fit the lognormal, gamma, and Weibull distribution function of the SI and GI using the identified 337 transmission chains. Findings: It is found that the estimated $mbox{SI}^{0}_{1}$ = 6.27 (95%CI : 5.62 - 6.98), $mbox{SI}^{1}_{2}$ = 5.92 (95%CI : 5.27 - 6.66), $mbox{SI}^{2+}_{3+}$ = 4.36 (95%CI : 3.65 - 5.20), and $mbox{GI}^{0}_{1}$ = 8.15 (95%CI : 6.73 - 9.91), $mbox{GI}^{1}_{2}$ = 5.26 (95%CI : 4.25 - 6.69), $mbox{GI}^{2+}_{3+}$ = 3.85 (95%CI : 2.18 - 6.72), thus overall both SI and GI decrease when generation increases. Backgrounds: The emerging virus, COVID-19, has caused a massive outbreak worldwide. Based on the publicly available contact-tracing data, we identified 337 transmission chains from 10 provinces in China and estimated the serial interval (SI) and generation interval (GI) of COVID-19 in China. Methods: Inspired by possibly different values of the time-varying reproduction number for the imported cases and the local cases in China, we divided all transmission events into three subsets: imported (the zeroth generation) infecting 1st-generation locals, 1st-generation locals infecting 2nd-generation locals, and others transmissions among 2+ generations. The corresponding SI (GI) is respectively denoted as SI 0 1 (GI 0 1 ), SI 1 2 (GI 1 2 ), and SI 2+ 3+ (GI 2+ 3+ ). A Bayesian approach with doubly interval-censored likelihood is employed to fit the lognormal, gamma, and Weibull distribution function of the SI and GI using the identified 337 transmission chains. Findings: It is found that the estimated SI 0 1 = 6.27 (95%CI : 5.62 − 6.98), SI 1 2 = 5.92 (95%CI : 5.27 − 6.66), SI 2+ 3+ = 4.36 (95%CI : 3.65 − 5.20), and GI 0 1 = 8.15 (95%CI : 6.73 − 9.91), GI 1 2 = 5.26 (95%CI : 4.25 − 6.69), GI 2+ 3+ = 3.85 (95%CI : 2.18 − 6.72), thus overall both SI and GI decrease when generation increases. Keywords: COVID-19; serial interval; generation interval; statistical analysis; contact tracing. As of April 11, 2020, COVID-19 has outbroken in 213 countries, areas or territories, and the World Health Organization (WHO) has reported over 1, 524, 161 confirmed cases and over 92, 941 confirmed deaths (WHO, 2020) . It is a huge challenge to plan intervention strategies aimed at controlling outbreaks of COVID-19 in all countries, areas, or territories. Some key disease transmission parameters, including the basic reproduction number, the time-varying reproduction number, the generation interval (GI, time difference between being infected and infecting others), the serial interval (SI, the time difference between symptom onset of the infector and the infectee), and the incubation period (IP, the time difference between being infected and symptom onset), might offer insightful information of the epidemics and thus, might be helpful in devising interventions. In particular, the basic and time-varying reproduction numbers are good indicators of the speed of disease spreading and the effectiveness of interventions. Estimation of the basic and time-varying reproduction number often needs SI. In fact, for epidemics that are infectious during the incubation period, estimation of the reproduction numbers needs GI (Ganyani et al., 2020) . It is possible that COVID-19 is infectious during incubation period (Qian et al., 2020 , Wei et al., 2020 . Therefore, in this work, we will perform a statistical analysis of both GI and SI. As soon as investigation data were made available, several papers have quantified the GI, SI and IP of COVID-19 by employing statistical and mathematical modelling (Ganyani et al., 2020 , You et al., 2020 , Du et al., 2020 , Tindale et al., 2020 , Backer et al., 2020 , Bui et al., 2020 , Moran, 2020 , Wang & Teunis, 2020 , Ping, 2020 . Please see Table 1 for their estimated values and also the sample sizes. We find that the estimated values of SI from those previous studies are in a wide range: 3.95 − 7.5 days for SI. However, an accurate estimation of SI (and GI) is crucial in calculating the reproduction numbers accurately. Therefore, in this work, we firstly want to provide a more accurate estimation of SI (and GI) with possibly larger sample sizes. Secondly, if possible, we also want to shine some light on why there can be such larger differences in the estimated value of SI. Another motivation for this work comes from the extended framework of estimating the time-varying reproduction number of COVID-19 in China . When working on determining the time-varying reproduction number of COVID-19 in China, we note that due to the different interventions for imported cases and local cases, their time-varying reproduction number should be different. Previously all analyses, as far as we know, have assumed that they are the same. See, for example, EpiEstim 2 ( Thompson et al., 2019) , which is a well-known R software on the estimation of time-varying reproduction number. For that, we need to distinguish the reported cases into the zeroth-generation imported cases X 0 , the first-generation locals infected by the imported cases X 1 , and so on, such as X 2 and X 3+ . From the transmission chains among those cases, we then find SI and GI between various generations, such as SI n n+1 and GI n n+1 , the SI and GI between the nth generation and the (n + 1)th generation. There are three tasks in the above motivations, namely, obtaining more reliable estimates of GI and SI of COVID-19, finding out possible reasons for considerable differences in previous appraisals, and also providing GI and SI for various generations. To accomplish all the three tasks above, we extracted from online reports released by 10 provincial health commissions in China except Hubei. From that data, we identified 337 transmission chains and estimated transmission parameters. As we will see later, the SI and GI for various generations are indeed quite different. We collected our data from publicly available official reports of case investigations published by provincial/municipal health commissions in China. The case investigations were performed by investigators in the corresponding centers for disease control and prevention in each province. The details of each confirmed case include the following necessary information: case ID, gender, age, date of symptom onset, date of diagnosis, history of traveling to or residing in Hubei or cities other than reporting city, date of arriving at the city where the case is reported. If identified via contact tracing performed by centers for disease control and prevention officers, the details also include contact case ID and date of exposure. The data includes 2300 confirmed cases that were compiled from online reports from 10 provinces in mainland China except Hubei between January 21, 2020, and February 29, 2020. Moreover, the cases are classified into different groups according to travel or residency history and chains of transmission of infection, if data on the case allows, as follows, 2. Local first-generation cases (X 1 ): Cases known to be infected by the imported cases, 3. Local second-generation cases(X 2 ): Cases known to be infected by the local first-generation cases, 4. Local third-plus cases (X 3+ ): Cases known to be infected by local second or higher generation cases. Imported cases can be Hubei residents, or people traveled to Hubei very recently as long as they just came out from Hubei recently and became infectors in other provinces in China. We call the former Hubei residents and the latter Hubei travelers. Note that here Hubei residents are the ones who have been living in Hubei for a long time but recently came out from Hubei to another province in China. The date of symptom onset is defined as the appearing date of symptoms relevant to COVID-19. The exposure date, which is needed for estimating GI, is estimated to be the middle of data for the earliest and latest possible exposure time for local cases and also for Hubei travelers. For Hubei residents, their exposure date is hard to find due to our lack of data on Hubei cases. Therefore, whenever the exposure date is needed, we discard our data on Hubei residents. We processed the interval-censored data in units of days and discarded non-positive values, which means, in the case of SI, the infector show symptoms latter than the infectee. This assumption might well be the truth or due to some error in data collection, especially when the infector and the infectee are from the same household. We did find many cases with non-positive values are from the same household. We decide not to use those non-positive data since it is hard to tell who the infector is between the pairs in the same household. Finally, we obtained 337 COVID-19 transmission events, and we named this dataset as "All". Then, we divided the "All" data into three subsets: Imported-first subsets E 0 1 , local first-second subsets E 1 2 , local second-third plus subsets E 2+ 3+ . E 0 1 are composed of the events that imported cases X 0 infect local first-generation cases X 1 , and others are defined accordingly. From these transmission chains, we get SI and GI for various generations. A Bayesian approach with doubly interval-censored likelihood (Reich et al., 2009 ) is then employed to obtain estimates of serial interval distribution, generation interval distribution, and incubation period distribution using the CmdStan package in R. For all 337 samples, the observed SIs have a mean at µ SI = 5.78 days and a standard deviation (SD) at δ SI = 3.92 days. By using all these 337 samples, we estimated the mean at 5.80(95%CI : 5.38 − 6.24) days and SD at 3.95(95%CI : 3.57 − 4.40) days for gamma distribution. We also applied the estimation based on the lognormal distribution and the Weibull distribution. The fitted distributions are shown in Fig. 1 and the estimated parameters are reported in Table 2 . We can see that, for most cases, the sample mean and sample SD agree quite well with the estimated values according to the gamma, lognormal, and Weibull distribution. From now on, in the main text, we only report sample values and fitted values from a gamma distribution. To further understand the wide range of the previously reported SIs, we estimated the distribution of SIs on three subsets. For the imported-first subset E 1 0 with 168 events, the observed SIs have a mean at µ SI = 6.23 days and an SD at δ SI = 4.24 days. We estimated the mean at 6.27(95%CI : 5.62 − 6.98) and SD at 4.46(95%CI : 3.84 − 5.19) from the gamma distribution, as shown in Fig. 1 Our estimated SI of the imported-first subset is slightly smaller than but close to the reported value of 7.5 in . For the local first-second subset E 1 2 with 112 events, the observed SIs have a mean µ SI = 5.88 days and a SD δ SI = 3.66 days. We estimated the mean at 5.92(95%CI : 5.27 − 6.66) days and SD at 3.70(95%CI : 3.11 − 4.42) days from the gamma distribution as shown in Fig. 1 (c) . For the local second-third plus subset with 57 events, the observed SIs have a mean µ SI = 4.28 days and a SD δ SI = 3.05 days. we estimated the mean at 4.36(95%CI : 3.65 − 5.20) days and SD at 2.89(95%CI : 2.26 − 3.77) days for the gamma distribution as shown in Fig. 1 (d). The estimated SI is close to the lower bound 3.95 reported in Ganyani et al. (2020) . It is found that the estimated SI gradually decreases from 6.38 to 4.24 as generation increases. This discovery also explains to a certain degree why previous reported SIs in different papers are sometimes quite different. This result also reminds us to look into the reasons for such a trend in SI. Qian et al. (2020), Wei et al. (2020) pointed out that with more and more infective cases, it is more probable an earlier infection will happen if there are pre-symptomatic transmissions. The earlier infections will likely make SI smaller. Thus, the gradually decreasing SI leads us to examine whether or not there are pre-symptomatic transmissions. . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . 6 . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.04.15.20065946 doi: medRxiv preprint To check if there are pre-symptomatic transmissions, we compared with the earliest exposure time of infectee and the onset time of infector. It is found that 95 of the 337 (28%) reports indicate that infectees may be infected before symptoms of infectors appear. Moreover, pre-symptomatic transmissions have occurred 36 of the 168 (21.4%) events in the imported-first subset, 34 of 112 (30.4%) events in the local first-second subset, 25 of 57 (43.9%) events in the local second-third plus subset. The ratio of pre-symptomatic transmission increases as generation increases. GI distribution is needed for the inference of the reproduction number (Wallinga & Lipsitch, 2006) . Often people use SI as a proxy of GI as the time of infection is not often reported in case files. In principle, SI and GI should have equal expected values since the IP time for the infector and infectee should cancel out. Consequently, GI is less studied than SI. However, firstly GI and SI still might have different standard deviations even if they have the same mean. As we will see later, it turns out that for COVID-19, even the means of GI and SI are slightly different, and their standard deviations are clearly different. Secondly and more importantly, for epidemics with pre-symptomatic transmissions, one needs GI instead of SI since, even before the onset of symptoms, transmissions can occur already. It has recently shown that estimates of the reproduction number are biased if ignoring the difference between SI and GI (Leonhard et al., 2019) . Surprisingly, only very few papers have studied GI of COVID-19 (Ganyani et al., 2020) . In this work, we would like to add one more study of GI of COVID-19. To get a GI value, we need exposure times of both the infector and the infectee in a transmission chain. However, exposure time is not available for many cases. Especially for Hubei residents in the imported cases, due to our lack of contacttracing data in Hubei, it is impossible to know their exact exposure date accurately. Therefore, to estimate GIs, we only consider imported cases with travel history (i.e., to exclude the Hubei residents from the imported cases) and use the middle of their trips as their date of exposure since people can often remember much better the dates of their trips. After that, we only obtained 43 events for estimating GI from 337 transmission chains. For the whole dataset with 43 events, the observed GIs have a mean of µ GI = 6.40 days and an SD of δ GI = 3.29 days. We estimated the mean at 5.96(95%CI : 5.0 − 7.07) days and SD at 3.07(95%CI : 2.28 − 4.21) days for the gamma distri-bution. The fitted distributions from the lognormal and Weibull distributions are shown in Fig. 2 and the estimated parameters are reported in Tab. 2. For the imported-first subset with 14 events, the observed GIs have a mean of µ GI = 8.14 days and an SD of δ GI = 2.48 days. We estimated the mean at 8.15(95%CI : 6.73 − 9.91) days and SD at 2.85(95%CI : 1.84 − 4.60) days for the gamma distribution. For the local first-second subset with 15 events, the observed GIs have a mean µ GI = 6.60 days and an SD of δ GI = 3.40 days. We estimated the mean at 5.26(95%CI : 4.25 − 6.69) days and SD at 1.76(95%CI : 0.91 − 3.28) days for the gamma distribution. For the local second-third plus subset with 14 events, the observed GIs have a mean µ GI = 4.43 days and an SD of δ GI = 2.98 days. We estimated the mean at 3.85(95%CI : 2.18 − 6.72) days and SD at 3.27(95%CI : 1.34 − 8.29) days for the gamma distribution. The estimated mean values of GI and SI seem to be slightly different, although their confidence intervals overlap marginally. Their standard deviations are clearly different. Moreover, as the generation increases, the means of GIs decrease from 8.15 to 3.85. This is consistent with the decreasing SI, as reported in Sec. 3.1. Of course, such a difference between GI and SI may be caused by small sample size in our GI data, or they might be indeed different. This difference calls for further analysis, which in turn calls for more information to be provided in the reported case files. We would like to point out that such differences between GI and SI at least make it not suitable to replace the distribution of GI with the distribution of SI in estimating reproduction numbers, as noted already by Leonhard et al. (2019) . Depending on different sample datasets, the estimated incubation period (IP) in previous studies have an even wider ranged 3.9 − 10.91 days , Backer et al., 2020 , Tindale et al., 2020 , You et al., 2020 , Moran, 2020 , Ping, 2020 . Please see Tab. 1 for their estimated values and sample sizes. Such a large discrepancy makes it difficult to plan for public health interventions. To estimate IP, we need the date of exposure and the date of symptom onset for each case. We identify 545 cases satisfying this condition from our data. From all of our 545 samples, we observed µ IP = 8.47 days and δ IP = 5.0 days. We estimated the mean at 8.36(95%CI : 7.94 − 8.80) days and SD at 4.89(95%CI : 4.51 − 5.31) days for the gamma distribution. The fitted distri-8 . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10. 1101 /2020 butions from the lognormal and Weibull distributions are plot in Fig. 3 and the estimated parameters are reported in Tab. 3. Again, we divide the dataset into three subsets, the imported cases with travel history (X 0,T ) (i.e., to exclude the Hubei residents from the imported cases), the local first-generation cases (X 1 ), and the local second-plus generation cases (X 2+ ). For the imported subset with 320 cases, the observed IPs have a mean of µ IP = 7.69 days and an SD of δ IP = 4.62 days. We estimated the mean at 7.39(95%CI : 6.90−7.91) days and SD at 4.2(95%CI : 3.76−4.70) days for the gamma distribution. We take the exposure date of the imported cases with travel history to be the middle of their trips since one can often remember dates of traveling accurately. Moreover, for most imported cases, their traveling times are often quite short. For the local first-generation (X 1 ) subset with 105 cases, the observed IPs have a mean µ IP = 8.83 days and a SD of δ IP = 4.20 days. We estimated the mean at 8.94(95%CI : 8.11 − 9.85) days and SD at 4.42(95%CI : 3.75 − 5.26) days for the gamma distribution. For the local second-plus generation (X 2+ ) subset with 120 cases, the observed IPs have a mean µ IP = 10.22 days and a SD of δ IP = 6.07 days. We estimated the mean at 10.15(95%CI : 9.03 − 11.38) days and SD at 6.38(95%CI : 5.39 − 7.61) days for the gamma distribution. Sometimes, for imported cases, in particular, knowing after their arrival typically how long they will show symptoms, infect locals, and also when the local infectees, who are infected by the imported cased, will show symptoms, can also be informative for decision-makers of intervention strategies. Therefore, in this work, we also show our results on these statistics. The serial interval upon arrival (S I arrival ) is defined as the interval between the date that an imported case arrives at the reporting city and the date that the infectee, infected by the imported case, shows symptoms. In our dataset, 194 transmission events meet this condition. The observed S I arrival have a mean of µ SI arrival = 10.63 days and a SD of δ SI arrival = 4.98 days. We estimated the mean at 10.65(95%CI : 9.95−11.4) days and SD at 5.08(95%CI : 4.51−5.74) days for gamma distribution, as shown in Fig. 4 (a) . We also plot the fitted distributions from the lognormal and Weibull distributions in Fig. 4 . The estimated S I arrival is reported in Tab. 2. The generation interval upon arrival (GI arrival ) is defined as the interval between the date that an imported case arrives at reporting city and the date that . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . he/she infects others. In our dataset, 40 transmission events meet this condition. The observed GI arrival have a mean of µ GI arrival = 3.35 days and an SD of δ GI arrival = 2.19 days. We estimated the mean at 2.81(95%CI : 2.21 − 3.56) days and SD at 1.85(95%CI : 1.3 − 2.71) days for the gamma distribution as shown in Fig. 4 (b) . The estimated GI arrival is reported in Tab. 2. In definition, SI upon arrival is more or less the summation of GI upon arrival and IP, which is SI arrival − GI arrival ≈ IP > 0, unlike the relation between the usual SI and GI, SI ≈ GI. The incubation period upon arrival (IP arrival ) is defined as the interval between the date that an imported case arrives at the reporting city and the date that the imported case shows symptoms. In our dataset, 815 cases meet the above condition. The observed IP arrival have a mean of µ IP arrival = 6.19 days and an SD of δ IP arrival = 4.37 days. We estimated the mean at 6.20(95%CI : 5.90 − 6.51) days and SD at 4.43(95%CI : 4.14 − 4.75) days for the gamma distribution as shown in Fig. 4 (c) . The estimated IP arrival is reported in Tab. 3. It is found that IP arrival is larger than GI arrival . This indicates again that pre-symptomatic transmissions do occur. In this paper, we firstly estimated serial intervals (SI) based on 337 transmission events, which are divided into three subsets, including imported-first subset E 0 1 , local first-second subset E 1 2 and local second-third plus subset E 2+ 3+ . It is found 12 . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . that the estimated SI decreases with the number of generations and they are respectively SI 0 1 = 6.27 ± 4.46, SI 1 2 = 5.92 ± 3.7, SI 2+ 3+ = 4.36 ± 2.89. We also found that pre-symptomatic transmissions have likely occurred in 95 events out of 337 events (28%). Then, we estimated the generation interval (GI) in the three subsets. It is also found that the estimated GI decreases as the generation increases, and they are respectively GI 0 1 = 8.15 ± 2.85, GI 1 2 = 5.26 ± 1.76, GI 2+ 3+ = 3.85 ± 3.27. We would like to point out that there are small differences between the means of GI and the corresponding SI, and clear differences between their standard deviations. This, together with the existence of pre-symptomatic transmissions, makes it necessary to use GI in estimating reproduction numbers rather than SI. Next, we estimated the incubation period (IP) of different groups of cases. It is found that the estimated IPs have a mean at IP 0,T = 7.39 ± 4.2 days for 320 imported cases with travel history, a mean at IP 1 = 8.94 ± 4.42 for 105 local firstgeneration cases, and a mean at IP 2+ = 10.15 ± 6.38 days for 120 local second plus-generation cases. Moreover, we estimated the SI, GI, and IP upon arrival of the imported cases at the reporting city. It is found that the imported cases will show symptoms after IP arrival = 6.19 ± 4.4 days arrival in reporting cities and will infect others after GI arrival = 3.29 ± 2.65 days. The difference between these two intervals also indicates that pre-symptomatic transmission is likely to occur. Finally, it is found that the local first generating cases (infected by imported cases) will show symptoms after SI arrival = 10.02 ± 4.91 days imported cases arrived at the reporting cities. Providing statistics for various generation cases so that in further studies better models can be established, for example, by making use of different values of transmission parameters for different generations, is the main contribution of this work. Our results also explain to a certain degree that why in previous studies values of those estimated parameters span a wide range. For the imported cases, in particular, we reported SI, GI, and IP upon their arrivals. This study can be meaningful for both planning intervention and modeling epidemics. Furthermore, one should note that for epidemics with pre-symptomatic transmissions, in estimating the basic and the time-varying reproduction number, GI should be used instead of SI. There are several limitations to this study. Our data is restricted to online reports from only 10 provinces in China. The content of epidemiological investigation reports from different provinces varies a lot. Many case reports do not 13 . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . have exposure date and infector ID, which are quite crucial in epidemics modeling. Thus, while admitting this limitation, here we also call for designing/utilizing a standard format of the case reports, countrywide, or even worldwide. Our sample size, especially on generation interval, is still very small. Thus, our results GI are not as reliable as the ones on SI. . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . 19 . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10. 1101 /2020 20 . CC-BY-NC-ND 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10. 1101 /2020 Incubation period of 2019 novel coronavirus (2019-ncov) infections among travellers from wuhan, china Early estimation of reproduction number of covid-19 in vietnam. medRxiv The serial interval of covid-19 from publicly reported confirmed cases Estimating the generation interval for covid-19 based on symptom onset data. medRxiv Handbook of infectious disease data analysis Early transmission dynamics in wuhan, china Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: A statistical analysis of publicly available case data Time-varying transmission dynamics of novel coronavirus pneumonia in china. bioRxiv Epidemiological parameters of coronavirus disease 2019: a pooled analysis of publicly reported individual data of 1155 cases from seven countries. medRxiv Epidemiologic characteristics of early cases with 2019 novel coronavirus (2019-ncov) disease in korea Serial interval of novel coronavirus (2019-ncov) infections. medRxiv Estimating incubation period distributions with coarse data Further improved epiestim by distinguishing imported, infected by imported, infected by local cases and its application to covid-19 in china Improved inference of time-varying reproduction numbers during infectious disease outbreaks Transmission interval estimates suggest pre-symptomatic spread of covid-19. medRxiv How generation intervals shape the relationship between growth rates and reproductive numbers Strongly heterogeneous transmission of covid-19 in mainland china: local and regional variation. medRxiv Presymptomatic Transmission of SARS-CoV-2 -Singapore Coronavirus disease (covid-19) outbreak situation Estimation of the time-varying reproduction number of covid-19 outbreak in china. medRxiv The authors would like to thank the data processing team, including Jiatong Zhu, Jianxin Zhang, Kaiwen Li, Yuting Zhang, Ningning Qiao, for their hard work of extracting the contact-tracing data from online reports. We should also note that it is the center of disease control and prevention of China at all levels, including national, provincial, county-level, and their great investigators who performed the case investigations and released case reports. This work is partly supported by the National Key Research and Development Plan (2017YFC1502901) and the National Natural Science Foundation of China (grant no. 71974017).