key: cord-1009513-vspnuxz9 authors: Nishiura, Hiroshi; Linton, Natalie M; Akhmetzhanov, Andrei R. title: Serial interval of novel coronavirus (2019-nCoV) infections date: 2020-02-13 journal: nan DOI: 10.1101/2020.02.03.20019497 sha: a1bff76ce360e8990b0a4ee2a5228a6e6e63d9c1 doc_id: 1009513 cord_uid: vspnuxz9 Objective: To estimate the serial interval of novel coronavirus (COVID-19) from information on 28 infector-infectee pairs. Methods: We collected dates of illness onset for primary cases (infectors) and secondary cases (infectees) from published research articles and case investigation reports. We subjectively ranked the credibility of the data and performed analyses on both the full dataset (n=28) and a subset of pairs with highest certainty in reporting (n=18). In addition, we adjusting for right truncation of the data as the epidemic is still in its growth phase. Results: Accounting for right truncation and analyzing all pairs, we estimated the median serial interval at 4.0 days (95% credible interval [CrI]: 3.1, 4.9). Limiting our data to only the most certain pairs, the median serial interval was estimated at 4.6 days (95% CrI: 3.5, 5.9). Conclusions: The serial interval of COVID-19 is shorter than its median incubation period. This suggests that a substantial proportion of secondary transmission may occur prior to illness onset. The COVID-19 serial interval is also shorter than the serial interval of severe acute respiratory syndrome (SARS), indicating that calculations made using the SARS serial interval may introduce bias. The epidemic of novel coronavirus infections that began in 12 China in late 2019 has rapidly grown and cases have been reported worldwide . 13 An empirical estimate of the serial interval-the time from illness onset in a 14 primary case (infector) to illness onset in a secondary case (infectee)-is needed 15 to understand the turnover of case generations and transmissibility of the disease 16 [1] . Estimates of the serial interval can only be obtained by linking dates of onset 17 for infector-infectee pairs, and these links are not easily established. A recently 18 published epidemiological study used contact tracing data from cases reported in 19 Hubei Province early in the epidemic to estimate the mean serial interval at 7.5 20 days [2] , which is consistent with the 8.4-day mean serial interval reported for 21 severe acute respiratory syndrome (SARS) from Singaporean household contact 22 data [3] . However, there were only six infector-infectee pairs in this dataset, and 23 sampling bias may have been introduced to the variance and mean. To further 24 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which was not peer-reviewed) is the . https: //doi.org/10.1101 //doi.org/10. /2020 -4 -assess the serial interval of COVID-19 infections we compiled a dataset of 28 1 publicly shared infector-infectee pairs and calculated the serial interval from 2 these data. 3 We scanned publicly available information published in research articles 5 and quoted from official reports of outbreak investigations to obtain our dataset. 6 The date of illness onset was defined as the date on which a symptom relevant to 7 COVID-19 infection appeared and was determined by the reporting 8 governmental body. We subjectively ranked the credibility of the ascertained 9 pairs into "certain" and "probable," where the former was used for pairs and 10 dates of illness onset were clearly defined in an academic article and the latter 11 was applied to pairs and dates of illness onset that were clearly defined but 12 quoted from outbreak investigation reports. Estimates were obtained for certain 13 and probable pairs combined (n=28) as well as for the certain pairs alone (n=18). 14 The interval censored data were handled in units of days. We employed 15 a Bayesian approach with doubly interval censored likelihood to obtain estimates 16 of the serial interval [4]: 17 where i represents the identity of each pair, E(R,L) is the interval for symptom 18 onset of the infector and S(R,L) is the interval for symptom onset of the infectee. 19 Here, g(.) is the probability density function (p.d.f.) of exposure following a 20 uniform distribution and f(.) is the p.d.f. of the serial interval, assumed to be 21 governed by three different distributions-lognormal, gamma, and Weibull. We 22 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.02.03.20019497 doi: medRxiv preprint -5 -sampled the posterior distributions using CmdStan version 2.22.1 1 (http://github.com/aakhmetz/nCoVSerialInteval2020). 2 As the epidemic will continue to grow beyond our data collection cutoff 3 point of 12 February 2020, it is possible that the naïve likelihood (1) 4 underestimates the serial interval as sampling during the early stage of the 5 epidemic preferentially excludes infector-infectee pairs with longer serial 6 intervals. We adjusted for this selection bias-called right truncation-in our 7 model. The alternative p.d.f. that accounts for right truncation during the 8 exponential growth phase of the epidemic is written as: 9 where r is the exponential growth rate estimated at 0.14 [5] and T is the latest 10 time of observation (12 February 2020). The widely applicable information 11 criterion (WAIC) was used to compare between distributions and the model with 12 the minimal WAIC value was selected as the best-fit model for each set of 13 estimates with and without right truncation. 14 We were able to obtain data on 28 infector-infectee pairs (see 16 Supplementary Table) . Of these, 12 pairs were family clusters. Accounting for 17 right truncation and analyzing all pairs, the model using the lognormal 18 distribution was selected as the best-fit model (WAIC=224.0) The median serial 19 interval was estimated at 4.0 days (95% credible interval [CrI]: 3.1, 4.9) while 20 the mean and standard deviation (SD) of the serial interval were estimated at 4.7 21 days (95% CrI: 3.7, 6.0) and 2.9 days (95% CrI: 1.9, 4.9), respectively. Without 22 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.02.03.20019497 doi: medRxiv preprint -6 -truncation, the model using the lognormal distribution was also the best-fit model 1 (WAIC=128.0) with the median serial interval was estimated at 3.9 days (95% 2 CrI: 3. 1, 4.8) . 3 Limiting our dataset to only certain observations, the median serial 4 interval of the best-fit Weibull distribution model was estimated at 4.6 days (95% 5 CI: 3.5, 5.9) with a mean and SD of 4.8 days (95% CrI: 3.8, 6.1) and 2.3 days 6 (95% CrI: 1.6, 3.5), respectively. Without truncation, the best-fit model used the 7 lognormal distribution and estimated the median serial interval at 4.1 days (95% 8 CrI: 3.2, 5.0). Figure 1 shows the best-fit distributions overlaid with a published 9 distribution of the SARS serial interval [4] . 10 Our estimate of the median serial interval as 4.0 days indicates that 12 COVID-19 infection leads to rapid cycles of transmission from one generation of 13 cases to the next. The shorter serial interval compared to SARS implies that 14 contact tracing methods must compete against the rapid replacement of case 15 generations, and the number of contacts may soon exceed what available 16 healthcare and public health workers are able to handle. The difference between 17 these distributions suggests that using serial intervals estimates from SARS data 18 will result in overestimation of the COVID-19 basic reproduction number. 19 More importantly, the estimated median serial interval is shorter than the 20 preliminary estimates of the mean incubation period (approximately 5 days) 21 [3, 6] . As illustrated in Figure 2 , when the serial interval is shorter than the 22 incubation period, pre-symptomatic transmission is likely to have taken place 23 and may even occur more frequently than symptomatic transmission. A 24 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.02.03.20019497 doi: medRxiv preprint -7 -substantial proportion of secondary transmission occurring before illness onset 1 indicates that many transmissions cannot be prevented solely through isolation of 2 symptomatic cases, as by the time contacts are traced they may have already 3 become infectious themselves and generated secondary cases [7] . 4 Correct ascertainment of dates of illness onset is critical to the calculation of 5 the serial interval. Considering the overall mild nature of the infection [8] it is 6 possible that different reporting jurisdictions have different criteria for 7 determining what qualifies as illness onset for COVID-2019 cases, which is a 8 potential bias we are unable to account for. However, the present study addresses 9 the issue of data quality of the reported pairs in two ways. First, our data include 10 the updated information from a recent report of pre-symptomatic transmission in 11 Germany [9] where it was later found that the primary case was already 12 symptomatic while in contact with persons who later became infected 13 (Supplementary Material in [9] ). Second, classification of the credibility of the 14 data and comparing analyses including and excluding less certain (but 15 nonetheless highly probable) pairs allowed us to determine that our results using 16 all pairs (and therefore a greater sample size) did not differ significantly from the 17 results using only the most credible data. 18 In conclusion, we have estimated the median serial interval of COVID-19 at 19 4.0 days, which is shorter than the disease's median incubation period indicating 20 that rapid cycles of transmission and substantial pre-symptomatic transmissions 21 are occurring. Thus, containment via case isolation alone is likely to be very 22 challenging. 23 24 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.02.03.20019497 doi: medRxiv preprint If the transmission takes place during the symptomatic period of the primary case, 9 the serial interval is longer than the incubation period. However, this relationship 10 can be reversed when pre-symptomatic transmission takes place (the secondary 11 case may even experience illness onset prior to onset in their infector). 12 13 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10. 1101 The interval between successive cases of an infectious disease. Am 15 Epidemiological characteristics of novel 12 coronavirus infection: A statistical analysis of publicly available case data Factors that make an 15 infectious disease outbreak controllable The rate of 20 underascertainment of novel coronavirus (2019-nCoV) infection: Estimation 21 using Japanese passengers data on evacuation flights All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity.The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.02.03.20019497 doi: medRxiv preprint -9 - author/funder, who has granted medRxiv a license to display the preprint in perpetuity.The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.02.03.20019497 doi: medRxiv preprint -10 -infection from an asymptomatic contact in Germany. N Eng J Med. 2020; in 1 press. doi:10.1056/NEJMc2001468. 2 3 All rights reserved. No reuse allowed without permission. author/funder, who has granted medRxiv a license to display the preprint in perpetuity.The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10. 1101