key: cord-0992552-ydsnyq39 authors: Mettler, S. K.; Maathuis, M. H. title: Clinical onset serial interval and diagnostic serial interval of SARS-CoV-2/COVID-19 in South Korea date: 2020-05-09 journal: nan DOI: 10.1101/2020.05.05.20070946 sha: cc91e7e42e914b4bfbbca2132b5d63753c639c99 doc_id: 992552 cord_uid: ydsnyq39 BACKGROUND The clinical onset serial interval, or the time between the onset of symptoms in successive cases in a chain of infection, is often used as a measurable proxy for the transmission serial interval of an infectious disease. Current estimates of the mean clinical onset serial interval of COVID-19 range from 3.96 to 7.5 days. In this article, we define the diagnostic serial interval as the time between the diagnosis dates of the infector and infectee. We study and compare the clinical onset and diagnostic serial intervals of SARS-CoV-2/COVID-19 in South Korea. METHODS Analyzing the DS4C project data which summarize information on SARS-CoV-2/COVID-19 cases reported by regional governments in South Korea, we estimate the means of the clinical onset serial interval, the diagnostic serial interval and the difference between the two. We use the balanced cluster bootstrap method to construct 95% bootstrap confidence intervals. RESULTS The mean clinical onset serial interval and mean diagnostic serial interval were estimated to be 3.58 days (95% CI: 2.62, 4.53) and 3.68 days (95% CI: 3.14, 4.22), respectively. A matched sample analysis showed that the diagnostic serial interval was significantly shorter than the clinical onset serial interval (estimated mean difference -1.17 days, 95% CI: -2.26, -0.09). CONCLUSIONS The short diagnostic serial interval of SARS-CoV-2/COVID-19 in South Korea may explain why South Korea was able to contain the COVID-19 outbreak and avoid high mortality. We conjecture that the mean diagnostic serial interval may serve as a predictor for the success of a country's containment efforts. The serial interval of an infectious disease, also known as the generation time, is defined as the time between analogous phases in successive cases of a chain of infection [1] . The transmission serial interval, or the time between the infection events of the infector and infectee, is particularly important as it is one of the factors that determine how rapidly the disease can spread in the community, but it is di cult to measure [2] . The 1 . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 9, 2020. . https://doi.org/10.1101/2020.05.05.20070946 doi: medRxiv preprint clinical onset serial interval, a commonly used form of serial interval for its easier measurement, refers to the time between the onset of symptoms of the infector and the infectee [2] . Table 1 summarizes recent findings on the clinical onset serial interval of COVID-19. In this article, we study the clinical onset serial interval of COVID-19 using a rich data set of infector-infectee pairs observed in South Korea between January 20th and April 7th, 2020. We also introduce and study the diagnostic serial interval, which we define as the time between the infector's diagnosis and the infectee's diagnosis. In the case of SARS-CoV-2/COVID-19, diagnosis is assumed to be laboratory confirmation of SARS-CoV-2 infection. While the clinical onset serial interval is largely a characteristic of the pathogen, the diagnostic serial interval contains additional information on public health policy and capacity. A total of 10'384 individuals were confirmed to be infected with SARS-CoV-2 in South Korea between January 20th and April 7th [7] . The seventeen regional governments of South Korea have published information on infected individuals, including date of diagnosis, age, gender, infection route, infector (if known) and symptom onset date (if symptomatic and reported). Among the seventeen regional governments, fourteen have published information on over 85% of their confirmed cases, the exceptions being Daegu (publishing information on less than 1% of 6'803 patients as of April 7th), Jeju-do and Gangwon-do [7, 8] . The DS4C project summarizes the information published by the regional governments and makes these data available for public use [8] . Our analysis is based on the DS4C project data (as of April 8th), containing information on 3'127 individuals. Among these 3'127 individuals, there were 728 known infector-infectee pairs. Date of diagnosis was recorded for all infectors and all infectees in these pairs. In order to reduce bias resulting from right truncation, we 1 Infector-infectee information from China, Germany, South Korea, Singapore, Taiwan and Vietnam 2 . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 9, 2020. . https://doi.org/10.1101/2020.05.05.20070946 doi: medRxiv preprint excluded 29 pairs for which the infector's diagnosis date was later than March 31st. The remaining 699 infector-infectee pairs, containing 310 unique infectors and 699 unique infectees, were used to study the diagnostic serial interval. Twelve of the seventeen regional governments have sporadically reported symptom-related information including the symptom onset dates. Among the 728 known infector-infectee pairs, there were 76 pairs for which the onset date of symptoms was known for both the infector and the infectee. These 76 pairs were composed of 38 unique infectors and 76 unique infectees. As no infector's symptom onset date was later than March 31st, all 76 pairs were kept for analysis. These data were used to study the clinical onset serial interval and the di erence between the diagnostic and clinical onset serial intervals. An inclusion flowchart is provided in the supplement. We estimated the mean of the clinical onset serial interval based on the 76 pairs with known symptom onset dates. Similarly, we estimated the mean of the diagnostic serial interval based on the 699 pairs described above. We computed 95% confidence intervals for these parameters using the bootstrap method. To account for dependencies between observations caused by common infectors, we used the balanced cluster bootstrap method which regards each infector as a cluster [9] . For the 76 infector-infectee pairs for which we know both the clinical onset serial interval and the diagnostic serial interval, we also computed the di erence between these two intervals ( = diagnostic serial interval ≠ clinical onset serial interval). We estimated the mean of the distribution of these di erences, and used the balanced cluster bootstrap method to construct the corresponding 95% bootstrap confidence interval. All computations were performed using the statistical software R. All R code is provided in the supplement. Figure 1 visualizes the relationship between infectors' and infectees' symptom onset dates (Figure 1(a) ) and diagnosis dates (Figure 1(b) ). The data points below the diagonal line in Figure 1 (a) show that some infectees developed symptoms before their infectors, indicating the presence of asymptomatic transmission. A similar phenomenon occurs for the diagnosis dates in Figure 1(b) , but to a lesser extent. Histograms of the clinical onset serial intervals and diagnostic serial intervals are shown in Figure 2 (a). The distributions appear to have di erent shapes. Also, as already indicated in Figure 1 , we see some negative serial intervals, especially for the clinical onset serial interval. Precisely, among the 76 pairs for whom the clinical onset serial interval was calculated, 9.2 percent of the infectees developed symptoms before their infectors. Among the 699 pairs for whom the diagnostic serial interval was calculated, 2.1 percent of the 3 . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 9, 2020. 4 . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 9, 2020. . https://doi.org/10.1101/2020.05.05.20070946 doi: medRxiv preprint infectees were diagnosed before their infectors. Among the 76 pairs for whom both intervals were calculated, 56.6 percent of the pairs had a shorter diagnostic serial interval than clinical onset serial interval. Table 2 shows the estimated means of the clinical onset serial interval, the diagnostic serial interval, and their di erence in days. The 95% confidence interval for the mean diagnostic serial interval is shorter than that for the mean clinical onset serial interval, due to the larger data set that can be used for the diagnostic Estimating the transmission serial interval in the context of a pandemic is important because it provides the time window for the containment of the spreading infection. Nishiura et al [10] pointed out that the shorter the true generation time of SARS-CoV-2/COVID-19 is, the more undiagnosed infected individuals there will be. A particular challenge of SARS-CoV-2/COVID-19 is that these undiagnosed individuals may spread the virus resulting in an extraordinarily fast growing pandemic [4, 10, 11] . As infection dates are typically unknown, the transmission interval is very hard to obtain. We therefore studied the commonly used clinical onset serial interval of COVID-19 in South Korea. We also introduced and studied a new type of serial interval, the diagnostic serial interval. While the clinical onset serial interval is largely a characteristic of the pathogen, we argue that the diagnostic serial interval contains additional information on public health policy and capacity. Our estimate of the mean clinical onset serial interval of 3.58 days (95% CI: 2.62, 4.53) is comparable to the findings of Nishiura et al [4] and Du et al [5] (see Table 1 ) and similar to the results for the diagnostic serial interval (estimated mean: 3.68 days, 95% CI: 3.14, 4.22). According to the matched sample analysis of 76 5 . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 9, 2020. . https://doi.org/10.1101/2020.05.05.20070946 doi: medRxiv preprint pairs for which we had both the clinical onset and the diagnostic serial intervals, the diagnostic serial interval was significantly shorter than the clinical onset serial interval (estimated mean di erence: -1.17 days, 95% CI: -2.26, -0.09). The relatively short diagnostic serial intervals in South Korea, compared to the clinical onset serial intervals, are likely due to the country's broad testing policy, laboratory capacity and its intensive e orts in contact tracing. In countries with more restrictive testing policies (e.g. symptoms such as fever or cough and pre-existing health conditions are prerequisites), we expect the diagnostic serial intervals to be longer. Short diagnostic serial intervals imply faster identification of new infection events and therefore point to a better containment of SARS-CoV-2/COVID-19. We therefore pose that the diagnostic serial interval may be used as an early predictor for the success of a country's containment e orts. It will be interesting to study this in other countries and investigate the predictive power of this indicator. Besides its relevance for public health policy, the diagnostic serial interval has the advantage that it is easier to estimate than the clinical onset serial interval, since dates of diagnosis are routinely recorded and less subjective than the onset of symptoms. Moreover, in contrast to the clinical onset serial interval, it is defined for asymptomatic carriers of SARS-CoV-2. This is especially favorable as asymptomatic transmission appears possible [11] , and a significant portion of asymptomatic carriers remain asymptomatic [12, 13] . Our study has several limitations, the main one being that our data do not constitute a random sample of the population of infected individuals. As a result, we may su er from selection bias in various ways. For example, certain regions are strongly underrepresented, one of which being the province of Daegu which was severely a ected by COVID-19. Also, some selection bias for shorter intervals is likely to exist, as infector-infectee pairs with shorter serial intervals may be identified more easily than those with longer serial intervals. . CC-BY-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted May 9, 2020. . https://doi.org/10.1101/2020.05.05.20070946 doi: medRxiv preprint A Dictionary of Epidemiology The Interval between Successive Cases of an Infectious Disease Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Serial interval of novel coronavirus (COVID-19) infections Serial Interval of COVID-19 among Publicly Reported Confirmed Cases Epidemiologic characteristics of early cases with 2019 novel coronavirus (2019-nCoV) disease in Korea Updates on COVID-19 in Republic of Korea Data Science for COVID-19 (DS4C) Bootstrap Methods and Their Application The Rate of Underascertainment of Novel Coronavirus (2019-nCoV) Infection: Estimation Using Japanese Passengers Data on Evacuation Flights Transmission of 2019-nCoV Infection from an Asymptomatic Contact in Germany Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19) Follow-up of the asymptomatic patients with SARS-CoV-2 infection International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity The copyright holder for this preprint this version posted May 9, 2020. . https://doi.org/10.1101/2020.05.05.20070946 doi: medRxiv preprint