key: cord-0582744-zxlt7svn authors: Siegenfeld, Alexander F.; Bar-Yam, Yaneer title: Eliminating COVID-19: A Community-based Analysis date: 2020-03-23 journal: nan DOI: nan sha: a8a77e7cd91c1cdf3528f09dad1e1cc32474269d doc_id: 582744 cord_uid: zxlt7svn We analyze the spread of COVID-19 by considering the transmission of the disease among individuals both within and between communities. A set of communities can be defined as any partition of a population such that travel/social contact within each community far exceeds that between them (e.g. the U.S. could be partitioned by state or commuting zone boundaries). COVID-19 can be eliminated if the community-to-community reproductive number---i.e. the expected/average number of other communities to which a single infected community will transmit the virus---is reduced to less than one. We find that this community-to-community reproductive number is proportional to the travel rate between communities and exponential in the length of the time-delay before community-level action is taken. Thus, reductions in travel and the speed at which communities take action can play decisive roles in stopping the outbreak. The analysis suggests that for the coronavirus to be eliminated, it is not necessary to impose aggressive social distancing measures all over the world at once, but rather only in communities in which active spreading is detected. The sooner such measures are imposed, the shorter the duration they must remain in place. If infected communities (including those that become re-infected in the future) are quick enough to act, the number of actively infected communities (and thus the number of communities in which such measures are required) will exponentially decrease over time. We analyze the spread of COVID-19 by considering the transmission of the disease among individuals both within and between communities. A set of communities can be defined as any partition of a population such that travel/social contact within each community far exceeds that between them (e.g. the U.S. could be partitioned by state or commuting zone boundaries). COVID-19 can be eliminated if the community-to-community reproductive number -i.e. the expected/average number of other communities to which a single infected community will transmit the virus-is reduced to less than one. We find that this community-to-community reproductive number is proportional to the travel rate between communities and exponential in the length of the time-delay before communitylevel action is taken. Thus, reductions in travel and the speed at which communities take action can play decisive roles in stopping the outbreak. The analysis suggests that for the coronavirus to be eliminated, it is not necessary to impose aggressive social distancing measures all over the world at once, but rather only in communities in which active spreading is detected. The sooner such measures are imposed, the shorter the duration they must remain in place. If infected communities (including those that become re-infected in the future) are quick enough to act, the number of actively infected communities (and thus the number of communities in which such measures are required) will exponentially decrease over time. Many studies of disease spread consider individuals as the primary unit of analysis, with the reproductive number-i.e. the number of people to whom an infected individual will on average transmit the disease-playing a central role. It is well known that an outbreak can be stopped if interventions reduce this reproductive number to less than 1. Here, we consider the spread of disease among communities, mediated by its spread among individuals. Central to our analysis is the analogous community-to-community reproductive number R * [1, 2] , i.e. the expected/average number of other communities (including those that have been previously infected) to which a single infected community will transmit the infection. Achieving R * < 1 will stop the outbreak. Our analysis applies to any partition of a population into a set of communities in which travel/social contact within each community far exceeds that between them. For the purposes of interventions, treating larger areas as single communities means that social distancing measures will be homogeneously applied to larger areas at once but also means that it is easier to achieve lower travel rates between such areas. Section II describes the model, section III estimates parameters for COVID-19, including its doubling time and basic reproductive number R 0 , and section IV concludes by evaluating the effects of various interventions. Key results include that (1) it is possible for a region to eliminate COVID-19 if fast enough action is taken in the communities therein, (2) the probability of one The disease is modeled as being transmitted among individuals within a community, with travel allowing the disease to spread between communities. We define a community as infected if someone with the infection enters the community. Let i c 0 be a stochastic factor that roughly corresponds to the initial foothold that the virus gains in community c conditioning on an infected individual entering the community, with i c 0 = 0 corresponding to the case in which no one was infected or a few people were infected but the outbreak was contained (perhaps through contact tracing and quarantine). If i c 0 = 0, the outbreak spreads no further within or between communities, and so we can write i c (t) = 0, where i c (t) is the number of active infections in community c as a function of time. If the infection is not contained (i.e. i c 0 = 0), the number of active infections is modeled as growing with time t at an exponential rate e rct . After some time T c (the delay in response), the community implements aggressive social distancing measures that cause the number of active infections to decay as e −t/τc , where τ c is the amount of time that it takes for the number of active infections to drop by a factor of e. Such exponential decrease will occur if the aggressive social distancing measures, together with testing, contact tracing, and quarantine, can reduce the reproductive number (R) of the virus below 1. The basic reproductive number R 0 (without intervention) is estimated at approximately 2.5 (see section III); if the FIG. 1. The number of active infections in a community as a function of time. After time Tc, aggressive social distancing measures are implemented. They must remain in place for a duration greater than τc(rcTc + ln i c 0 ) (see eq. (2)). Thus, the longer the community waits to enact the measures, the longer the total amount of time they must remain in place. spread of the virus is reduced by a factor greater than R 0 , the number of infections will exponentially decay. 1 Decreases in case counts in South Korea and China (see section III) indicate that such a reduction is achievable. The greater the reduction in R, the smaller the value of τ c and thus the faster the decrease in infections. 2 The number of active infections i c (t) (see fig. 1 ) can therefore be written as: The social distancing measures can be lifted once there are no remaining active infections in the community or once all active infections have been contained. Solving for i c (t) = 1 (assuming i c 0 = 0) yields a duration of the aggressive social distancing measures of plus some additional time at the end to ensure that there are no remaining hidden cases. As the number of cases becomes increasingly small, contact tracing may become increasingly effective and hasten the drop of i c (t) to 0. The probability that the number of infections will rebound after the social distancing measures are lifted (in 1 For instance, simulations from the Institute for Disease Modeling show that a 75% contact reduction will result in an exponential decrease in the number of active infections after a time-delay due to the incubation periods and durations of existing infections [3] . 2 For R < 1, R is related to τc by 1 = ∞ 0 Rg(t)e t/τc dt where g(t) is the distribution of generation intervals [4] (see section III and its footnotes for a description of generation intervals). which case an additional phase of such measures will be needed), as in Imperial College Report [5] , will depend on the rate of importation from other communities, which in turn will depend on the community-to-community reproductive number R * , as described below. Despite the possibility of the virus being re-imported, as long as R * < 1 the number of infected communities will exponentially decrease over time, since re-importation events are included in R * . Each infected community c infects a currently uninfected community with a probability rate proportional to the number of active infections i c (t) times the probability rate p c that an infected individual will travel to an uninfected community. The number of new infected communities spawned by community c can thus be modeled as a Poisson process with rate i c (t)p c . This modeling assumption overestimates the spread of the disease to new communities by counting a single new community that has been infected multiple times as multiple new infected communities. Note that a single new community that is infected by more than one other community is also counted as multiple new infected communities. If most communities are uninfected and the virus is being contained, an infected community will infect on average less than one other community, and thus there is a negligible probability that the same one will be infected twice. However, if many communities are infected and/or the number of infected communities is exponentially growing, this analysis serves only as an upper bound, as it neglects saturation effects. In order for the disease to be eliminated, the former scenario must be attained, and thus the Poisson assumption does not affect our conclusions. Let p c 0 be the per capita probability rate before time T c of individuals in community c traveling to other communities and p c 1 be the probability rate afterwards (p c 1 will be less than p c 0 if travel is discouraged and/or restricted at the time at which social distancing measures are implemented). The number of new communities that are infected by community c will then be a Poisson random variable with a mean of Taking the expected value over i c 0 yields where R c * is the community-to-community reproductive number for community c, i.e. the expectation of the number of communities that community c will infect if it is infected. As the parameters r c , τ c , T c , E[i c 0 ], p c 0 , and p c 1 may differ from community to community, so will R c * . If the interventions are fast enough and strong enough such that R * , the average value of R c * over a set of communities with each community weighted by its probability of being infected, is less than 1, then the outbreak will not be self-sustaining within that set of communities. A set of communities can thus exist in one of two phases or regimes: a regime in which over time the number of infected communities exponentially decreases to zero and a regime in which over time the number of infected communities exponentially increases until it saturates as an endemic disease or reaches burnout. Thus, small deviations from the assumptions that led to the formula for R c * will not matter so long as they do not change which regime the system is in, i.e. whether or not R * < 1. It is this universality [6] that allows us to understand whether or not any disease will spread, even if it is not possible to precisely describe the details of disease transmission and social connectivity. In order to better understand the extent of the measures required to achieve R * < 1, we estimate the values of the parameters in eq. (5). The doubling time can vary from location to location, but using the number of confirmed cases in China by date of symptom onset (rather than by date of diagnosis) [7] gives a doubling time of 3.04 days in the period leading up to the Jan. 23 lockdown, which corresponds to r = 0.228 day −1 (see fig. 2 ). 3 From this growth rate before the Jan. 23 lockdown, the basic reproductive number R 0 can be calculated from the distribution of generation intervals [4] . 4 Empirical data from various sources support a distribution of generation intervals with a mean of approximately 4.0 to 4.7 days [12] [13] [14] [15] [16] [17] , which yield upper bounds of R 0 < e 4.0r = 2.5 to R 0 < e 4.7r = 2.9. These upper bounds assume all transmission occurs at the mean generation interval; the spread of generation intervals given a fixed mean results in lower values for R 0 given a fixed 3 Some studies estimate the doubling time for COVID-19 at approximately 7 days [8, 9] , but even a 5 day doubling period is implausibly long, given that in various countries, even with some preventative measures, the number of infections has increased by far more than a factor of 64 over 30 days [10] . Part of the difficulty in estimating the doubling time from the initial period of transmission is that 'super-spreader' events may play an important role in the transmission process. The presence of superspreader events indicates that the transmission process may be fat-tailed and therefore standard statistical approaches may underestimate the rate of spread when the total number of cases is still small [11] . 4 Empirically, we generally observe the distribution of serial intervals (the times between the onsets of symptoms in two successive cases in a transmission chain) rather than the distribution of generation intervals (the times between two successive infections in a transmission chain). The means of the two distributions should, however, be the same. Date of onset of symptoms r (or higher values for r given a fixed R 0 ). For instance, the serial interval data with a mean of 4.7 days best fit a lognormal distribution (S.D.=2.9 days) [12] , which yields R 0 = 2.5. 5 This short mean generation time of 4.0 to 4.7 days may explain the difficulty of containing the virus through contact tracing alone. The values of τ c that can be achieved depend on the effectiveness of the social distancing measures. The data from China (see fig. 2 ) indicate a halving time of as little as 4.78 days is achievable, which corresponds to τ = 6.9 days. E[i c 0 ] is the expected "effective" number of people an infected traveler will infect while visiting community c, taking into account containment efforts. 6 We estimate E[i c 0 ] = R 0 ; the degree to which E[i 0 c ] differs from R 0 depends on how likely a typical traveler is to transmit the virus relative to a typical resident, as well as on the effectiveness of contact tracing and other containment efforts. The value of p c 0 depends on the frequency of travel out of community c. There is some choice in how to model the partition of a population into a set of communities. In general, the larger the communities, the lower the frequency of per capita travel out of them but the more homogeneous the application of the aggressive social distancing measures. Considering a set of communities within the U.S. that are large enough such that travel between the communities is predominantly by flight yields a per capita travel rate of 0.004 flights out of a community per person per day. 7 The values above allow us to use eq. (5) to determine R c * as a function of the time-delay before aggressive social distancing measures are enacted, as shown in fig. 3 . Note that the time-delay is measured from the time at which exponential growth begins to occur-which could be as early as the first infection transmitted within the community if containment is not successful-not the time at which exponential growth is first measured. Regardless of the values of the parameters, i.e. regardless of the current value of R * , R * can be reduced by a number of interventions: • A reduction in travel from community c results in a proportional (linear) reduction in R c * through p c 0 and p c 1 . • Improvements in testing, contact tracing, and quarantine result in a proportional reduction in R c * through E[i c 0 ]. Such improvements also reduce r c and τ c . • The preemptive reduction of large events such as conferences (as well as general social distancing) 6 For instance, if the outbreak is contained such that exponential growth never occurs, the effective number of people infected by the traveler is zero, even if the traveler did infect some community members. 7 This estimate is obtained by dividing the 1.01 billion total passengers traveling by plane to, from, or within the U.S. in 2018 [19] by the 2018 U.S. population and the number of days in 2018, and then also dividing by 2 so that only flights out of and not into a community are counted. Using this estimate for p c 0 assumes that the probability that an infected individual will travel equals that of the general public. Dependence of R c * (the average number of communities to which community c will transmit the disease) on Tc (the time-delay before the social distancing measures are enacted). If R * (a weighted average of R c * ) is less than 1, the number of infected communities will exponentially decrease and the disease will be eliminated (and the smaller R * is, the faster it will be eliminated); otherwise, the number of infected communities will increase until saturation. reduces the probability of a super-spreader event as well as general transmission, reducing both E[i c 0 ] and r c . • Reductions in r c T c not only exponentially reduce R c * but also linearly reduce the amount of time for which social distancing measures must remain in place. • Augmenting social distancing measures (after time T c ) decreases τ c , which results in a proportional decrease in both R c * and the time for which the distancing measures must remain in place. We conclude with a few comments. First, without the timely implementation of aggressive social distancing measures, restricting travel from infected communities serves only to delay the spread of the outbreak. But when a reduction in travel is coupled with the social distancing measures, the travel reduction will not only delay the spread of the outbreak but will also in some cases be the deciding factor in determining whether or not the outbreak is eliminated. And if R * < 1 can be achieved without reducing travel, travel reductions can, by further reducing R * , greatly decrease the duration of the outbreak (and therefore also decrease the total number of illnesses and deaths). Second, the most important and easiest parameter to control is T c : the amount of time delay between the time the infection takes hold in the community and the time that aggressive social distancing measures are implemented. Since R c * grows exponentially with T c , it is of paramount importance that communities and governments act as soon as possible. Once the infection has taken hold in a community, exponential growth ensures that it is only a matter of time before the infection becomes widespread. Thus government action is inevitable, and delaying action not only linearly increases the expected total amount of time for which the distancing measures will need to remain in place but also exponentially increases the probability that another community will become infected or re-infected. Third, because R c * depends exponentially on T c , each additional increase in T c becomes increasingly costly. In other words, the longer a community has already waited to take aggressive social distancing measures, the more important it becomes to avoid further delay. It is important to note, however, that there is no advantage to delaying at all: immediately implementing aggressive social distancing measures as soon as there is evidence that the disease is actively spreading within the community will reduce the total amount of time for which such measures will need to be implemented, exponen-tially reduce the number of infections within the community, and exponentially reduce the probability of infecting or re-infecting another community. While the risk to individuals may be low in the early stages of a community outbreak, the collective risk at the community and global levels is extremely high. Finally, just as transmission within a community is an exponential process in which the need for immediate action is not always apparent (due to a deceptively small number of initial cases), the transmission between communities is also such an exponential process. At first the number of infected communities is deceptively small, but without rapid action this number exponentially grows. The sooner a set of communities decides to adopt a protocol that reduces R * below 1, the shorter the amount of time between the adoption of the protocol and the elimination of the disease. Epidemics with two levels of mixing Invasion threshold in heterogeneous metapopulation networks Working paper -model-based estimates of COVID-19 burden in King and Snohomish counties through How generation intervals shape the relationship between growth rates and reproductive numbers Impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand. Imperial College COVID-19 Response Team An introduction to complex systems science and its applications Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: Summary of a report of 72314 cases from the Chinese Center for Disease Control and Prevention Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study Early transmission dynamics in Wuhan, China of novel coronavirus-infected pneumonia COVID-19 coronavirus outbreak How much data do you need? An operational, pre-asymptotic metric for fat-tailedness Serial interval of novel coronavirus (COVID-19) infections. International The serial interval of COVID-19 from publicly reported confirmed cases. medRxiv Estimating the serial interval of the novel coronavirus disease (COVID-19): A statistical analysis using the public data in Hong Kong from Estimation of the time-varying reproduction number of COVID-19 outbreak in China Transmission interval estimates suggest presymptomatic spread of COVID-19. medRxiv Estimating the generation interval for COVID-19 based on symptom onset data The reproductive number of COVID-19 is higher compared to SARS coronavirus 2018 traffic data for U.S. airlines and foreign airlines U.S. flights