key: cord-1013146-d1hamzgk authors: Lovell-Read, Francesca A.; Funk, Sebastian; Obolski, Uri; Donnelly, Christl A.; Thompson, Robin N. title: Interventions targeting non-symptomatic cases can be important to prevent local outbreaks: SARS-CoV-2 as a case study date: 2021-05-19 journal: Journal of the Royal Society, Interface DOI: 10.1098/rsif.2020.1014 sha: 7289f76e8c272fbd6f16edf328bba85139c9e5c4 doc_id: 1013146 cord_uid: d1hamzgk During infectious disease epidemics, an important question is whether cases travelling to new locations will trigger local outbreaks. The risk of this occurring depends on the transmissibility of the pathogen, the susceptibility of the host population and, crucially, the effectiveness of surveillance in detecting cases and preventing onward spread. For many pathogens, transmission from pre-symptomatic and/or asymptomatic (together referred to as non-symptomatic) infectious hosts can occur, making effective surveillance challenging. Here, by using SARS-CoV-2 as a case study, we show how the risk of local outbreaks can be assessed when non-symptomatic transmission can occur. We construct a branching process model that includes non-symptomatic transmission and explore the effects of interventions targeting non-symptomatic or symptomatic hosts when surveillance resources are limited. We consider whether the greatest reductions in local outbreak risks are achieved by increasing surveillance and control targeting non-symptomatic or symptomatic cases, or a combination of both. We find that seeking to increase surveillance of symptomatic hosts alone is typically not the optimal strategy for reducing outbreak risks. Adopting a strategy that combines an enhancement of surveillance of symptomatic cases with efforts to find and isolate non-symptomatic infected hosts leads to the largest reduction in the probability that imported cases will initiate a local outbreak. Emerging epidemics represent a substantial challenge to human health worldwide [1] [2] [3] [4] . When cases are clustered in specific locations, two key questions arise: (i) will exported cases lead to local outbreaks in new locations? and (ii) which surveillance and control strategies in those new locations will reduce the risk of local outbreaks? Branching process models are used for a range of diseases to assess whether cases that are newly arrived in a host population will generate a local outbreak driven by sustained local transmission [5] [6] [7] [8] [9] [10] [11] . These models can also be used to predict the effectiveness of potential control interventions. For example, early in the coronavirus disease 2019 (COVID-19) pandemic, Hellewell et al. [12] used simulations of a branching process model to predict whether new outbreaks would fade out under different contact tracing strategies. Thompson [13] estimated the probability of local outbreaks analytically using a branching process model and found that effective isolation of infectious hosts leads to a substantial reduction in the outbreak risk. A factor that can hinder control interventions during any epidemic is the potential for individuals to transmit a pathogen while not showing symptoms. For COVID-19, the incubation period has been estimated to last approximately 5 or 6 days on average [14, 15] , and presymptomatic transmission can occur during that period [16] [17] [18] [19] [20] . In addition, asymptomatic infected individuals (those who never develop symptoms) also contribute to transmission [16, 21, 22] . Motivated by the need to assess the risk of outbreaks outside China early in the COVID-19 pandemic, we show how the risk that imported cases will lead to local outbreaks can be estimated using a branching process model. Unlike standard approaches for estimating the probability of a major epidemic analytically [23] [24] [25] [26] , non-symptomatic individuals are included in the model explicitly. By using a function that characterizes the efficacy of interventions for different surveillance efforts (denoted f (ρ, δ) in the model), we explore the effects of interventions that aim to reduce this risk. Under the assumption that detected infected hosts are isolated effectively, we consider whether it is most effective to dedicate resources to enhancing surveillance targeting symptomatic individuals, to instead focus on increasing surveillance for non-symptomatic individuals or to use a combination of these approaches. We show that, when surveillance resources are limited, the maximum reduction in the outbreak risk almost always corresponds to a mixed strategy involving enhanced surveillance of both symptomatic and non-symptomatic hosts. This remains the case even if the surveillance effort required to find non-symptomatic infected individuals is substantially larger than the effort required to find symptomatic individuals. This highlights the benefits of not only seeking to find and isolate symptomatic hosts but also dedicating resources to detecting non-symptomatic cases during infectious disease epidemics. We consider a branching process model in which infectious individuals are classified as asymptomatic (A), pre-symptomatic (I 1 ) or symptomatic (I 2 ). Hosts in any of these classes may generate new infections. The parameter ξ represents the proportion of new infections that are asymptomatic, so that a new infection either involves increasing A by one (with probability ξ) or increasing I 1 by one (with probability 1 − ξ). Pre-symptomatic hosts may go on to develop symptoms (transition from I 1 to I 2 ) or be detected and isolated (so that I 1 decreases by one). Symptomatic individuals (I 2 ) can be isolated (so that I 2 decreases by one) or be removed due to recovery or death (so that again I 2 decreases by one). Similarly, asymptomatic hosts may be detected and isolated or recover (so that A decreases by one in either case). A schematic showing the different possible events in the model is shown in figure 1a. The analogous compartmental differential equation model to the branching process model that we consider is given by A À nA, The parameters of the model, and the form of the function f (ρ, δ) that describes how the expected time to isolation is reduced for a given surveillance effort, are outlined below. In our model, the parameter β and its scaled counterparts αβ and ηβ represent the rates at which symptomatic, pre-symptomatic, and asymptomatic hosts generate new infections, respectively. Since we are modelling the beginning of a potential local outbreak, we assume that the size of the susceptible population remains approximately constant and do not track the depletion of this population. The parameter λ governs the rate at which pre-symptomatic individuals develop symptoms, so that the expected duration of the pre-symptomatic period is 1/λ days in the absence of interventions. Similarly, without interventions, The relationship between the surveillance intensification effort (ρ) and the proportional reduction in the expected time to isolation ( f (ρ, δ)), shown for different values of the parameter δ (solid lines). The parameter δ ∈ (0, 1) represents the upper bound of f(ρ, δ) (dotted lines). This general functional relationship between surveillance effort and isolation effectiveness is assumed to hold for surveillance of both non-symptomatic and symptomatic individuals, although non-symptomatic hosts are more challenging to detect than symptomatic hosts (ε < 1). royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20201014 the expected durations of the symptomatic and asymptomatic infectious periods are 1/μ days and 1/ν days, respectively. The baseline rate at which symptomatic individuals are detected and isolated is determined by the parameter γ. Assuming that non-symptomatic individuals are more difficult to detect than symptomatic individuals, we take the analogous quantity for nonsymptomatic hosts to be εγ, where the scaling factor ε < 1 reflects the fact that interventions targeting non-symptomatic hosts are likely to be less effective for the same surveillance effort. We assume that the sensitivity of surveillance is identical for presymptomatic and asymptomatic individuals and therefore use the same isolation rate for both of these groups. The parameters ρ 1 and ρ 2 represent the surveillance intensification effort targeted at non-symptomatic and symptomatic hosts, respectively. The function f(r, d) ¼ dr=ð1 þ rÞ governs the proportional reduction in the expected time to isolation for a given surveillance effort, ρ (for a similar approach in which the proportion of infectious cases prevented is assumed to be a function of control effort, see Matthews et al. [27] ). The functional form of f(ρ, δ) is chosen for three main reasons. First, it generates a reduced expected time to isolation when the surveillance effort increases. Second, since the proportional reduction in the expected time to isolation is bounded above by the parameter δ ∈ (0, 1), the isolation rate saturates and cannot increase indefinitely. Third, the gradient @f=@r decreases with the surveillance effort ρ, meaning that an increase in the surveillance effort has a greater impact at low surveillance efforts compared to when this effort is already large [27] . The function f (ρ, δ) is shown in figure 1b for different values of the parameter δ. The basic reproduction number, R 0 , represents the expected number of secondary infections generated by a single infected individual introduced at the start of their infection into a fully susceptible population in the absence of intensified surveillance: This expression is the sum of the expected number of transmissions from a host who begins in the asymptomatic class and from a host who begins in the pre-symptomatic infectious class, weighted by the respective probabilities ξ and 1 − ξ that determine the chance that the host experiences a fully asymptomatic course of infection. The expected number of transmissions from a host who begins in the pre-symptomatic infectious class comprises transmissions occurring during the incubation period and transmissions occurring during the symptomatic period, accounting for the possibility that the host is isolated before developing symptoms. The proportion of infections arising from pre-symptomatic hosts in the absence of intensified surveillance is then given by and the equivalent quantity for asymptomatic hosts is given by Since this study was motivated by the need to estimate outbreak risks outside China in the initial stages of the COVID-19 pandemic, we used a baseline set of parameter values in our analyses that was informed by studies conducted during this pandemic (table 1) . Where possible, these parameter values were obtained from the existing literature. However, we also performed sensitivity analyses to determine how our results varied when the parameter values were changed (see electronic supplementary material, text S3 and figures S3-S12). In table 1, and throughout, rounded values are given to three significant figures. The value of the parameter governing the baseline rate at which symptomatic individuals are isolated, γ, was chosen to match empirical observations, which indicate that individuals who seek medical care before recovery or death do so around 4-6 days after symptom onset [35] . Specifically, we assumed that the period of time to the first medical visit could be used a proxy for the time to isolation, and chose γ so that the expected time period to isolation conditional on isolation occurring during the symptomatic period was given by 1=ðg þ mÞ ¼ 4:6 days [35] . This is different to the time period that we refer to as the expected time to isolation for symptomatic hosts, which is 1/γ days (see Methods). For stochastic simulations of compartmental epidemiological models starting from a small number of hosts infected initially, there are generally two qualitatively different types of behaviours. The pathogen may fade out rapidly, or case numbers may begin to increase exponentially (only starting to fade out once the number of susceptible individuals has been sufficiently depleted, unless public health measures are introduced to reduce transmission). Consequently, running many simulations of those types of model with R 0 larger than but not close to one, the epidemic size is distributed bimodally, with the total number of individuals ever infected falling into one of two distinct ranges (for a simple example, see electronic supplementary material figure S1A; see also refs. [39] [40] [41] ). In that scenario, a natural definition for the probability of a local outbreak is therefore the proportion of outbreak simulations for which the total number of infected individuals falls into the higher of these two ranges. Here, since we are considering the initial phase of potential local outbreaks, we instead considered a branching process model in which depletion of susceptibles was not accounted for. If simulations of branching process models are run, then in each simulation, the pathogen either fades out with few infections or case numbers generally increase indefinitely. The probability of a local outbreak starting from a small number of infected hosts then corresponds to the proportion of simulations in which the pathogen does not fade out quickly and case numbers increase indefinitely instead. This again provides a natural definition of a local outbreak since simulations can be partitioned into two distinct sets (for an example in which simulations of a simple branching process model are used to calculate the probability of a local outbreak, see electronic supplementary material, figure S1B). As an alternative to repeated simulation, we instead use our branching process model (figure 1a) to perform analytic calculations of the probability that a single imported infectious host initiates a local outbreak. To do this, we denote the probability of a local outbreak not occurring, starting from i pre-symptomatic hosts, j symptomatic hosts, and k asymptomatic hosts, by q i,j,k . Starting from one pre-symptomatic host (so that i = 1 and j = k = 0), there are four possibilities for the next event. That host could: royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20201014 These probabilities are obtained by considering the rates at which different possible events occur in the branching process model. Pre-symptomatic hosts generate new infections at rate αβ, and these new infections occur in asymptomatic and presymptomatic hosts with probabilities ξ and 1 − ξ, respectively. Therefore, starting from a single pre-symptomatic host, new asymptomatic infections occur at rate ξαβ, while new pre-symptomatic infections occur at rate (1 − ξ)αβ. In addition, presymptomatic hosts develop symptoms at rate λ, and are isolated at rate eg=ð1 À f(r 1 ,d)Þ. The overall rate at which events occur is the sum of these individual event rates: For each of the four possible next events ((i)-(iv), earlier), the probability that event occurs next is the individual rate at which that event occurs divided by the total event rate, leading to the expressions given. We use these probabilities to condition on the event that occurs next in the branching process, following the introduction of a single pre-symptomatic infectious individual into the population. If that event is the generation of a new asymptomatic infection, which occurs with probability jab=ðab þ l þ 1g=ð1 À f(r 1 ,d)ÞÞ, the probability that a local outbreak subsequently does not occur is q 1,0,1 . Applying analogous reasoning to the other possible events, we obtain If there are no infectious hosts present in the population (i.e. i = j = k = 0), then a local outbreak will not occur and so q 0,0,0 = 1. Assuming that transmission chains arising from two infectious individuals are independent gives q 1,0,1 = q 1,0,0 q 0,0,1 and q 2,0,0 ¼ q 2 1,0,0 . Hence, q 1,0,0 ¼ ajq 1,0,0 q 0,0,1 þ a(1 À j)q 2 1,0,0 þ bq 0,1,0 þ (1 À a À b), ð2:3Þ where a ¼ ab=ðab þ l þ 1g=ð1 À f(r 1 ,d)ÞÞ and b ¼ l=ðab þ lþ 1g=ð1 À f(r 1 ,d)ÞÞ. Similarly, considering the probability of a local outbreak failing to occur starting from a single symptomatic host gives As before, noting that q 0,0,0 = 1 and assuming that different infection lineages are independent leads to Finally, considering the probability of a local outbreak failing to occur starting from a single asymptomatic host gives Equations (2.3), (2.4) and (2.5) may be combined to give a single quartic equation for q 0,0,1 , yielding four sets of solutions for q 1,0,0 , q 0,1,0 and q 0,0,1 (see electronic supplementary material, text S1). It is straightforward to verify that q 1,0,0 = q 0,1,0 = q 0,0,1 = 1 is always a solution, and further solutions can be found numerically. The appropriate solution to take is the minimal non-negative real solution q 1,0,0 ¼ q à 1,0,0 , q 0,1,0 ¼ q à 0,1,0 , q 0,0,1 ¼ q à 0,0,1 (see electronic supplementary material, text S1). Then, the probability of a local outbreak occurring beginning from a single pre-symptomatic host is given by with equivalent expressions holding for p 0,1,0 and p 0,0,1 (the probability of a local outbreak occurring beginning from a single symptomatic host or a single asymptomatic host, respectively). Throughout, we consider the probability p of a local outbreak starting from a single non-symptomatic host entering the population, accounting for the possibility that the non-symptomatic host is either pre-symptomatic or asymptomatic: We considered the effect of R 0 and the duration of the presymptomatic and asymptomatic periods on the probability of a local outbreak when a non-symptomatic host enters a new host population (figure 2). We examined pre-symptomatic periods of length 1/λ = 1 day, 1/λ = 2 days and 1/λ = 4 days; in each case, the duration of the asymptomatic period (1/ν days) was adjusted so that the relative proportion of infections arising from asymptomatic hosts compared to pre-symptomatic hosts remained fixed (K a /K p = 0.218, as in the baseline case). If instead non-symptomatic infections are not accounted for, the infectious period follows an exponential distribution and the probability of a local outbreak is given by p = 1 − 1/R 0 (red dash-dotted line in figure 2a) . Including non-symptomatic infection in the model therefore led to an increased risk of a local outbreak in the absence of surveillance intensification (figure 2a). We then considered the dependence of the probability of a local outbreak on the intensity of surveillance targeting non-symptomatic and symptomatic hosts ( figure 2b-d) . The maximum value of the surveillance intensification effort that we considered (given by ρ 1 or ρ 2 values of 20) corresponded to a 76% reduction in the expected time to isolation (blue line in figure 1b) , i.e. a 76% reduction in 1/εγ or 1/γ. The length of the pre-symptomatic and asymptomatic periods significantly affected the dependence of the probability of a local outbreak on the level of surveillance targeted at non-symptomatic and symptomatic hosts. In figure 2b , in which the duration of the pre-symptomatic period was 1 day, increasing surveillance targeted at nonsymptomatic hosts (ρ 1 ) had a limited effect on the probability of a local outbreak, while increasing surveillance targeted at symptomatic hosts (ρ 2 ) had a more significant effect. For example, increasing the surveillance effort targeted at nonsymptomatic hosts to ρ 1 = 5 (a 67% reduction in the time to isolation) only reduced the probability of a local outbreak from 0.730 to 0.716, whereas the equivalent effort targeted at symptomatic hosts (ρ 2 = 5) reduced the probability to 0.630. As shown in figure 3c,d, however, when the pre-symptomatic and asymptomatic periods were longer, the benefit of directing surveillance resources towards detecting non-symptomatic individuals increased. This was because longer presymptomatic and asymptomatic periods increased the proportion of infections generated by non-symptomatic individuals (K p + K a , see eqns (2.1) and (2.2)); a pre-symptomatic period of 1 day, 2 days and 4 days corresponded to values of K p + K a equal to 0.424, 0.595 and 0.746, respectively. We next considered in more detail the impact of surveillance targeted at non-symptomatic hosts (ρ 1 ) relative to the impact of surveillance targeted at symptomatic hosts (ρ 2 ). For our baseline parameter values, we considered the probability of a local outbreak starting from a single imported non-symptomatic individual for a range of values of ρ 1 and ρ 2 . We calculated the steepest descent contours (white lines in figure 3a) numerically using a gradient maximization approach, in which at each point the contour direction was determined by minimizing the local outbreak probability over a fixed search radius (see electronic supplementary material, text S2 and figure S2 ). These contours indicate how ρ 1 and ρ 2 should be altered to maximize the reduction in the probability of a local outbreak. In this case, enhancing surveillance targeting both symptomatic and nonsymptomatic hosts is always optimal (the steepest descent contours are neither horizontal nor vertical). We then considered a scenario in which, at any time, it is only possible to direct resources towards enhancing surveillance of either non-symptomatic individuals or symptomatic individuals (e.g. antigen testing of non-symptomatic contacts of known infectious individuals, or screening for symptomatic individuals at public events). In figure 3b , the blue region represents values of ρ 1 and ρ 2 for which enhancing surveillance targeting symptomatic hosts (i.e. increasing ρ 2 ) leads to a larger reduction in the local outbreak probability than enhancing surveillance targeting non-symptomatic hosts (i.e. royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20201014 increasing ρ 1 ). In contrast, in the green region, enhancing surveillance of non-symptomatic individuals is more effective than enhancing surveillance of symptomatic individuals. The white line represents the steepest descent contour starting from ρ 1 = ρ 2 = 0, under the constraint that surveillance can be enhanced only for symptomatic or non-symptomatic hosts at any time. Practical deployment of surveillance is often subject to logistical constraints, and policy makers may wish to design surveillance strategies to achieve a specific objective-for example, to maximize the effectiveness of limited resources or to minimize the cost of achieving a desired outcome. We therefore also considered the following two examples of such objectives. 3.2.1. Objective 1: minimize the probability of a local outbreak for a fixed total surveillance effort First, we considered the question: given a fixed maximum surveillance effort (ρ 1 + ρ 2 = C), how should surveillance be targeted at non-symptomatic and symptomatic hosts? This involves setting the values of ρ 1 and ρ 2 to minimize the local outbreak probability. The optimal strategies in this case are shown in figure 3c. The red dotted lines represent contours along which the total surveillance effort ρ 1 + ρ 2 is held constant (i.e. different values of C). On each contour, the red circle indicates the point at which the local outbreak probability is minimized. If surveillance resources are increased (i.e. C increases), a further question is how surveillance should then be increased. In figure 3c , the white line represents the contour of steepest descent, under the constraint that the total change in surveillance effort (ρ 1 + ρ 2 ) is held constant at each step (rather than a constant search radius, as shown in figure 3a-for more details, see electronic supplementary material, text S2 and figure S2 ). This contour coincides exactly with that shown in figure 3b . These results indicate that if surveillance resources are such that C is greater than 2.8 (corresponding to a 59% reduction in the time to isolation of symptomatic hosts), the optimal surveillance strategy involves both enhanced surveillance of symptomatic individuals and non-symptomatic individuals (the red dots correspond to strictly positive values of both ρ 1 and ρ 2 , unless C is less than 2.8). royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20201014 3.2.2. Objective 2: minimize the total surveillance effort to achieve a pre-specified reduction in the probability of a local outbreak Second, we considered the question: given a pre-specified acceptable risk level (i.e. probability of a local outbreak), how should the surveillance level targeted at non-symptomatic and symptomatic hosts be chosen? This involves choosing ρ 1 and ρ 2 to minimize ρ 1 + ρ 2 along a given contour corresponding to a fixed local outbreak probability (red dotted lines in figure 3d ). On each contour, the red circle indicates the point along that contour at which the total surveillance effort ρ 1 + ρ 2 is minimized. These optimal points also lie exactly along the line on which enhancing surveillance targeted at symptomatic hosts is equally effective compared to enhancing surveillance targeted at non-symptomatic hosts. As long as the target local outbreak probability is less than 0.69, optimal surveillance involves enhanced surveillance of non-symptomatic individuals as well as symptomatic individuals. For example, to reduce the local outbreak probability to 0.6, the optimal approach is to deploy resources such that ρ 1 = 12.4 (a 74% reduction in the time to isolation of nonsymptomatic individuals) and ρ 2 = 18.0 (a 76% reduction in the time to isolation of symptomatic individuals). Plots analogous to figure 3d in which the parameters were varied from their baseline values are shown in electronic supplementary material, figures S3-S12. In each case that we considered, our main finding remained unchanged. There always exists a threshold local outbreak probability such that, if the target local outbreak probability is below this threshold, the optimal strategy for further reduction in the local outbreak probability involves enhancing surveillance targeting both non-symptomatic and symptomatic individuals. Values of ρ 1 and ρ 2 for which increasing surveillance for non-symptomatic hosts (i.e. increasing ρ 1 ) is more effective at reducing the local outbreak probability than increasing surveillance for symptomatic hosts (i.e. increasing ρ 2 ) (green region) and vice versa (blue region). The white line represents the steepest descent contour starting from ρ 1 = ρ 2 = 0, under the constraint that surveillance can only be enhanced for either symptomatic or non-symptomatic hosts at any time. The diagonal section of the steepest descent contour is made up of small horizontal and vertical sections. (c) Strategies for minimizing the local outbreak probability for a given fixed total surveillance effort (ρ 1 + ρ 2 = C). Red dotted lines indicate contours on which ρ 1 + ρ 2 is constant, and red circles indicate the points along these contours at which the local outbreak probability is minimized. The white line indicates the optimal surveillance enhancement strategy if the maximum possible surveillance level (i.e. the maximum value of ρ 1 + ρ 2 = C) is increased. (d ) Strategies for minimizing the surveillance effort required to achieve a pre-specified risk level (an 'acceptable' local outbreak probability). Red dotted lines indicate contours of constant local outbreak probability (i.e. lines on which the probability of a local outbreak takes the values shown); red circles indicate the points along these contours at which the total surveillance effort ρ 1 + ρ 2 is minimized. The white line indicates the optimal strategy to follow if the pre-specified risk level is increased or reduced. royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20201014 A key component of infectious disease epidemic management is inferring the risk of outbreaks in different locations [5] [6] [7] [8] 11, 41, 42] . Surveillance and control strategies can be introduced to reduce the risk that imported cases will lead to local outbreaks [12, 13, [43] [44] [45] [46] . However, for a range of pathogens, public health measures are hindered by nonsymptomatic infectious hosts who can transmit the pathogen yet are challenging to detect [16, 42, 44, [47] [48] [49] . Here, we showed how the probability of a local outbreak can be estimated using a branching process model that accounts for transmission from non-symptomatic infected individuals (figure 1). The model can be used to assess the local outbreak probability for different surveillance strategies that target non-symptomatic or symptomatic hosts (figure 2). Previous studies have shown that detection of nonsymptomatic infections can be a key component of epidemic forecasting [42] and containment [44] and have demonstrated the benefits of identifying and isolating infectious nonsymptomatic hosts to reduce transmission [16, 17] . We focused instead on investigating how surveillance should be targeted at non-symptomatic or symptomatic hosts to reduce the probability that cases imported to new locations will trigger a local outbreak ( figure 3a,b) . We also showed how the optimal surveillance level targeting these two groups can be assessed when surveillance resources are limited and policy makers have specific objectives (figure 3c,d). In each case, our main conclusion was that surveillance for non-symptomatic infected hosts (ρ 1 > 0) can be an important component of reducing the local outbreak risk during epidemics. This result has broad implications, and our analysis could be extended to assess the potential for containing outbreaks at their source using a range of specific interventions targeting symptomatic and non-symptomatic hosts. Our goal here was to use the simplest possible model to explore the effects of surveillance of non-symptomatic and symptomatic individuals on the risk of local outbreaks. However, this model is not without its limitations. One area of uncertainty is the precise values of the parameters governing pathogen transmission and control. In this article, we chose a baseline set of parameter values that is consistent with the findings of studies conducted during the COVID-19 pandemic, although constructing a detailed transmission model for this pandemic was not our main focus. For example, we set the relative rates at which pre-symptomatic and asymptomatic individuals generate new infections compared to symptomatic individuals so that 48.9% of transmissions arise from pre-symptomatic infectors and 10.6% arise from asymptomatic infectors [16] . While this is in line with reported estimates [50, 51] , there is substantial variation between studies. Similarly, the proportion of individuals who experience a fully asymptomatic course of infection (denoted by ξ in our model) is subject to a considerable degree of uncertainty. Here, we chose ξ = 0.2 as the baseline value [32] [33] [34] , but estimates in the literature range from 0.04 to over 0.8 [33, [52] [53] [54] . We therefore also conducted sensitivity analyses in which we explored a range of different values of model parameters (electronic supplementary material, text S3 and figures S3-S12). In each case that we considered, our main conclusion was unchanged: surveillance of non-symptomatic individuals can contribute to reducing the risk of local outbreaks. This result is expected to hold for epidemics of any pathogen for which non-symptomatic individuals contribute significantly to transmission. For our modelling approach to be used to make precise quantitative predictions during epidemics, it would be necessary to update the model to include the range of different specific surveillance and control interventions that are in place. For example, detection of non-symptomatic infected individuals is facilitated by contact tracing and antigen testing, which are carried out routinely during epidemics and can be included in models explicitly [12, 44, 55, 56] . Reductions in contacts due to social distancing strategies and school or workplace closures could also be accounted for [57, 58] , although such interventions are often introduced after a local outbreak has begun rather than in the initial phase of a potential local outbreak as considered here. We modelled the level of surveillance targeted at non-symptomatic and symptomatic hosts in a simple way using a function describing the relationship between surveillance effort and effectiveness (figure 1b). We assumed that this general functional relationship could be applied to interventions targeting both symptomatic and non-symptomatic hosts, accounting for logistical differences in the ease of targeting either group by scaling the effectiveness of surveillance for non-symptomatic hosts using the parameter ε (results are shown for different values of ε in electronic supplementary material, figure S8 ). In principle, it would be possible to include entirely different functional forms describing the relationship between surveillance effort and effectiveness for strategies targeting symptomatic and non-symptomatic individuals, and these could be tailored to the effects of particular interventions. If different public health measures are included in the model explicitly, then it would be possible to increase the accuracy of assessments of the relative public health benefits of specific interventions that only target symptomatic individuals (e.g. screening for passengers with heightened temperatures at airports [59, 60] ) compared to interventions that also target non-symptomatic hosts (e.g. travel bans or quarantine of all inbound passengers [61, 62] ). Of course, this would require data from which the relative effectiveness of different measures could be inferred. The underlying transmission model could also be extended to include additional realism in several ways. Transmission dynamics are influenced by marked heterogeneities in the patterns of contacts between individuals in different age groups [63, 64] , and, for SARS-CoV-2, susceptibility to infection, the likelihood of developing symptoms, and the average severity of those symptoms increase with age [65, 66] . Age-dependent variation in the proportion of asymptomatic cases in particular implies that the optimal balance of surveillance between symptomatic and nonsymptomatic hosts may differ between age groups. An age-structured version of the model presented here is a focus of our ongoing research. Similarly, for a range of infectious diseases, the distribution characterizing the number of secondary infections generated by each infected host (the offspring distribution) exhibits a high degree of overdispersion [67] [68] [69] [70] . For a fixed value of R 0 , a higher degree of overdispersion increases the likelihood that initial cases will fade out without leading to a local outbreak [71, 72] and suggests that greater reductions in local outbreak risks could theoretically be achieved for the same surveillance effort if potential superspreaders or superspreading events can be identified and targeted. royalsocietypublishing.org/journal/rsif J. R. Soc. Interface 18: 20201014 Despite the necessary simplifications, we have shown how the risk of local outbreaks can be estimated during epidemics using a branching process model that includes nonsymptomatic infectious hosts explicitly. Determining the extent to which non-symptomatic individuals contribute to transmission is essential early in emerging epidemics of a novel pathogen. As we have shown, if transmissions occur from non-symptomatic infectors, dedicating surveillance resources towards finding non-symptomatic cases can be an important component of public health measures that aim to prevent local outbreaks. Ethics. This article does not present research with ethical considerations. Data accessibility. The computer code used to perform the analyses in this article is available as part of the supplementary material [73]. All computer code was written in the MATLAB programming environment (version R2019a). Infectious disease threats in the twenty-first century: strengthening the global response Emerging infectious diseases: threats to human health and global stability Addressing challenges for clinical research responses to emerging epidemics and pandemics: a scoping review Detection, forecasting and control of infectious disease epidemics: modelling outbreaks in humans, animals and plants Ebola virus disease outbreak in Nigeria: transmission dynamics and rapid control 2020 Risk assessment of novel coronavirus COVID-19 outbreaks outside Estimating COVID-19 outbreak risk through air travel Branching process approach for epidemics in dynamic partnership network Assortativity and the probability of epidemic extinction: a case study of pandemic influenza A Sustained transmission of Ebola in new locations: more likely than previously thought Increased frequency of travel in the presence of cross-immunity may act to decrease the chance of a global pandemic Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts Novel coronavirus outbreak in Wuhan, China, 2020: intense surveillance is vital for preventing sustained transmission in new locations 2020 The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing 2020 The implications of silent transmission for the control of COVID-19 outbreaks Evidence for transmission of COVID-19 prior to symptom onset Potential presymptomatic transmission of SARS-CoV-2 Presymptomatic transmission of SARS-CoV-2-Singapore Presumed asymptomatic carrier transmission of COVID-19 2020 Evidence supporting transmission of severe acute respiratory syndrome coronavirus 2 while presymptomatic or asymptomatic On the spread of a disease with gamma distributed latent and infectious periods The relevance of stochastic models for large-scale epidemiological phenomena Estimating the probability of a major outbreak from the timing of early cases: an indeterminate problem? PLoS ONE 8, e57878 Stochasticity and heterogeneity in host-vector models Neighbourhood control policies and the spread of infectious diseases Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease-2019 (COVID-19): the epidemic and the challenges. royalsocietypublishing.org/journal/rsif 2020 The reproductive number of COVID-19 is higher compared to SARS coronavirus 2020 The epidemiology, diagnosis and treatment of COVID-19 Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: a datadriven analysis in the early phase of the outbreak 2020 Occurrence and transmission potential of asymptomatic and presymptomatic SARS-CoV-2 infections: a living systematic review and metaanalysis Estimating the extend of asymptomatic COVID-19 and its potential for community transmission: systematic review and meta-analysis Proportion of asymptomatic coronavirus disease 2019: a systematic review and meta-analysis Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia Presymptomatic SARS-CoV-2 infections and transmission in a skilled nursing facility Predicting infectious severe acute respiratory syndrome coronavirus 2 from diagnostic samples Virological assessment of hospitalized patients with COVID-2019 The threshold concept in stochastic epidemic and endemic models Stochastic epidemic models: a survey 2020 Will an outbreak exceed available resources for control? Estimating the risk from invading pathogens using practical definitions of a severe epidemic Detecting presymptomatic infection is necessary to forecast major epidemics in the earliest stages of infectious disease outbreaks Relationship between clinical signs and transmission of an infectious disease and the implications for control Factors that make an infectious disease outbreak controllable Global infectious disease surveillance Internet-based surveillance systems for monitoring emerging infectious diseases Pandemic (H1N1) 2009 transmission during presymptomatic phase Viral shedding and transmission potential of asymptomatic and paucisymptomatic influenza virus infections in the community Contributions from the silent majority dominate dengue virus transmission Temporal dynamics in viral shedding and transmissibility of COVID-19 Evidence for pre-symptomatic transmission of coronavirus disease 2019 (COVID-19) in China Proportion of asymptomatic infection among COVID-19 positive persons and their transmission potential: a systematic review and meta-analysis Towards an accurate and systematic characterisation of persistently asymptomatic infection with SARS-CoV-2 2020 Prevalence of asymptomatic SARS-CoV-2 infection: a narrative review The effectiveness of contact tracing in emerging epidemics A model of the 2014 Ebola epidemic in West Africa with contact tracing Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: a modelling study Epidemiological models are important tools for guiding COVID-19 interventions Fever screening during the influenza (H1N1-2009) pandemic at Narita International Airport Effectiveness of airport screening at detecting travellers infected with novel coronavirus (2019-nCoV) The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak 2020 Strategies at points of entry to reduce importation risk of COVID-19 cases and reopen travel Projecting social contact matrices in 152 countries using contact surveys and demographic data Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China Age-related morbidity and mortality among patients with COVID-19 Age-dependent effects in the transmission and control of COVID-19 epidemics Estimating the overdispersion in COVID-19 transmission using outbreak sizes outside China 2020 Characterizing superspreading events and age-specific infectiousness of SARS-CoV-2 transmission in Georgia Heterogeneities in the transmission of infectious agents: implications for the design of control programs Key questions for modelling COVID-19 exit strategies Superspreading and the effect of individual variation on disease emergence 2021 Interventions targeting nonsymptomatic cases can be important to prevent local outbreaks: SARS-CoV-2 as a case-study. Figshare. royalsocietypublishing.org/journal/rsif Acknowledgements. Thank you to Andrew Wood and Charles Tolkien-Gillett for proofreading.