key: cord-0873569-huvl2cim authors: Teimouri, A. title: An SEIR Model with Contact Tracing and Age-Structured Social Mixing for COVID-19 outbreak date: 2020-07-07 journal: nan DOI: 10.1101/2020.07.05.20146647 sha: f41c324711838fa2048f13701993f974ccf8c654 doc_id: 873569 cord_uid: huvl2cim In December 2019 a severe acute respiratory syndrome now known as SARS-CoV-2 began to surge in Wuhan, China. The virus soon spread throughout the world to become a pandemic. Since the outbreak various measures were put in place to contain and control the spread, these interventions were mostly based on compartmental models in epidemiology with the main goal of controlling and monitoring the rate of the basic and effective reproduction number. In this paper, we propose an SEIR model where we incorporate contact tracing and age-structured social mixing. We show the explicit relation between contact tracing and social mixing and other relevant parameters of the proposed model. We derive a formula for the effective reproduction number which is expressed in terms of reported cases, tracing quantities and social mixing. We use this formula to determine the expectation value of the effective reproduction number in London, UK. COVID-19 is a fast-spreading infectious disease which is caused by the novel coronavirus SARS-COV-2 [1, 2] . The rapid rate of the spread and lack of preparation soon translated to a pandemic causing major disruption globally [3, 4] . From the early days of the spread, many well-established models were proposed to contain and control the outbreak. These models were mainly the extensions of the simple SIR model [5] - [11] where the population is being divided into susceptible, infected and recovered compartments. Various parameters can be incorporated into the models to study and analyse the effect of intervention methods and also to predict the spread of the infectious disease. In this paper, we propose an SEIR model integrated with contact tracing and age-structured social mixing. Contact tracing is an important intervention method when it comes to controlling an infectious outbreak [12] . As the name suggests it involves tracing the contacts of a reported infectious individual and remove them into isolation if needed, so further spread would be prevented. There are numerous methods of contact tracing and this intervention strategy proved to be effective in controlling the COVID-19 outbreak in countries such as China, South Korea, Taiwan and Japan [14] - [16] . In this study, we adopt contact tracing appropriate to the characteristics of the COVID-19. It is now established that infected individuals with the COVID-19 are distinguished into two groups of asymptomatic and symptomatic [17] . The asymptomatic individuals are infectious individuals who show no symptoms while symptomatic individuals are those who show symptoms, and hence are reported and might need hospitalisation. To this end, our model traces the contact of those individuals who are symptomatic and reported. The rationale of the model is that if the the traced contacts show symptoms they will be removed into isolation, and if they do not show symptoms upon tracing they will be monitored by healthcare workers or via a mobile application for the recommended time of two weeks by the WHO [12] . Another feature that we integrate into our model is age-structured social mixing matrix. This component of the model is important when it comes to effectively classifying the risk within different age groups in a society. One of the known characteristics of the COVID-19 is that it affects mid-age and elderly people more severely than those who are in the young age group [18] . As a result, to implement interventions such as social-distancing within households or at workplaces, it is important to know the mixing between different age groups. In this work, we show the theoretical scaling effect of the social mixing matrix on the value of the effective reproduction number. We begin by giving an overview of the SEIR model in section 2.1, we then incorporate contact tracing and social mixing respectively into our model in sections 2.2 and 2.3. We then analyse our results in section 3 and provide a deterministic formula for keeping the effective reproduction number below one. We calculate the change in the effective reproduction number when contact tracing and social mixing are involved. In section 3.1 we provide an example where we determine the value of the expected effective reproduction number for London, United Kingdom (UK). We begin by considering a simple and adequate SEIR compartmental model which captures the essence of the spread of the COVID-19 through a homogenous population. In this model, the population N is the the sum of Susceptible (S), Exposed (E), Symptomatic (I s ), Asymptomatic (I a ) and Recovered (R) individuals (i.e. N = S + E + I s + I a + R). By definition, exposed individuals are those incubating the disease and will be infected by the symptomatic and asymptomatic individuals, symptomatic individuals are those who are infected, show symptoms and will be reported or hospitalised, asymptomatic infected individuals are those who have mild or no symptoms and hence will not be reported and do not require hospitalisation. Both infected groups will eventually move into the recovered compartment, which includes recovered and dead cases. The system of the ordinary differential equations (ODE) which describe the dynamics of this model is, The ODEs are parameterised by β which is the transmission rate for infectious individuals, μ that is the multiplicative factor reducing the transmission rate of the asymptomatic infected patients, τ , the mean incubation period in days, α, the fraction of the reported/hospitalised cases, T s , the time from symptom onset until hospitalisation/isolation or reporting in days and T a , the mean infectious period in days for the asymptomatic individuals. Note that T s is the same for the cases which are hospitalised, go to isolation or just being reported. The distinction of infected individuals into two groups is essential in this work as in reality only the symptomatic cased are being recorded and hence can trigger contact tracing [31] . Note that there are studies that take T s and T a as equal [32] . The basic reproduction number R 0 is defined as average number of secondary infections produced by an infected individual in a population where everyone is susceptible, this number is used to measure the transmission potential of a disease. In a realistic world a population can rarely be wholly susceptible. Some individuals can be immune, for instance due to prior infection. As a result, not all contacts will become infected and the average number of secondary cases per an infectious case will be lower than the basic reproduction number. To this end, the effective reproduction number R e is defined as the average number of secondary cases in a population made up of both susceptible and non-susceptible individuals. The effective reproduction number is proportional to the product of the basic reproduction number and the fraction of the host population that is susceptible. Using the next generation matrix method [19] the effective reproduction number, R e , is calculated by where the matrix T corresponds to transmissions and the matrix Σ corresponds to transitions. Note that all the epidemiological events that lead to new infections are incorporated in the model via T and the rest of the events via Σ, hence progress to either dead or recovery makes Σ invertible. We obtain K L as, the dominant eigenvalue of this matrix is equal to the effective reproduction number, where R 0 = βT a is defined as the basic reproduction number without cases being reported or hospitalised. We assumed that the population size N is constant over the short time of the outbreak and hence we absorb it into parameter β (i.e. we do not consider birth and death rates). Furthermore, we take the total population N = 1 so the results would be per-capita. We note that for T s = T a and μ = 1 the relation in the above equation would reduce to R e = R 0 where the assumption is that the basic reproduction number is only proportional to transmission rate and the mean infectious period. Namely this ratio would be the expected number of new infections from a single infection in a population where all individuals are susceptible. To incorporate contact tracing into our model we follow the steps of [5] . Contact tracing in this manner is based on three elements, contact identification, contact listing and follow-up. This process begins once an infected (dead or alive) case is being reported, at this stage one lists the contacts of the reported person, then a team of contact tracers track the listed contacts (this could also be done via a mobile application). If the traced contacts show symptoms at the time of tracking they will be removed by isolation or quarantine. Otherwise, they will be instructed to update their status if they develop any symptoms over 14 days. The follow-up team would then look after the traced contacts during this period and will remove them if symptoms appear. There are two assumptions to proceed with this model: • Only the reported or hospitalised cases can trigger contact tracing. • Contact of a reported patient are traced with probability φ. There is also a fixed delay δ between index case reporting and contact tracking. If the traced individuals are symptomatic at the time of contact, they will be removed immediately, otherwise they follow up the monitoring protocol. Untraced contacts have probability (1 − φ), these are the cases that are missed and did not trigger tracing from the reported cases. We shall now define the cumulative number, at time t, of traced infected individuals who are in exposed or infectious group when first tracked, where: i is the rate of reporting of infectious individuals, ii is the reported case infectious period probability density, iii is the integral over infectious period of length z, iv is the probability still incubating t − r + δ units after infection and v is the traced transmissions infected at r ∈ [t − z, t]. We assumed that the incubation period cumulative distribution F E is exponentially distributed. Since the average infectious period is short the the change in population size is negligible and one assumes that S is approximately constant over this period. Hence, the total number of the traced transmissions at time t + δ is, (2.7) We now define four additional compartments, E e c are the exposed individuals who will be traced and incubating when they were tracked, E i c are the exposed individuals who will be traced and infectious when tracked. I e c are the monitored (and under monitoring) infectious individuals who have shown no symptoms (while incubating) at the time of first contact, and I i c are the infectious individuals who should be removed as they were found symptomatic at the time of first contact. The probability p e of a traced contact being incubating when first tracked should depend on the mean infectious period T s , the delay between index case reporting and tracing δ, and the incubation period of the traced case τ , as a result: for t → τ . Note that we assumed the incubation period is exponentially distributed i.e. F E (x) = e −x/τ , and that S(t) is constant over the infectious period. The average infectious period, prior to being tracked, for the fraction (1 − p e ) individuals is defined by T i and can be approximated by the expected value of the time after symptom onset when the contact is being tracked,T i , was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which this version posted July 7, 2020. . We assume monitored infectious individuals, I e c have transmissibility β m and mean infection period T m in days. The value of β m and T m directly depends on the efficiency of the contact tracing protocol. Note that for the infectious individuals, I i c , traced during their infectious period, we assign transmissibility β prior to tracking and mean infectious period T i . Contact tracing which is triggered by individuals in I e c and I i c compartments counts as secondary tracing, tertiary tracing and so on and thus we call them higher order tracing. In contrast, the first order tracing are the contacts of the fraction φ who will be traced. We shall then define probabilities p e e and p i e as probability of traced contact, infected by I e c individuals, and probability of traced contact, infected by I i c individuals, as where we remind that T m is the mean infectious period for the monitored infectious individuals, I e c , (time between symptom onset and isolation) and T i is the mean infectious period (prior to being tracked) for the contact traced infectious individuals, I i c . The modified SEIR model with contact tracing is, In the same manner as in the previous section the effective reproduction number can be obtained by, 13) where the dominant eigenvalue of this matrix is, as before R 0 = βT a is the basic reproduction number independent of T s . α is the fraction of untraced cases which are reported and φ is the fraction of reported cases which are traced. The main difference of the above equation and Eq.(2.4) is the probability φ which measures the likelihood of contacts of a reported case being traced. In the limit φ → 0 we recover Eq. (2.4) where there is no contact tracing. To take the social mixing into account we rewrite the transmission coefficient as the product of a transmission β times a constant matrix C whose element C λσ measures the average number of physical contacts, on daily basis, among an individual in group age λ with an individual in group age σ. It should be noted that the probability of a susceptible individual, in age class λ, to come to contact with an infected individual in class σ is equivalent to the product of the contact rate C λσ times the probability I σ that individual in class σ is infected. Hence we can denote age groups appropriately via λ and σ and our model modifies to, We note that C λσ denotes the interaction between different age groups and hence it does not matter what groups of infected individuals it is coupled to. It should also be noted that the age structured mixing of individuals in the age group λ change their likelihood of being exposed to the virus given certain number of infectious people in the population. Furthermore, in our model we made a dependency between α and λ (i.e. α → α λ ) due to the assumption that younger individuals are more likely to be asymptomatic and less infectious [9] . The effective reproduction number is now calculated as, (2. 16) The dominant eigenvalue of this matrix is of the form, where n 1 is the number of age groups one wishes to consider. It can be seen from the formula above that the social mixing matrix C λσ has a scaling effect on increasing or decreasing the effective reproduction number. The impact of contact tracing on the effective reproduction number: In order to keep the spread of the virus under control we need to keep the effective reproduction number below one. If we assume the same mean infectious period for both symptomatic and asymptomatic case (i.e. T s ≈ T a ) and also if we take the multiplicative factor μ = 1, we get, where α λ φ is the critical proportion of the total cases which need to be traced in different age groups in order for R e < 1. Furthermore, the impact of contact tracing can be quantified by considering the relative change in the effective reproduction number. From the simple SEIR model let us recall the effective reproduction number and call it, where we added the social mixing term appropriately. Then the relative change in R e with contact tracing would be of the form, For T s ≈ T a and μ = 1 the relative change takes the critical form of α λ φ which is independent of the social mixing matrix C λσ and is only proportional to the probability that contacts of a reported case being traced, φ, and fraction of untraced cases that are reported in a particular age group, α λ . In literature [3] , μ has a unconstrained distribution of μ ∼ uniform(0.2, 0.55). α λ has a unconstrained distribution of α λ ∼ uniform(0.1, 0.99). In Fig. 1 we plot the relative change for upper and lower values of μ while keeping T s ≈ T a . Higher value of μ means higher transmission rate for asymptomatic individuals, as a result the relative change in the effective reproduction number with tracing is proportional to ∼ α λ φ for upper bound of μ while proportional to ∼ φ for lower bound of μ. Imperfect tracing: So far we assumed perfect tracing, that is to consider that traced individuals do not cause secondary infections due to an effective monitoring and tracking policies. In other words, we assumed transmission rate for monitored individuals is zero (i.e. β m = 0). Let us now relax this condition and assume that β m > 0, then the full dominant eigenvalue of (2.16) is, To capture the effect of the tracing let us assume p e e = 1 that is to consider that higher order traced contact is 100% infected by I e c individual, let us also assume that p i e = 1 meaning that higher order traced contact is 100% infected by I i c individual. In that case, the effective reproduction number reduces to, We remind that I e c are the monitored infected individuals who have shown no symptoms upon contacting them and I i c are the symptomatic infectious individuals who were traced and removed into isolation. It is now trivial that the efficiency of contact tracing has a linear contribution in the value of the effective reproduction number given by φβ m T m . The formula above shows that in order to have an effective monitoring policy in place the mean infectious period for monitored infectious individuals should be minimised. In other words, the time between symptom onset and isolation, T m should be at its minimum. The impact of age-structured social mixing on the effective reproduction number: To impose an effective distancing policy one needs to study the social mixing matrices that illustrate the average number of contacts between different age groups. Measuring the social mixing varies for different locations and could depend on the structure of households, schools and workplaces [9, 20] . In this paper, we are using the social mixing matrices constructed by [20] . The study by [20] conducted in the UK by surveying a sample of adults and recording their contact patterns after the British government put social distancing measures in place. In Fig. 2 we reconstructed the social mixing matrix from CoMIX study [20] , there are six age groups participated in their study, namely 18-29, 30-39, 40-49, 50-59, 60-69, and 70+. As a result 6 σ=1 C λσ . The CoMIX matrices show the average number of contacts between different age groups in different setups from the 24th of March 2020 one day after the lockdown was imposed in the UK. To control the spread of the virus one should monitor the number of contacts between each age group and minimise them when necessary. In the next section we use the mixing matrices in Fig. 2 to illustrate the effect of social mixing on the expected value of R e . Let us recall Eq. (3.4) and rewrite it as: where we define k = φR = φβT s , in this definition R is the basic reproduction number in the absence of contact tracing, note that this is different than R 0 which was defined earlier. 8 All rights reserved. No reuse allowed without permission. was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which this version posted July 7, 2020. . Figure 2 : Reconstruction of CoMIX matrices as given in [20] , each matrix represents a social setup and show the average number of contacts between different age groups after the lockdown was imposed in the UK on the 23rd of March 2020. be the fraction of reported cases which are traced contacts. In fact, q can be interpreted as the conditional probability that a transmission is traced given a cases is being reported. Finally, k m = φR m = φβ m T m where R m is the reproduction number of the contact traced / monitored individuals. Since k and k m are the infected traced contacts identified per untraced and traced reported cases respectively, it is sensible to define C j and T j as the cumulative reported cases and traced-reported cases in week j respectively. We now define the average number of traced infected individuals identified in week j + i per untraced and traced reported case from the preceding week j as, where 0 θ 1 is tracing/monitoring effectiveness, in other words, θ = R m /R ≡ k j,i m /k j,i . Furthermore, the fraction of reported cases that are traced in the week j + i is (3.8) 9 All rights reserved. No reuse allowed without permission. was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint (which this version posted July 7, 2020. . https://doi.org/10.1101/2020.07.05.20146647 doi: medRxiv preprint We can now write the expectation value of the effective reproduction number, given in week j, as Above estimates R (j) e as the ratio of the sum of untraced and fraction of traced reported cases (θ), weighted by probabilities p i . The probabilities p i are calculated using the serial interval of the COVID-19 that is estimated to be a normal distribution with mean of μ = 3.96 and standard deviation of σ = 4.75, [26] , therefore on weekly basis we have: p 1 = P(N < 7), p 2 = P(7 < N < 14) and so on. In perfect tracing (i.e. θ = 0) only weekly untraced reported cases (C i − T i ) play role in the calculation of R e while for imperfect tracing (i.e. θ = 1) all the traced and untraced reported cases are weighted equally. As a result, one does not require specific data about the traced reported cases per week. Utilising θ in this manner could serve as a useful tool for sensitivity analysis and in the countries where obtaining the contact tracing data is difficult. We are now equipped to calculate the expected value of the effective reproduction number for London. We begin by using the social mixing matrices given in Fig. 2 , where there are six age groups in participants (n = 6) reporting their contact pattern with eight different age groups. We note that each matrix element is the average number of contacts between different age groups. As a result, the normalised sum of each matrix's row gives the value of C for the particular age class and particular setup. Using the cumulative reported cases from [ 27] for London and Eq. (3.9) we estimated the expected value of R e -for nine weeks beginning from the 22nd of February 2020 -for different age groups in different setups in Fig. 3 . Our results show that before the lockdown the mean of the effective reproduction number is 2.55 and after the lockdown is imposed the mean of the effective reproduction reduces to 0.77 over the weeks four to nine for all age groups and setups, this is comparable to results of [20] and [25] (when averaged on weekly basis). In estimation of E(R (j) e ) we assumed that the average number of contacts between different age groups in London is roughly the same as the whole UK. We acknowledge that Londoners form only 16.5% of the participants in CoMIX survey yet for the simplicity we assume the contact behaviour of Londoners is the same with the rest of the country. We also assumed perfect tracing in this study, the estimation of θ is the subject of upcoming work. In this study we proposed an SEIR model with contact tracing and social mixing. We have shown that the effective reproduction number is proportional to dominant eigenvalue of the contact matrix, a result which is also derived in [21] using another method. In Eq.(2.17) we have derived contact tracing via parameters φ (probability of contacts of a reported being traced) and α (fraction of untraced cases who are reported) and showed that the critical value of the required tracing to keep the effective reproduction number below one is proportional to αφ. We also derived the effective reproduction number when there is higher order tracing in Eq. ( 3.4), in which case there is a linear contribution to the effective reproduction number given by φβ m T m where β m is the transmissibility of those infected individuals who are monitored and T m is the time between symptom onset and isolation for the monitored infectious individuals. In the case of the COVID-19 we can assume that β m ≈ β, yet to have an effective contact tracing and lower values of R e it is important to keep T m at its minimum -this of course requires an effective human capital to isolate the monitored cases immediately upon developing symptoms. We furthermore gave a formula in Eq. (3.9) to estimate the expected value of the effective reproduction number E(R e ) on a given week. This formula estimates the average reproduction number by using the serial interval of the COVID-19 and the fraction of traced and untraced reported cases. We used this formula to estimate the E(R e ) in London, UK over nine weeks beginning from the 22nd of February 2020. We used the proportionality of the effective reproduction number to social mixing matrices to calculate the E(R e ) for six setups and six age groups. While our method in estimating the E(R e ) is accurate and matches with the values of other studies (such as [25] for London) it is important to improve this calculation by using the formula presented in Eq. (3.4) . This equation depends on the infectivity parameters, such as β, T s and T a , and they can be estimated by methods such as Markov chain Monte Carlo (MCMC) (see for instance [13] ). This task is the subject of our upcoming work. From our model and analysis it is evident that contact tracing and social mixing are both playing important roles in controlling the value of the effective reproduction number (see also [22, 23, 24, 28, 29, 30] ). While there is an increasing demand to open up the society since the value of R e is below the critical value of one it is important for policymakers to note that the value of R e can rebound to higher values should the easing of the restrictions cause higher number of contacts between individuals. In fact, in opening up the society one must suppress the opportunities for transmission. From our model this can be done by firstly, putting an effective contact tracing measure in place, that is to trace, monitor and isolate the contacts of reported cases without time-wasting and secondly, monitoring the number of contacts between different age groups and the fraction of the reported cases classified by their age α λ to prioritise contacts tracing by age groups when there is a lack of health workers. Early transmission dynamics in Wuhan, China, of novel coronavirus?infected pneumonia A novel coronavirus from patients with pneumonia in China Explaining the Bomb-Like Dynamics of COVID-19 with Modeling and the Implications for Policy An interactive web-based dashboard to track COVID-19 in real time. The Lancet infectious diseases Modeling contact tracing in outbreaks with application to Ebola Time, space and social interactions: exit mechanisms for the COVID-19 epidemics A conceptual model for the outbreak of Coronavirus disease 2019 (COVID-19) in Wuhan, China with individual reaction and governmental action Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2) The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: a modelling study Equivalence of the Erlang SEIR epidemic model and the renewal equation Effects of the infectious period distribution on predicted transitions in childhood disease dynamics World Health Organization, 2020. Contact tracing in the context of COVID-19: interim guidance Statistical inference in a stochastic epidemic SEIR model with control intervention: Ebola as a case study Response to COVID-19 in Taiwan: big data analytics, new technology, and proactive testing Lessons drawn from China and South Korea for managing COVID-19 epidemic: insights from a comparative modeling study Are high-performing health systems resilient against the COVID-19 epidemic Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19) Clinical features of COVID-19 in elderly patients: A comparison with young and middle-aged patients The construction of nextgeneration matrices for compartmental epidemic models Quantifying the impact of physical distance measures on the transmission of COVID-19 in the UK. BMC medicine Using data on social contacts to estimate age-specific transmission parameters for respiratory-spread infectious agents Effectiveness of isolation, testing, contact tracing and physical distancing on reducing transmission of SARS-CoV-2 in different settings 19 working group, Age-dependent Effects in the Transmission and Control of COVID-19 Epidemics Modelling transmission and control of the COVID-19 pandemic in Australia COVID-19: nowcast and forecast Serial interval of COVID-19 among publicly reported confirmed cases Coronavirus (COVID-19) Cases, Greater London Authority Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts. The Lancet Global Health Effectiveness of Isolation, Testing, Contact Tracing, and Physical Distancing on Reducing Transmission of SARS-CoV-2 in Different Settings: A Mathematical Modelling Study. The Lancet. Infectious diseases Case isolation, contact tracing, and physical distancing are pillars of COVID-19 pandemic control, not optional choices. The Lancet Infectious Diseases Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application The author would like to thank Mikhail Goykhman for useful feedbacks and discussion. All rights reserved. No reuse allowed without permission.