key: cord-0987167-ma1t4uyz
authors: Cont, R.; Xu, R.; Kotlicki, A.
title: Modelling COVID-19 contagion:Risk assessment and targeted mitigation policies
date: 2020-09-01
journal: nan
DOI: 10.1101/2020.08.26.20182477
sha: c2efa6bff3f2df03994f7ab7bbbd370e54c2d356
doc_id: 987167
cord_uid: ma1t4uyz

We use a spatial epidemic model with demographic and geographic heterogeneity to study the regional dynamics of COVID-19 across 133 regions in England. Our model emphasises the role of variability of regional outcomes and heterogeneity across age groups and geographic locations, and provides a framework for assessing the impact of policies targeted towards sub-populations or regions. We define a concept of efficiency for comparative analysis of epidemic control policies and show targeted mitigation policies based on local monitoring to be more efficient than country-level or non-targeted measures. In particular, our results emphasise the importance of shielding vulnerable subpopulations and show that targeted policies based on local monitoring can considerably lower fatality forecasts and, in many cases, prevent the emergence of second waves which may occur under centralised policies.

The novel coronavirus pandemic of 2019-2020 has led to disruption on a global scale, leading to more than 800,000 deaths worldwide at the time of writing, and prompted the implementation of government policies involving a variety of 'non-pharmaceutical interventions' [20] including school closures, workplace restrictions, restrictions on social gatherings, social distancing and, in some cases, general lockdowns for extended periods. This has led to a range of different public health policies across the world, and the efficiency of specific policy choices has been subject to much debate.

While the nature of these restrictions has been justified by the severe threat to public health posed by the virus, their design and implementation necessarily involves a trade-off, often implicit in the decision-making process, between health outcomes and the socioeconomic impact of such social restrictions.

An important feature of the COVID-19 pandemic has been the heterogeneity of epidemic dynamics and the resulting mortality across different regions, age classes and population categories. The importance of these heterogeneities suggests that homogeneous models -often invoked in discussions on reproduction number and herd immunity-may provide misleading insights, and points to the need for more granular modeling to take into account geographic, demographic and social factors which may influence epidemic dynamics.

We propose a flexible modelling framework which can serve as a decision aid to policy makers and public health experts by quantifying this trade-off between health outcomes and social cost. Using a structured population model for epidemic dynamics which accounts for geographic and demographic heterogeneity, we formulate this trade-off as a control problem for a partially observed distributed system and provide a quantitative framework for comparative analysis of various mitigation policies. We illustrate the usefulness of the framework by applying it to the study of COVID-19 dynamics across regions in England and showing how it may be used to reconstruct the latent progression of the epidemic and perform a comparative analysis of various mitigation policies through scenario projections.

Several recent studies have used homogeneous compartmental models [3, 38, 17, 29, 44, 50, 47, 51] or age-stratified versions of such models [1, 15, 16, 36, 46, 54] to analyse the dynamics and impact of the COVID-19 epidemic in various countries. Our framework, while compatible with such homogeneous models at aggregate level, accounts for demographic and spatial heterogeneity in a more detailed manner, leading to regional outcomes which may substantially deviate from such models, as discussed in Section 5.2. Similar, though somewhat less detailed, heterogeneous models have been recently used to study COVID-19 outbreaks by Birge et al. [10] for New York City and Roques et al. [49] for France.

We first present below an overview of the main features of our approach and the key findings, before going into more detail on the methodology and results.

epidemic and apply this model to the study of COVID-19 dynamics across regions in England.

The model takes into account

• Epidemiological features estimated by previous studies on COVID-19;

• The lack of direct observability of the total number of infectious cases and the presence of a non-negligible fraction of asymptomatic cases;

• The demographic structure of UK regions (age distribution, density);

• Social contact rates across age groups derived from survey data;

• Data on inter-regional mobility; and

• The presence of other random factors, not determined by the above.

We first demonstrate that this model is capable of accurately reproducing the early regional dynamics of the disease, both pre-lockdown and a month into lockdown, using a detailed calibration procedure that accounts for demographic heterogeneity across regions, low testing rates, and existence of asymptomatic carriers. The calibration reveals interesting regional patterns in social contact rates before and during lockdown. Underlying any public health policy is a trade-off between a health outcome -which may relate to mortality or hospitalisations-and the socio-economic impact of measures taken to mitigate the magnitude of the impact on public health. We present an explicit formulation of this trade-off and use it to perform a comparative analysis of various 'social distancing' policies, based on two criteria:

• the benefit, in terms of reduction in projected mortality; and

• the cost, in terms of restrictions on social contacts.

The goal of our analysis is to make explicit the policy outcomes for decision-makers, without resorting to (questionable) concepts such as the 'economic value of human life' used in some actuarial and economic models [1, 44, 51] .

In our comparative analysis, we consider a broad range of policies and pay particular attention to population-wide versus targeted mitigation policies, feedback control based on the number of observed cases and the benefits of broader testing. We introduce a concept of efficient policy, and show how this concept allows to identify decision parameters which lead to the most efficient outcomes for each type of mitigation policy. The granular nature of our model, together with validation based on epidemiological data, provide a more detailed picture of the relative merits of various public health policies.

• Using a baseline epidemic model consistent with epidemiological data and observations on fatalities and cases reported in England up to June 2020, we estimate the current number of infected individuals in England on August 1, 2020 to be more than 1 million, including 233,000 (or 22%) symptomatic and 858,000 (or 78%) asymptomatic carriers. We also estimate more than 17.8 million persons in England (31.7% of the population) to been exposed to COVID-19 and recovered by August 1, 2020. These estimates are much higher than numbers discussed in media reports, often based on the number of reported cases, suggesting that England is closer to 'herd immunity' than previously thought.

• Based on a comparison of fatality counts and reported cases, we infer that less than 5% of cases in England had been detected prior to June 2020. This low reporting probability has important consequences: in particular, it implies that one may observe a streak of many consecutive days without any new reported cases whilst the epidemic is in fact silently progressing.

• We observe significant differences in epidemic dynamics across regions in England, with higher rates of contagion in northern regions compared to southern regions, both before and during the lockdown period.

• We estimate that, in absence of social distancing and confinement measures, the number of fatalities in England may have exceeded 216,000 by August 1, 2020, indicating that the lockdown has saved more than 174,000 lives.

• Comparison with a homogeneous model with compartments defined at national level reveals large differences in regional forecasts of case numbers and fatalities, pointing to the importance of demographic and geographic heterogeneity for modeling the impact of COVID-19.

Once the model has been calibrated to replicate the regional progression of COVID-19 in England for the period March 1-May 31, 2020, we use it for scenario projections under various mitigation policies. Comparative analysis of mitigation policies reveals that measures targeted towards subpopulations -vulnerable age groups or regions with outbreaksare more efficient than measures applied at the population as a whole. More specifically:

• Shielding of elderly populations is by far the most effective measure for reducing the number of fatalities.

• By contrast, school closures and workplace restrictions are seen to be less effective than social distancing measures outside of school and work environments.

• Adaptive policies ('feedback control') which trigger measures when the number of daily observed cases exceed a threshold, are shown to be more effective than preplanned policies, leading to a substantial improvement in health outcomes.

• Decentralised policies based on regional monitoring of daily reported cases are found to more efficient than centralised policies based on national indicators, resulting on average in an overall reduction of 20,000 in fatalities and, in many cases, the prevention of a 'second wave' which may occur under centralised policies.

• Comparative analysis of policies (Table 8) shows a wide range of health outcomes, ranging from 53,000 to 165,000 fatalities. The most effective policy in terms of reducing fatalities involves triggering of regional confinement measures decentralised based on monitoring of new cases, together with shielding of elderly populations.

The modeling framework is described in Section 2. Data sources and parameter estimations are detailed in Section 3. Section 4 highlights the implications of partial observability of state variables and the associated model uncertainty. Section 5 discusses the counterfactual scenario of no intervention, which serves as a benchmark to evaluate the impact of social distancing policies. The outcomes of various epidemic control policies are then discussed in Sections 6 and 7. Pre-planned policies are discussed in Sections 6.1 and 6.2, while Section 7 discusses adaptive ('feedback') control policies, in which measures are triggered when the daily number of new reported cases exceeds a threshold.

We now describe the framework used to model the dynamics of the epidemic. To take into account the role of geographic and demographic heterogeneity, we use a stochastic compartmental (SEIAR) model with age stratification, mobility across sites, social contact across age stratification, and the impact of asymptomatic infected individuals. For general concepts on deterministic and stochastic compartmental models we refer to Anderson and May [5] , Andersson and Britton [6] , Brauer and Castillo-Chavez [11] , Britton et al. [12] , Grassly and Fraser [23] , Lloyd and Jansen [37] .

We consider a regional meta-population model with K regions labeled r = 1, . . . , K. Each region r has a population N (r) which is further subdivided into 4 age classes, labeled a ∈ {1, 2, 3, 4} representing respectively children (below 20 years), adults (19-60) and two senior groups (60-70 years and above 70 years). We denote N (r, a) the population in region r in age category a, with N (r, 1) + N (r, 2) + N (r, 3) + N (r, 4) = N (r). Individuals in each region and age group are categorized into six compartments:

• Susceptible (S) individuals who have not yet been exposed to the virus;

• Exposed (E) individuals who have contracted the virus but are not yet infectious. Exposed individuals may then become infectious after a certain incubation period;

• Infectious (I) individuals who manifest symptoms;

6 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08. 26.20182477 doi: medRxiv preprint • Asymptomatic (A) infectious individuals;

• Recovered (R) individuals. In line with current experimental and clinical observations on COVID-19, we shall assume that individuals who have recovered have temporary immunity, at least for the horizon of the scenarios considered, and cannot be reinfected [8] ; and

• Deceased (D) individuals.

The progression of the disease in the population is monitored by keeping track of the respective number

of individuals in each compartment. As the model focuses on the dynamics of the epidemic over a short period (1000 days), we neglect demographic changes over this period and assume that the population size N (r, a) in each location and age group is approximately constant, that is

is constant. Figure 1 : Epidemic dynamics.

7 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08. 26.20182477 doi: medRxiv preprint When each subpopulation (r, a) is large and homogeneous, the dynamics of state variables may be described through the following system of equations, represented in Figure 1 :

(2.1) where • 0 < α < 1 is the infection rate per contact, that is the probability of infection conditional on contact;

• β is the incubation rate, and 1/β is the average incubation period;

• γ is the rate at which infectious individuals recover;

• 0 < p a < 1 is the probability for an infected individual in age group a to develop symptoms;

• f a is the infection fatality rate for age group a, representing the probability that an infected individual in age group a dies from the disease; and

• The force of infection λ t (r, a), which measures the rate of exposure at location r for age group a, is given by

(2.

2)

The force of infection in each subpopulation depends on the rate of contact with (infected) individuals in other subpopulations, leading to interactions across subpopulations which differentiate this model from a homogeneous model. These interactions occur through:

• Contacts across age groups: the term σ r a,a (t) represents the average number of persons from age class a encountered per day by a person from age class a in region r on a day t. For infectious individuals with symptoms, we assume a lower contact rate κσ < σ due to (partial) self-isolation; and

• Inter-regional mobility: M r,r (t) represents the proportion of individuals from region r among the population of adults at a location r on a given day t.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The deterministic model described in Section 2.2 ignores the variability of outcomes [25] , as random factors are not taken into account in the model. To take into account the variability of outcomes and assess their respective probabilities, we model the variables (S(t), E(t), I(t), A(t)) as a continuous-time Markov point process [4, 6] defined through its transition rates conditional on the history H t up to date t:

The stochastic dynamics in (2.3) are consistent with the deterministic dynamics of (2.1) for large populations, in the sense that the population fractions represented by each compartment converge to those represented by the solution of (2.1) as min r N (r) increases. However, even when the overall population is large, the stochastic dynamics (2.3) can substantially deviate from the deterministic model (2.1), especially in small subpopulations and in the early phases of the epidemic when the number of infected individuals in each region may be small.

In the sequel we use the stochastic model (2.3) for the state variables and occasionally compare the outcomes with (2.1) to assess the variability of outcomes and the role of randomness (see Section 5.3).

Social distancing policies (and lockdowns) affect epidemic dynamics by influencing (lowering) the social contact rates σ r ij and the inter-regional mobility M r,r . To discuss targeted policies which may influence differently social contact rates at different locations, we decompose the baseline social contact matrix σ r as

where the components correspond respectively to contacts at home (σ r,H ), at work (σ W ), school (σ S ) and other locations (σ O ). Social distancing policies are then parameterised in terms of their impact on various components of the social contact matrix:

where 0 ≤ u r,X ij (t) ≤ 1 are modulating factors which measure the impact of the policy on social contacts between age groups i and j at a location X in region r. In absence of social 9 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . distancing or confinement measures, we have u r,X ij (t) = 1; the value of u r,X ij (t) reflects the fraction of social contacts when between age groups i and j at location X in region r when the policy is applied.

This parameterisation allows us to consider policies targeted towards sub-population or specific regions. For example, school closure in region r during time period [t 1 , t 2 ] corresponds to setting u r,S ij (t) = 0 for t ∈ [t 1 , t 2 ], while 0 < u r,S ij < 1 corresponds to social distancing in schools, with lower values of u r,S ij corresponding to stricter enforcement of measures.

Regarding the inter-regional mobility matrix M , following the interpretation discussed in Section 3.2, we modulate its value according to the fraction u r,W 2,2 of the population who continue to commute, that is

Here M r,r is the fraction of population in region r whose habitual residence is in region r before lockdown. The modulating factors u r,X ij may be chosen in advance or expressed as a function of the state of the system. We distinguish:

• Pre-planned (also called 'open-loop') policies, in which target values of modulating factors u r,X ij (t) are decided in advance; and

• Adaptive policies (also called 'closed loop' or feedback control), in which actions are decided and updated as a function of observed quantities such as number of daily reported cases or number of daily fatalities.

Comparative analysis of mitigation policies To perform comparative analysis across different policies, we need to evaluate policy outcomes across two dimensions: health outcome and socio-economic impact. We quantify the health outcome of each policy by the total number of fatalities during a reference period, taken to be t max = 1000 days after the reference date of March 1, 2020. The length of this reference period is chosen such that it takes into account an eventual 'second wave' of fatalities. We denote this outcome by D tmax (u), which represent the total fatalities at date t max associated with policy u.

To quantify the socio-economic impact of a policy, we use as metric the reduction in social contact resulting from the policy over the horizon [0, t max ], that is

defined in terms of man×day units. The range of policies examined below lead to different outcomes in terms of fatalities D tmax (u) and social cost J(u). A policy v dominates (or improves upon) a policy u if it leads to a similar or better health outcome at an equal or lower cost:

and

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . with at least one inequality being strict. A policy u is efficient among a class of policies U if it cannot be improved upon by any policy in this class. Given a set of policies U , the subset of efficient policies forms the efficient frontier of U .

Some recent economic models [1, 44, 51] formulate the trade-off in different terms, by introducing a concept of monetary value of human life in order to build a (monetary) welfare function combining both terms. Aside from ethical issues linked to the very concept of monetisation of human life, there is no consensus on its actual value, which is a key determinant of the trade-off in this approach. Our approach avoids specifying such a value and aims at identifying the range of efficient policies, leaving the final choice of the trade-off to policymakers.

In what follows, the goal is to determine the efficient frontier and describe the characteristics and outcomes of such efficient policies. Pre-planned policies are discussed in Sections 6.1 and 6.2, while adaptive policies are discussed in Section 7.

The basic inputs of the model are panel data on number of cases and fatalities reported at the level of Upper Tier Local Authorities (UTLA) level in England 1 . This defines the granularity of the model: we partition the population of England into 133 regions as defined by the Nomenclature of Territorial Units for Statistics at level 3 (NUTS-3) 2 . Demographic data for NUTS-3 regions, is available from Eurostat 3 . For the purpose of our study we distinguish four age groups, as shown in Table 1 Appendix A provides the list of UK regions used in this study. The size N (r, a) of age group a in region r is obtained using the 2018 dataset from the Office for National Statistics 5 .

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint

For our baseline estimate of inter-regional mobility we use the data on location of usual residence and place of work, provided by the UK Office for National Statistics 2011 Census data 6 . The dataset classifies people aged 16 and over in employment during March 2011 and shows the movement between their area of residence and workplace, defined in Local Administrative Units at level 1 (LAU-1) terms. We then map this data onto NUTS-3 regions using the lookup table between LAU-1 and NUTS-3 areas provided by the Office for National Statistics 7 .

The data is then represented in the model through the inter-regional mobility matrix M , whose elements M r,j represent the fraction of population in region r whose habitual residence is in region j. Denote by Π(r, j) the population with residence registered in region j and workplace registered in region r for r = j. In addition, we denote Π(r, r) = N (2, r) the population at location i in the age category 20 − 60 years. Then we estimate the coefficients M r,j by

.

(3.1)

Epidemiological parameters were either estimated from publicly available sources [2, 41] or set to values consistent with recent clinical and epidemiological studies in COVID-19 [18, 27, 55] .

Social contact rates Contact rates across age classes have been estimated in studies by Mossong et al. [40, 41] and Béraud et al. [9] . We follow the approach of Mossong et al. [40] to estimate the following baseline social contact rates across our four age groups (see Table 1 ): 

Using the PyRoss methodology of Adhikari et al. [2] , we further decompose the contact matrix, as in (2.4), into four components representing contacts at home (σ H ), work (σ W ), school (σ S ) and other locations (σ O ). Estimation methods and parameter values for these matrices are discussed in Appendix B. However, contact rates vary across different regions due to the heterogeneity in socioeconomic composition structure and specific regional characteristics, such as its population density and level of urabnisation. To account for this heterogeneity, we parameterise the contact matrix in region r as σ r = d r σ where the regional adjustment factors {d r } 133 r=1 are 6 See https://www.nomisweb.co.uk/census/2011/wu01ew. Data retrieved on January 2020. 7 See https://geoportal.statistics.gov.uk/datasets/c893dfece45f465f857ac34641041863_0.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . estimated to reproduce the regional growth rate of reported cases before the lockdown period. The results are displayed in Figure 2 .

As seen in Figure 2 , our findings imply substantial heterogeneity of social contact rates across regions. Average contact rates in some regions (in particular in southern regions) are much lower than the national average, while in other regions (in particular the North-East and North-West) they are 20% to 30% higher. As we will observe below, these differences have a considerable impact on regional epidemic dynamics. Incubation rate, β Following the study of Ferguson et al. [20] , we use the incubation rate β = 0.2, which corresponds to an incubation period of approximately 5 days. This is further supported by several empirical studies on diagnosed cases in China outside Hubei province. An early study of Backer et al. [7] based on 88 confirmed cases, which uses data on known travel to and from Wuhan to estimate the exposure interval, indicates a mean incubation period of 6.4 days with a 95% confidence interval (CI) of 5.6-7.7 days. The study of Linton et al. [33] , based on 158 confirmed cases, estimates a median incubation period of 5.0 days with 95% CI of 4.4-5.6 days. Linton et al. [34] shows the incubation 13 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. period has a mean of around 5 days with 95% CI of 4.2-6.0 days when approximated using the best-fit log-normal distribution using data from 31 January 2020. A more recent paper of Lauer et al. [30] estimates a median of incubation period to be 5.1 days with 95% CI of 4.5-5.8 days, based on 181 cases over the period of 4 January 2020 to 24 February 2020.

Proportion of symptomatic and asymptomatic infections The probability p that an infected individual develops symptoms is an important parameter for epidemic dynamics, yet subject to a high degree of uncertainty: studies on various data sets [13, 16, 39, 21, 43] are based on small samples and yield a wide range of estimates. In particular, an early estimates from the Diamond Princess cruise ship [39] and Japanese evacuation flights from Wuhan yielded estimates as high as p 0.7 − 0.8 [42] , while a July 2020 study by the ONS [43] , based on a much larger sample, showed that p can be as low as 0.23. However, clinical studies [16] indicate that this probability may strongly depend on the age group considered.

In this study, we use a range of values for the age-dependent probability p a whose upper bound is consistent with Davies et al. [16] and whose lower bound is consistent with the estimates provided by the ONS [43] . These values are displayed in Table 2 . Given the much larger sample size used in the study of the ONS [43] , we use the corresponding estimates ('low values' in Table 2 ) as benchmark unless stated otherwise.

Recovery rate γ In line with Cao et al. [14] , Li et al. [32] and Rocklöv et al. [48] , we use a recovery rate γ = 0.1, which corresponds to an average infectious period of 10 days.

We denote by f a the (infection) fatality rate for age group a. In practice, these parameters are difficult to estimate during outbreaks and estimates may be subject to various biases [35] . Note that the infection fatality rate (IFR) is different from (and generally much smaller than) the case fatality rate.

Fatality rates for COVID-19 have been observed to be highly variable across age groups [27, 52, 55] . Based on the infection fatality rates provided in Verity et al. [55] for different age groups and the UK population distribution, we derive the following aggregated IFR for the respective four age groups of interest: f 1 = 0.01% (CI: 0.0008 − 0.037%), f 2 = 0.25% (CI: 0.120 − 0.475%), f 3 = 2.1% (CI: 1.11 − 3.89%), and f 4 = 6.5% (CI: 2.96 − 10.25%). These estimates are consistent with data obtained from other countries; see for example Salje et al. [52] .

14 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . 

f 1 = 0.01, f 2 = 0.25, f 3 = 2.1, f 4 = 6.5 [55]

We use a simulation-based indirect inference method of Gourieroux et al. [22] for estimating the parameter α. We simulate the stochastic model (2.3) for a range of values 0.03 ≤ α ≤ 0.15 and compare the logarithmic growth rates of the simulated reported cases C t (α) with the one estimated from reported cases C t in England. For the simulation we use parameters specified in Table 3 and the following initial conditions for t 0 = March 10, 2020:

These initial conditions ensure that the simulations agree on average with regional case numbers on March 15, 2020, for all values of α.

This procedure yields an estimated value of α = 0.055 and a confidence interval [0.051, 0.062] (see Figure 3a ). As shown in Figure 3b , this value of α, together with the model parameters in Table 3 , yields a good fit of the pre-lockdown evolution of case numbers.

These results are also consistent with estimates obtained in Donnat and Holmes [17] and Dorigatti et al. [18] using data from other countries. 15 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint 3.5 Inter-regional mobility and social contact during confinement Confinement measures were implemented across the United Kingdom starting March 23, 2020 via the Coronavirus Act 8 . During this 'lockdown' period schools and workplaces were closed and social contact was reduced, as evidenced by mobility data 9 . However mobility data also reveal regional differences in the impact of the lockdown.

We model the reduction in inter-regional mobility through an adjusted mobility matrix

and M r,r is the inter-regional mobility matrix defined in (3.1). According to the Labor Force Survey data from 2018/19 [19] , 7.1 million adults across the UK are considered as 'key workers'. We set q = 20% to take into account the fact that these key workers continued to access their workplace during the lockdown period. This is also consistent with the methodology in Rawson et al. [47] and empirical studies of Santana et al. [53] on mobility changes before and after lockdown in the UK. Figure 4 shows the submatrix corresponding to daily mobility across London boroughs, and illustrates the observed dramatic drop in commute patterns. We model the impact of confinement on the social contact matrix through a regional multiplier l r :

l r < 1 represents the reduction in social contacts during the lockdown period; l r = 1 corresponds to the pre-lockdown level of social contact. The value of l r is estimated from 8 https://www.legislation.gov.uk/ukpga/2020/7/contents/enacted 9 https://www.oxford-covid-19.com/

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . panel data on regional epidemic dynamics during the period from March 23 to June 1, 2020, using a least-squares logarithmic regression on the number of observed regional cases.

The average value of this reduction factor is found to be

that is an average reduction of 88% in social contacts, an order magnitude corroborated by mobility data [53] , showing that lockdown was very effective in reducing social contact. However, as shown in Figure 5 and Table 4 : Estimated values for regional adjustments d r and l r in NUTS1 regions.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint 

Having estimated the model parameters using data on reported cases between March 10 and May 20, 2020 we assess the goodness-of-fit and out-of-sample performance using reported cases and fatalities between May 21 and June 22, 2020. The results, shown in Figures 6  and 7 , show that the model is able to reproduce well both the in-sample and out-of-sample evolution of number of cases and fatalities, at national level as well as regional level (see Figure 8 ).

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

When applying such models to epidemic data, a key point is to realize that the state variables S, E, I, A, R are not directly observed (and certainly not in real time) but need to be inferred from other observable quantities.

The two main observables in COVID-19 data are

• the cumulative number of reported cases, and

• the cumulative number of COVID-19 fatalities D t ; broken down by region and age group. Of the two, fatalities are generally considered more reliable, as deaths are nearly always reported, while identification of cases requires testing or self-reporting. We thus identify the observed number of fatalities with the state variable D t .

In absence of widespread testing, as has been the case with COVID-19, only a fraction π of cases are reported. This fraction may change with time due to testing campaigns. 10 10 https://ourworldindata.org/coronavirus-testing 20 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint

We therefore cannot assume the number of infectious cases to be directly observed: rather, we estimate it from the death count D t (see also Jombart et al. [26] ). Let C t be the cumulative number of (symptomatic) infectious cases. Assuming that • the daily number r(t) of reported cases is a fraction π(t) of new cases:

(4.1)

• deaths occur on average T days after detection;

we obtain that the daily death count is proportional to the lagged number of new infectious cases:

where f is the (average) infection fatality rate. We use these relations to obtain an estimate for the cumulative number C t of symptomatic infections and the reporting ratio π(t).

First, using (4.2) we estimate the average delay T between case reporting and death by identifying the lag T which maximizes the correlation between the D t+T +1 − D t and r(t). As shown in Figure 9 , using aggregate fatality counts for England, a maximum correlation ρ max=95% is observed for a lag of T = 8 days between reporting and death. Note that this delay is shorter than the typical recovery period, implying that reporting typically does not occur at the onset of the infection but after a delay of a few days, which is consistent with other studies (for example, Harris [24] ). Using an average fatality rate of f = 0.9% for the UK as in Ferguson et al. [20] (see discussion in Section 3.3), we estimate the reporting probability to be

21 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint which implies that the total number of cases in England is more than 20 times the reported number. This ratio fluctuates between 0.04 and 0.06 during the observation period, as shown in Figure 10 . 

A key issue in epidemic control is the availability of reliable indicators for the intensity of an ongoing epidemic. Public health authorities have communicated the daily number of reported cases and fatalities, and these have served as the main inputs for policy planning.

An important corollary of the above discussion is that, given the combination of random factors affecting dynamics and the considerable uncertainty on the actual number of new infections, it is perfectly possible to observe a run of many consecutive days without new reported cases while in fact the actual number of infections is on the rise.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint (a) I t and A t .

(b) Reported cases. Figure 11 : Example of latent progression of the epidemic with zero reported case for 100 consecutive days (green shaded area). On the last day of this period, we have E t = 0 I t = 0 and A t = 1. Reporting probability is π = 4.5%.) Figure 11 shows an example of scenario in our model where, for 100 consecutive days, although a small number of (symptomatic and asymptomatic) cases appear, due to the low detection probability (π = 4.5%), none of them is reported. Nevertheless, after a run of 100 days without any reported cases (green shaded area in Fig 11) , which would have prompted most public health authorities to lower their guard, the epidemic takes off again. This scenario is not unlike what occurred in South Korea and several other locations in 2020. Figure 12 shows the probability of observing a subsequent (second) peak in infections if social distancing measures are lifted after no reported cases for L consecutive days. This probability is estimated using 500 simulated paths from the model (2.3). It is striking to observe that, even after 100 days with no reported cases, the probability of observing a resurgence of the epidemic is around 40%. Figure 12 (blue dashed line) shows the same probability conditional on observing no fatalities for L consecutive days.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. These observations point to the importance of broader testing: as shown in Figure 13 , an increase in the probability π of detecting new cases leads to a strong decrease in the probability of misdiagnosing the end of the epidemic, as in the scenario described above. In absence of widespread testing, policymakers are faced with the problem of controlling a system under partial observation. Figure 13 : Probability of having a second peak in infections following 60 consecutive days with no reported cases, as a function of reporting probability π (after July 1st) ( solid line for high symptomatic ratios and dashed line for low symptomatic ratios). Estimates based on average of 500 simulated scenarios.

A much debated issue has been whether the lockdown and subsequent social distancing restrictions were necessary or whether health outcomes would have been comparable in absence of any restrictions, eventually leading to herd immunity. To examine this question 24 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint we consider the counterfactual scenario of no intervention and estimate the fatalities and peak number of infections under such a scenario.

Our counterfactual simulations show that, in absence of social distancing and confinement measures, the number of fatalities in England may have exceeded 216,000 by August 1, 2020. This is 174,000 more than the outcome actually observed on this date following the lockdown. Figure 14 displays the evolution of number of symptomatic infections and fatalities in absence of restrictions, under two different assumptions on symptomatic ratios (see Table  2 ). Under the assumption of low symptomatic ratios, the total fatalities in absence of any mitigation policy are estimated to be 216, 000 on average across 100 scenarios, with 12.8 million (more than 20% of England's population) being infected and symptomatic. A peak number of 3, 720, 000 symptomatic individuals in England would have been reached on April 30th. Figure 15 decompose these results across different age groups. The low and high estimates for symptomatic ratios lead to very different simulation outcomes in terms of peak I t values and total death numbers, which illustrates the huge impact of parameter uncertainty on model projections.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. Heterogeneity of regional outcomes Regions exhibit heterogeneous outcomes in terms of peak time, peak value, and fatalities (per 100,000 inhabitants).

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

Homogeneous SIR models [3, 38, 17, 29, 44, 50, 47, 51] or age-stratified versions of such models [1, 15, 16, 46, 54] have been used in many recent studies on COVID-19 in the UK and other countries.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. .

The heterogeneity of outcomes observed in our simulation suggests that homogeneous epidemic models may fail to capture some important features of COVID-19 dynamics which are relevant for public health policy. We investigate this point further by comparing our simulations with two homogeneous benchmark: a country-level SEIAR compartmental model and an age-stratified version of it.

Homogeneous SEIAR model The homogeneous SEIAR model corresponds to the case where the country is considered as a single region, assuming away geographic heterogeneity:

where σ is the average contact rate in the population and N is the total population.

We use the values of α, β and γ as specified in Table 3 and population-averaged versions of low symptomatic ratios in Table 2 . We aggregate the POLYMOD age-stratified contact matrix (3.2) from POLYMOD, the probability of developing symptom, and fatality rate using the England population age distribution (Table 1) leading to an aggregate fatality rate of 1.1%, probability of developing symptoms of 45.4%, and contact number of 4.157. As in Sections 3.3 and 3.5, we use an indirect inference method [22] to estimate the implied scaling parameters for social contact rates by matching daily case dynamics before and during lockdown, leading to d = 2.909 and l = 0.115.

We also consider an age-stratified (A) version of this model with 4 age groups. The rate of exposure for age group a is given by

For the age-stratified model, we use α, β and γ as in Table 3 . We estimate the scaling parameters for social contact rates before and during lockdown as above, yielding d = 1.07 and l = 0.112.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. As shown in Figure 17 , these country-level models actually yield reasonable fits to aggregate dynamics of cases and fatalities.

However, the inability of these country-level models to capture regional heterogeneity and inter-regional exchanges leads to regional outcomes which are very different from our model with spatial heterogeneity (F). Figures 19-22 compare simulation results for the homogeneous model (H), the age-stratified model (A) and our spatial heterogeneous model (F) in two regions: Torbay (UKK42) and Birmingham (UKG31). The homogeneous model is observed to over-estimate fatalities in age groups 1 and 2 and underestimates fatalities in age groups 3 and 4. The age-stratified model and the heterogeneous model agree on fatalities across age groups but, as shown in Figures 19-22 , neither the age-stratified model nor the homogeneous model can capture the regional features of epidemic dynamics, such as the difference in peak time and peak value of infections across regions.

We therefore conclude such country-level models should not be used in the context of policy discussions which target regional measures or regional outcomes.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

Given a set of initial conditions, the stochastic model (2.3) may lead to a range of outcomes due to the randomness present in the dynamics. Figures 23-24 show an example of variability of outcomes across different scenarios. As expected from asymptotic analysis of large population limits [12] , the relative variability across scenarios is of order ∼ 1/ N (r, a) for a sub-population of size N (r, a). Although not negligible, especially at the onset of the epidemic, this variability remains small at the regional level given the granularity used in our model. In the sequel we have thus reported the average outcomes across 100 or 1000 simulated scenarios for each case examined.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 6 Comparative analysis of epidemic control policies

We first consider the impact of a 'lockdown' followed by social distancing, which reflects the situation in the UK since end March 2020. We examine in particular the impact of a lockdown duration T and the level of social distancing after lockdown on the number of fatalities and the associated social cost. To do so, we parameterize the contact matrix as σ r (t) = l r σ r for t 0 ≤ t ≤ t 0 + T (lockdown), ((1 − m)l r + m) σ r for t > t 0 + T (after lockdown), 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . All scenario simulations correspond to a lockdown starting at t 0 = March 23, 2020. We consider a range 105 ≤ T ≤ 335 for the lockdown duration and 0.2 ≤ m ≤ 1 for post-lockdown compliance levels. Note that the actual lockdown duration in England corresponds to T = 105.

As shown in Figure 25a , the level of social distancing after the confinement period is observed to be more effective in controlling the epidemic (Figure 25b) , than extending the period (Figure 25a ). This is consistent with the findings in Lipton and Lopez de Prado [36] . Smaller values of m, associated with stricter social distancing, lead to a lower of fatalities but for at an increased social cost (Figure 25b ). On the other hand, the lengthening of the lockdown duration T , while significantly increasing the associated social cost, does not result in a significant reduction in the number of fatalities, especially if social distancing after lockdown is relaxed. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. Table 5 : Outcomes for policies represented in Figure 26 .

By comparing the orange and blue plots, which represent the same post-lockdown compliance level (m = 0.5), we observe that extending the lockdown duration increases social cost without reducing the total number of fatalities. On the other hand, comparing the orange and green plots, which correspond to the same lockdown duration of T = 105 days, shows that moving the compliance level from m = 0.5 to m = 0.4 reduces the second peak amplitude by 35% and fatalities by 12.5%.

Impact of parameter uncertainty The above results are highly sensitive to the value of the symptomatic ratios which, as noted in Section 3, are highly uncertain (see Table  2 ). Figure 27 shows the policy outcomes for low versus high symptomatic ratios across different compliance levels and lockdown duration. As observed in this figure, while the overall pattern of the efficiency diagram is similar, the projected fatality levels shift considerably depending on the assumption on the symptomatic ratio: from 70,000-200,000 for low symptomatic ratios to 170,000-430,000 for high symptomatic ratios.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . Figure 27 : trade-off between fatalities and social cost for T -day lockdown followed by social distancing (0.2 ≤ m ≤ 1, 105 ≤ T ≤ 335): low symptomatic ratio (orange) and high symptomatic ratio (blue).

Regional heterogeneity While the policies discussed here are applied uniformly across all regions, we observe a significant heterogeneity in mortality levels across regions, as well in terms of the timing and amplitude of a second peak tin infections. As shown in 28, some regions exhibit mortality levels up to 4 times higher than others. This huge disparity in mortality rates cannot be explained by demographic differences alone, which are much less pronounced: more important seem to be the differences in social contact patterns, as illustrated in Figure 2 . Indeed, as shown in Figure 29a , there is a positive correlation (> 40%) between regional COVID-19 mortality and the intensity of social contact as measured by the parameter d r defined in Sec. 3.3. Figure 29b shows that this heterogeneity is also reflected in the timing and amplitude of second peaks. We observe that East Cumbria has a small second peak compared to its first one, Birmingham and Berkshire experience a second peak with a similar size compared to the first one, while North Northamptonshire suffers from a much more severe second peak. The second peak in North Northamptonshire and Berkshire occur around t = 175 and t = 231, respectively.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . Figure 28 : Lockdown of 105 days followed by social distancing (m = 0.5): regional mortality per 100,00 inhabitants. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint

We now consider the impact of social distancing measures targeting particular age groups or environments (school, work, etc.) following a lockdown of duration T , by setting

We consider different targeted measures after a lockdown period of T = 105 days (the actual duration of the lockdown in England): school closure, shielding of elderly populations and workplace restrictions, restrictions on social gatherings and combinations thereof. Note that there is no control over the social contacts at home.

Closure of schools Although most of the infected population in age group 1 is asymptomatic, they may in turn infect the population in age groups 3 and 4 who are more likely to develop symptoms. School closure corresponds to u S = 0, school reopening with social distancing correspond to u S = 0.5, and school reopening without social distancing correspond to u S = 1.

Shielding The high infection fatality rates among elderly populations (age groups 3 and 4) have naturally lead to consider shielding policies for these populations. We model this as a reduction in social contacts of age groups 3 and 4 to the level observed under lockdown:

Workplace restrictions We model the impact of a restricted return to work after confinement by assuming different proportion of workforce return after the lockdown period by choosing

the lower bound u W = 0.2 corresponding to restricting workplace return to 'essential workers', as discussed in Section 3.5. Since workplace restrictions have an effect on commuting, such measures also have an impact on the inter-regional mobility matrix

where C 0 (r, r ) is the baseline mobility matrix defined in (3.1).

Restrictions on social gatherings Although social activities, such as gatherings at pubs or sports events, may aggravate the contagion of COVID-19, keeping certain levels of social activities is important to the economic recovery and the well-being of individuals. The parameter u O measures the fraction of social gatherings: during the lockdown this fraction was estimated to be as low as 20% (see Section 3.5) . In what follows, we consider u O ∈ [0.3, 1.0] after the period of lockdown.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . Table 6 : Impact of school closures and social distancing at schools: outcomes averaged across 50 simulated scenarios, u H = u W = 1, u O = 0.5. Table 6 show the impact of school closures and social distancing at schools on projected fatalities and social contacts. Reopening of schools, while reducing significantly the social cost, does not seem to lead to a significant increase in fatalities.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. Figure 31 , the 'open school' policy leads to 35% fewer fatalities 35% compared to the 'open pubs' policy.

We have examined the impact of shielding in isolation and also in combination with other measures such as school closure and social distancing.

As shown in Figure 30a , whether applied in isolation or in combination with other measures, shielding of elderly populations is by far the most effective measure for reducing the number of fatalities. As clearly shown in Figure 30a , regardless of the trade-off between social cost and health outcome, a policy which neglects shielding of the elderly is not efficient and its outcomes can always be improved through shielding measures. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . Impact of shielding on policy efficiency For policies without shielding, the level of social gatherings, u O , is the leading factor to determine the efficiency frontier. In Figures 33a , the efficiency frontier contains two classes of policies: 'School and Work' and 'No Pubs'.

• 'School and Work' policies, which do not include any restrictions on school or work (u S = 1, u W = 1) but varying level of restrictions on social gatherings (0.3 ≤ u O ≤ 1).

Within this class of policies, different level of social gatherings lead to very different outcome of fatalities, as illustrated in Figure 33a . However, as observed in Figure 33b , these policies are not efficient when shielding measures are put in place for the elderly. Under shielding, the spectrum of efficient policies is parameterised by the fraction u W of the workforce returning to work. As shown in Figure 33c , we can distinguish two classes of efficient policies under shielding:

• 'School and Pubs', consisting of policies without restrictions on schools or social gatherings (u S = 1, u O = 1) and different levels u W of restrictions on workplace gatherings.

• 'Restricted Work' policies, under which only 'essential' workers are allowed on-site work (u W = 0.2), with either (i) no school restrictions (u S = 1) and different levels of restrictions on social gatherings (0.2 ≤ u O ≤ 1) or (ii) restrictions on social gatherings (u O = 0.3, that is 'no pubs') and different levels of social distancing in school (0 ≤ u S ≤ 1).

As Figure 33d illustrates, 'School and Pubs' and 'Restricted Work' policies are not efficient without shielding. In absence of shielding, social gatherings seem to be the main vector for contagion. When shielding measures are put in place, the social contacts associated with the elderly are reduced to the same level as under lockdown; in this case, contacts at work become the main vector of contagion.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

We now consider adaptive mitigation policies, in which the daily number of (national or regional) reported cases is used as a trigger for social distancing measures. Such policies have been recently implemented, in the U.K. and elsewhere, at a local or national level with various degrees of success. We distinguish centralised policies, based on monitoring of national case numbers, from decentralised policies where monitoring and implementation of measures are done at the level of (NUTS-3) regions.

We consider centralised policies which monitor the number of daily reported cases at country level. Whenever the whenever the number of daily reported cases (per 100,000) exceeds a threshold R on , confinement measures are imposed for a minimum of L days, until the number of daily reported cases falls below the threshold R off < R on . Outside these lockdown periods we assume social distancing is in place with a compliance level m 42 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . is the social compliance; we use a default value of m = 0.5.

This policy is implemented after the initial lockdown (i.e after July 4, 2020). In terms of the social contact matrix, we have, for t > t 0 + T ,

100000 R on and i t−1 = 0); σ r (t) = l r σ r , and I(t) = 1, if C t > N 100000 R on or Π t−1 s=t−L i s = 1.

(7.1)

Here T = 105, i t is the indicator of whether lockdown is applied on day t and C t is the daily reported cases in England on day t. Π t−1 s=t−L i s = 1 if lockdown has been applied for L consecutive days during the period [t − L, t − 1].

We simulate the dynamics with various choices of R off and R on :

• R on ∈ {2, 4, 6, 8, 10} (daily reported cases per 100,000 inhabitants); and

We assume that once a lockdown is triggered it lasts a minimum of L = 7 days and that, once lockdown is removed, individuals continue to observe social distancing as measured by the parameter m ∈ [0, 1]. Data on real-time mobility monitoring in the UK 11 , indicate mobility to be at 50% of normal level during the post-lockdown period, and thus we use m = 0.5 as a default value.

(a) Influence of the threshold R on to resume lockdown.

(b) Influence of the threshold R off to lift lockdown. Example Figure 35 shows an example of such an adaptive policy, where lockdown is triggered when daily cases exceeds 2240 nationally, and maintained until the count of new daily cases drops to 896. In the scenario shown in Fig. 35a , this results in four short lockdowns, totaling 34 days in all, which bring under control the national progression of the epidemic and avoid a 'second peak' at national level. However, as shown in Figure 35b , this policy is less successful at regional level, resulting in a regional outbreak in Leicester.

11 https://www.oxford-covid-19.com/

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. We observe in our simulations a second peak in I t for England when R on = 100, while we observe no second peak when R on = 20. When R on = 20, I t remains at level 2 × 10 5 with frequent interventions for 200 days and then decreases to zero. The social cost for policy R on = 100 and policy R on = 20 are 2.9 and 3.2, respectively. Policy R on = 20 have 20% fewer fatalities compared to policy R on = 100. An example region, Oxfordshire, has the same shape of I t in England when R on = 100. However, the shape of I t is different for R on = 20 where Oxfordshire experiences a small second peak around day 400.

In summary, smaller R on values correspond to a more frequent lockdown interventions and create a resilience to the second peak.

Increasing testing capacity To study the effect of an increased testing capacity, we assume wide testing is adopted such that the reporting probability is increased from 4.5%

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . to a significantly higher level (20%, 50%) on July 4th (see Figure 37 ). Reporting prob. π 4.5% 20% 50% Social Cost (10 11 ) 3.0 3.4 4.0 Fatalities 141,700 120,400 92,300 Table 7 : Average social cost and fatalities for a given policy with different testing capacities (50 paths). Policy: R on = 60, R off = 0.4R on , and m = 0.5.

By increasing the testing capacity, the observable quantity of daily reported cases becomes more consistent with the underlying dynamics of I t . Compared to the policy with a reporting probability π = 4.5% throughout the reference period, we see that the dynamics of I t when π = 50% decrease to a small value rapidly. Increasing the testing capacity also implies a more efficient control and as a result leads to fewer fatalities.

We now consider a decentralised version of the above policies, based on monitoring of regional number of cases as triggers for regional confinement measures. In terms of the social contact matrices, we have, for t > t 0 + T ,

100000 R on and i r t−1 = 0); σ r (t) = l r σ r , and i r t = 1, if C t (r) > N (r) 100000 R on or Π t−1 s=t−L i r s = 1.

Here i r t is the indicator of whether lockdown is applied in region r on day t and C t (r) is the daily number of cases reported in region r on day t. The term Π t−1 s=t−L i r s is used to track if lockdown has been applied in region r for L consecutive days during [t − L, t − 1]. We use the same values of R on and R off as in Section 7.1.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . As an example, for R on = 4 and R off = 0.4R on fatalities in England are 137, 700 under the centralised policy and 126, 300 under the decentralised policy, that is 8% lower. Figure 41 compare regional fatalities per 100,000 habitants for these policies. For more than 90% of the regions, decentralised measures lead to fewer fatalities. The most effective reductions are in Dorset, South West England (UKK22) with 23% fewer fatalities and in Portsmouth (UKJ31) with 22% fewer fatalities. There are a few exceptions (see regions in light blue in Figure 41c ). These regions are already under control before adaptive policies are applied. Therefore the improvement of moving from centralised policy to decentralised policy is limited. Figure 42a compares, for the same example, the dynamics of symptomatic infections (I t ). There is a reduction of 100,000 in the amplitude of the second peak value when moving from the centralised policy to decentralised one. Decentralised policy also damps the 46 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint second-peak values in most of the regions. Similar effects are observed for York ( Figure 42c ) and Leicester (Figure 42b) .

On June 29, 2020, Leicester became the first city in Britain to be placed in a local lockdown, after public health officials voiced concern at the city's alarming rise in COVID-19 cases. Earlier in June, the Government announced that parts of the city would be released from lockdown, while a "targeted" approach will see pockets remain under tighter restrictions. Our simulations indicate a 60% reduction of the second-peak value in Leicester when a decentralised policy is implemented (Figure 42b ).

Example Figure 40 shows an example of such a decentralised triggering policy, with the same triggering thresholds as in the centralised example in Fig. 35 . At regional level, we see in Figure 40a that this policy is more successful than the centralised policy in taming the local outbreaks in Leicester, substantially reducing the second peak through 8 one-week regional lockdowns. At the national level this results in a strong damping of 'second wave' infections, as shown in Fig. 40b (compare with Fig. 35a ). 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint (a) Centralised (country-level) adaptive policy. (b) Decentralised (regional) adaptive policy.

(c) Increase in fatalities (per 100,000 inhabitants) when moving from regional to centralised policy. Figure 41 : Fatalities per 100,000 inhabitants for centralised (left) verus regional (right) adaptive mitigation policies. Same triggering thresholds are used in both cases: R on = 4 and R off = 0.4R on .

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. We observe that

• Adaptive policies, in which measures are triggered when the number of daily new cases exceeds a threshold, are more efficient than pre-planned policies; and

• As shown in Figures 43a and 43b , a decentralised policy is more efficient than both centralised policy and pre-planned policy.

In Table 8 , we provide a summary of outcomes for five different types of policies.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint • Confinement followed by social distancing: T = 105 and m = 0.5 and no shielding.

• Pre-planned policy: social distancing at work and school (u H = 1, u S = 0.5,u W = 0.5), restrictions on social gatherings (u O = 0.3) and no shielding.

• Centralised and decentralised triggering policies (Sec. 7.1 and Sec. 7.2) with m = 0.5, R on = 4, R off = 0.4R on and no shielding;

• Decentralised triggering combined with shielding of elderly populations: m = 0.5, R on = 4, R off = 0.4R on ;

• 'Protect Lives' policy: in the range of efficient policies, the one which results in the fewest fatalities is a decentralised triggering policy with R on = 2, R off = 0.2R on ( so more frequent triggering of confinement measures than the above), high degree of social distancing (m = 0.25) and shielding of elderly populations. This policy corresponds to the point in the lower right corner of Figure 43b . The social cost is 4.52, much higher than the other policies considered. Outcomes are averaged across 50 scenarios, starting from the same initial conditions on July 4 (end of the national UK lockdown).

Regional outcomes Comparing the regional outcomes of the centralised, decentralised and pre-planned policies displayed in Table 8 shows that the decentralised triggering policies are able in many cases to considerably damp the 'second wave' of infections. Figure 44 illustrates this in the case of Leicester and Birmingham: the decentralised triggering policy reduces the second peak amplitude by more than two-thirds compared to the pre-planned policy.

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint A Demographic regions Table 9 details the used mapping between Upper Tier Local Authority (UTLA) region codes and the Nomenclature of Territorial Units for Statistics at level 3 (NUTS-3) codes. If a single UTLA region falls within multiple NUTS-3 regions, the data is distributed proportionally to the population size in each of these regions. On the other hand if multiple UTLA regions are contained within a single NUTS-3 region, the data is simply aggregated. 58 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. 59 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint

This appendix provides a description of sources used for baseline social contact rate parameters. Two sources have been used: the POLYMOD study [41] , processed using PyRoss [54] , and the BBC Pandemic study [28] . These parameters are used as a baseline and a further detailed calibration is carried out region by region to account for heterogeneity of social contact patterns across UK regions.

The PyRoss library [2] uses a Bayesian Hierarchical framework to decompose contact rates into 'work', 'home', 'school', and 'other' [45] .

We estimate a contact matrix for four age groups [0, 20) , [20, 60) , [60, 70) and [70, 100) using UK demographic data 12 . This leads to the following contact matrices, visualized in Figure 45 : 

. CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

The copyright holder for this preprint this version posted September 1, 2020. . https://doi.org/10.1101/2020.08.26.20182477 doi: medRxiv preprint

A multi-risk SIR model with optimally targeted lockdown

Inference, prediction and optimization of non-pharmaceutical interventions using compartment models: the PyRoss library

Investigating the impact of asymptomatic carriers on covid-19 transmission. medRxiv

A primer on stochastic epidemic models: Formulation, numerical simulation, and analysis

Infectious Diseases of Humans: Dynamics and Control

Stochastic epidemic models and their statistical analysis

Incubation period of 2019 novel coronavirus (2019-ncov) infections among travellers from Wuhan, China

Reinfection could not occur in SARS-CoV-2 infected rhesus macaques. bioRxiv

The French connection: the first large population-based contact survey in france relevant for the spread of infectious diseases

Controlling epidemic spread: Reducing economic losses with targeted closures

Mathematical Models for Communicable Diseases

Stochastic epidemic models with inference

The role of asymptomatic SARS-CoV-2 infections: A rapid systematic review

Hongbing Song, and Daniel Dajun Zeng. Estimating the effective reproduction number of the 2019-ncov in china. medRxiv

Modeling strict age-targeted mitigation strategies for covid-19

CMMID COVID-19 working group, et al. Age-dependent effects in the transmission and control of covid-19 epidemics

Modeling the heterogeneity in COVID-19 reproductive number and its impact on predictive scenarios

Severity of 2019-novel coronavirus (ncov)

Key workers: key facts and questions. IFS Observation, Institute for Fiscal Studies

Impact of non-pharmaceutical interventions (npis) to reduce COVID-19 mortality and healthcare demand

Estimating the number of infections and the impact of non-pharmaceutical interventions on covid-19 in 11 european countries

Indirect inference

Mathematical models of infectious disease transmission

Overcoming reporting delays is critical to timely epidemic monitoring: The case of covid-19 in new york city. medRxiv

Assessing the variability of stochastic epidemics. Mathematical biosciences

Inferring the number of covid-19 cases from recently reported deaths

Epidemiological characteristics of covid-19: A systemic review and meta-analysis. medRxiv

Contacts in context: large-scale setting-specific social mixing matrices from the bbc pandemic project. medRxiv

Early dynamics of transmission and control of COVID-19: a mathematical modelling study. The Lancet Infectious Diseases

The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application

The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application

Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia

Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: a statistical analysis of publicly available case data

Epidemiological characteristics of novel coronavirus infection: A statistical analysis of publicly available case data. medRxiv

Potential biases in estimating absolute and relative case-fatality risks during outbreaks. PLOS Neglected Tropical Diseases

Mitigation strategies for covid-19: Lessons from the k-seir model. Available at SSRN 3623544

Spatiotemporal dynamics of epidemics: synchrony in metapopulation models

Fundamental principles of epidemic spread highlight the immediate need for large-scale serological surveys to assess the stage of the sars-cov-2 epidemic

Estimating the asymptomatic proportion of Coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship

Social contacts and mixing patterns relevant to the spread of infectious diseases

Estimation of the asymptomatic ratio of novel coronavirus infections (covid-19). International Journal of Infectious Diseases

COVID-19) infections in the community in England

Covid-19 and the welfare effects of reducing contagion

Projecting social contact matrices in 152 countries using contact surveys and demographic data

The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in wuhan, china: a modelling study

How and when to end the COVID-19 lockdown: an optimisation approach. medRxiv

Covid-19 outbreak on the diamond princess cruise ship: estimating the epidemic potential and effectiveness of public health countermeasures

A parsimonious model for spatial transmission and heterogeneity in the covid-19 propagation. medRxiv

Impact of lockdown on the epidemic dynamics of COVID-19 in France

A cost-benefit analysis of the Covid-19 disease

Estimating the burden of SARS-CoV-2 in France

Analysis of human mobility in the UK during the COVID-19 pandemic

Age-structured impact of social distancing on the covid-19 epidemic in india

Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet Infectious Diseases