key: cord-0824027-8evdbi4u authors: Chin, V.; Ioannidis, J.; Tanner, M.; Cripps, S. title: Effects of non-pharmaceutical interventions on COVID-19: A Tale of Two Models date: 2020-07-27 journal: nan DOI: 10.1101/2020.07.22.20160341 sha: e392466fc4dcd54b721b3ad3b7b51a5b2df5e14c doc_id: 824027 cord_uid: 8evdbi4u In this paper, we compare the inference regarding the effectiveness of the various non-pharmaceutical interventions (NPIs) for COVID-19 obtained from two SIR models, both produced by the Imperial College COVID-19 Response Team. One model was applied to European countries and published in Nature, concluding that complete lockdown was by far the most effective measure and 3 million deaths were avoided in the examined countries. The Imperial College team applied a different model to the USA states. Here, we show that inference is not robust to model specification and indeed changes substantially with the model used for the evolution of the time-varying reproduction number. Applying to European countries the model that the Imperial College team used for the USA states shows that complete lockdown has no or little impact, since it was introduced typically at a point when the time-varying reproduction number was already very low. We also show that results are not robust to the inclusion of additional follow-up data. The two models (Flaxman et al. 1 and Unwin et al. 2 ) produced by the Imperial College COVID-19 Response Team aim to explain the evolution of R t . We will refer to these models as model 1 (the model applied to European countries in the Nature publication 1 ) and model 2 (the model applied to the USA states 2 ), respectively. The two models for the R t are fundamentally different. In model 1, the proportional variation of R t from the initial R 0 is modelled as a step function and only allowed to change in response to an intervention. Therefore, any decrease in R t (even if this decrease is a result of the increasing proportion of the population who are infected, to changes in human behaviour, clustered contact structures and/or pre-existing immunity 3 ) must, by the model construction, be attributed to interventions and the impact is immediate with no time lag or gradual change, when a new intervention is adopted. In model 2, the proportional variation of R t from R 0 has a different functional form and is allowed to vary with mobility indicators for various activities. These mobility indicators are proxies for changes in human behaviour, whether that change is due to one or more centrally imposed interventions or whether it is the product of individuals responding to the epidemic on their own initiative independently of centrally imposed interventions. Model 2 also does not presume step functions and is therefore capable of capturing more gradual changes over time. Model 1 is deliberately simple and designed to test the impact of interventions without the confounders of mobility data, which themselves could be the product of the interventions. Model 2 leaves out interventions and uses mobility data as predictors in the evolution of R t . The advantage of model 2 is that it gives a more flexible estimate of R t , by allowing it to change with mobility trends. Although there is no explicit causal structure in model 2, the time sequence of the data and events makes inference around the impact of interventions possible by observing if the change in R t precedes a given intervention or interventions or not. Here, we apply both models 1 and 2 to the European data to compare the results and inferences they obtain. See supplementary methods for details on methods and data. There are two main observations when comparing the modeling results in Figure 1a , as well as the corresponding Extended Data Figures 1a-1e. First, and most notable, is that while the models give very different trajectories of R t , both models produce similar and accurate fit to the observed daily death counts. That is, very different processes of R t give rise to the same daily death count data. The second observation is that inference regarding the impact of interventions varies significantly between the two models. The inference from model 1 indicates that lockdown had the biggest impact of all the interventions in all countries. Indeed, Flaxman et al. 1 states that Lockdown has an identifiable large impact on transmission (81% [75%-87%] reduction). In contrast, model 2 shows clearly that R t was falling well before lockdown, which occurred after the sharp decline in R t . With the exception of Belgium (and to a lesser degree France), this is consistently seen across the European countries (see Extended Data Figures 1a-1e) and is in line with the mobility data, see Figure 2 . At the time lockdown was adopted in the UK, R t had already decreased to 1.46 (95% CI, 0.89 to 2.20) from an initial R 0 of 4.46 (95% CI, 2.62 to 7.20) according to model 2, and the largest drop in R t occurred after the implementation of self-isolation measures and encouraging social distancing, see Table 1 . We also evaluated the effect of extending the time horizon of analysis until July 12th where 2 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint (a) Daily infections, daily deaths and R t until May 5th. (b) Daily infections, daily deaths and R t until July 12th in columns 1-3. Column 4 is a magnification of column 3 around the period of the NPIs. Figure 1 : United Kingdom. The start time for the plots is 30 and 10 days before 10 deaths are recorded, in the two models, respectively. Observed counts of daily infections and daily deaths are shown in red, and their corresponding 50% and 95% CIs are shown in dark blue and light blue respectively. 3 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 27, 2020. restrictions had been lifted in some of these countries. We include an additional three countries, i.e. Greece, the Netherlands and Portugal, in the analysis. Extended Data Table 1 shows the end dates of school closures, ban on public events and lockdown used in the analysis. Figure 1b shows a comparison of the results obtained from both models for the UK for the extended data while Extended Data Figures 2a-2g show similar results for the remaining 13 countries. Table 2 tabulates 4 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. Table 1 : Basic reproduction number R 0 and time-varying reproduction number R t immediately at time of introduction of NPIs given by both models using data up to May 5th for all eleven countries analysed in Flaxman et al. 1 . These NPIs are self-isolation (SI), social distancing (SD), school closure (SC), event ban (EB) and lockdown (LD). 95% credible intervals are given in parentheses below the corresponding point estimates. Countries where the seeding of new infections occur after the introduction of NPIs are denoted with an asterisk. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. . the original R 0 and R t at the time of adoption of each NPI in each country for both models for the period March 4th to July 12th. The change in inference from model 1 as a result of the additional data is astonishing. Table 2, Figure 1b , and Extended Data Figures 2a-2g show that with the inclusion of the data until July 12th, R t before lockdown was already below 1 for ten of the fourteen countries. The 95% credible intervals for the UK, Austria, Germany and Spain exceeded 1 before lockdown. Indeed, in contrast to the results reported in the Nature paper 1 , results from model 1 suggest that the banning of public events and social distancing are more proximally related to the reduction in R t than lockdown. For example, while the results from model 1 in Figure 1a , which corresponds to the data until May 5th, suggests that lockdown led to the largest decrease in R t , it now indicates that the largest decrease in R t happened after social distancing is practised from a mean value of 5.27 (95% CI, 4.50 to 6.04) to 2.10 (95% CI, 1.61 to 2.64)! Note that the inference from model 1 regarding the relative effectiveness of lockdown is now more consistent with the inference from model 2 across most countries. Finally, we should point out that Figure 1b show that the gradual increase in R t following the minimum value commenced well before the lifting of lockdown. This suggests that as people become less frightened by the prospect of a catastrophe, mobility (see Figure 2) , and hence the time-varying reproduction number increases. These findings clearly present policy makers with a conundrum. Which results should be used to guide policy making in lifting restrictions? Flaxman et al. 1 make the statement We find that, across 11 countries, since the beginning of the epidemic, 3,100,000 [2,800,000 -3,500,000] deaths have been averted due to interventions. However, we have shown that two different models, both of which give plausible fits to the actual death count data, yield vastly different inferences concerning the effectiveness of intervention strategies for the period March 4th to May 5th in the UK. Results are also different when the observation period is extended and easing of restrictions is included in the model. Although it is tempting to congratulate ourselves on our decision to implement lockdown, citing the number of lives that were saved, we should resist this temptation, and examine other possible explanations. Failure to do this and therefore mis-attribute causation could mean we fail to find the optimal solution to this very challenging and complex problem, given that complete lockdown can also have many adverse consequences 4 . We do not want to reach the opposite extreme of claiming with certainty that lockdown definitely had no impact. Other investigators using a different analytical approach have suggested also benefits from lockdown, but of much smaller magnitude (13% relative risk reduction 5 ) that might not necessarily match complete lockdown-induced harms in a careful decision analysis. Another modeling approach has found that benefits can be reaped by simple self-imposed interventions such as washing hands, wearing masks, and some social distancing 6 . Observational data need to be dissected very carefully and substantial uncertainty may remain even with the best modelling 7 . Regardless, causal interpretations from models that are not robust should be avoided. Given the analyses that we have performed using the two models that the Imperial College team has developed, one cannot exclude that the attribution of benefit to complete lockdown is a modelling artefact. 6 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. 7 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Model 2 assumes that R t is a function of mobility and allows this impact to vary regionally by the use of regional specific random effects terms. The data used by Unwin et al. 2 to estimate R t is the Google's COVID-19 Community Mobility Report. We adapt this methodology to the European context by modelling potential idiosyncrasies in mobility trends across countries using countryspecific random effects. To complete the specification of model 2 for the European data, we use Google's COVID-19 Community Mobility Report 8 which provides data measuring the percentage change in mobility compared to a baseline level for visits to a number of location categories; retailers and recreation venues, grocery markets and pharmacies, parks, transit stations, workplaces and residential places. We use the average change in mobility across all location categories, excluding residential places and parks, as a measure of the reduction in mobility. The seeding of new infections in model 2 is chosen to be 10 days before the day a country has cumulatively observed 10 deaths so that mobility data are available for all the countries examined. In the UK, this corresponds to March 4th, which is later than the infection start date of February 13th used by Flaxman et al. 1 . For posterior inference of model 2, we use the same prior distributions as in Unwin et al. 2 except for R 0 , where a weakly informative prior of a normal distribution truncated below at 1 with mean 3.28 and standard deviation 2 is used. This prior is chosen so that approximately 95% of the prior density is between 1 and 7 9 , and that R 0 is above the critical value of 1 at the start of the epidemic. For model 1, we use the same priors as in Flaxman et al. 1 for the analysis up to May 5th and July 12th. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Extended Data Figure 1a : Austria and Belgium. Plots of daily infections, daily deaths and timevarying reproduction number R t until May 5th. The start time for the plots is 30 and 10 days before 10 deaths are recorded, in the two models, respectively. Observed counts of daily infections and daily deaths are shown in red, and their corresponding 50% and 95% CIs are shown in dark blue and light blue respectively. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Extended Data Figure 1b : Denmark and France. Plots of daily infections, daily deaths and timevarying reproduction number R t until May 5th. The start time for the plots is 30 and 10 days before 10 deaths are recorded, in the two models, respectively. Observed counts of daily infections and daily deaths are shown in red, and their corresponding 50% and 95% CIs are shown in dark blue and light blue respectively. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Extended Data Figure 1c : Germany and Italy. Plots of daily infections, daily deaths and timevarying reproduction number R t until May 5th. The start time for the plots is 30 and 10 days before 10 deaths are recorded, in the two models, respectively. Observed counts of daily infections and daily deaths are shown in red, and their corresponding 50% and 95% CIs are shown in dark blue and light blue respectively. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Extended Data Figure 1d : Norway and Spain. Plots of daily infections, daily deaths and timevarying reproduction number R t until May 5th. The start time for the plots is 30 and 10 days before 10 deaths are recorded, in the two models, respectively. Observed counts of daily infections and daily deaths are shown in red, and their corresponding 50% and 95% CIs are shown in dark blue and light blue respectively. 14 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Extended Data Figure 1e : Sweden and Switzerland. Plots of daily infections, daily deaths and time-varying reproduction number R t until May 5th. The start time for the plots is 30 and 10 days before 10 deaths are recorded, in the two models, respectively. Observed counts of daily infections and daily deaths are shown in red, and their corresponding 50% and 95% CIs are shown in dark blue and light blue respectively. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted July 27, 2020. . https://doi.org/10.1101/2020.07.22.20160341 doi: medRxiv preprint Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe State-level tracking of COVID-19 in the United States Targets of T cell responses to SARS-CoV-2 coronavirus in humans with COVID-19 disease and unexposed individuals Should governments continue lockdown to slow the spread of COVID-19? Physical distancing interventions and incidence of coronavirus disease 2019: Natural experiment in 149 countries Forecasting for COVID-19 has failed Last accessed The reproductive number of COVID-19 is higher compared to SARS coronavirus Oxford COVID-19 Government Response Tracker Policy Responses to the Coronavirus Pandemic Denmark reports no spike in coronavirus cases since lifting lockdown AFTER LOCKDOWN: Are Denmark's and Norway's restrictions now like Sweden's? We congratulate the Imperial College Response Team for sharing openly the code for their models and for the overall transparency of their work that has allowed performing these analyses. We also thank Jack Wood for his help in the construction of Extended Data Table 1 . All authors contributed equally to this work. VC performed all the computations and produced all the graphics. SC wrote the initial draft. JI and MT wrote subsequent drafts. All authors discussed the results and implications and commented on the manuscript at all stages. None. All source code for the replication of our results is available from the Imperial College COVID-19 Response Team's Github repository: https://github.com/ImperialCollegeLondon/covid19model. Daily confirmed cases and deaths data are publicly available from the European Centre of Disease Control's website. Extended Data Table 1 : End dates for school closures 10 , public events ban 10 and lockdown in each country 11;12;13 . NPIs that are still in place are shown in , while NPIs that were not implemented are shown in .