key: cord-1008068-xxak39px
authors: Orea, Luis; Álvarez, Inmaculada C.
title: How effective has the Spanish lockdown been to battle COVID‐19? A spatial analysis of the coronavirus propagation across provinces
date: 2021-10-23
journal: Health Econ
DOI: 10.1002/hec.4437
sha: 7d8185ab62c840f9a84d0817126d4e3271f6f6be
doc_id: 1008068
cord_uid: xxak39px

This paper examines the propagation of COVID‐19 across the Spanish provinces and assesses the effectiveness of the Spanish lockdown of the population implemented on March 14, 2020 in order to battle this pandemic. To achieve these objectives, a standard spatial econometric model used in economics is adapted to resemble the popular reproduction models employed in the epidemiological literature. In addition, we introduce a counterfactual exercise that allows us to examine the gross domestic product (GDP) gains of bringing forward the date of the Spanish lockdown. We find that the number of COVID‐19 cases would have been reduced by 70.4% in the absence of spatial propagation between the Spanish provinces. We also determine that the lockdown prevented the propagation of the virus within and between provinces. As such, the Spanish lockdown reduced the number of potential COVID‐19 cases by 82.8%. However, the number of coronavirus cases would have been reduced by an additional 11.6% if the lockdown had been brought forward to March 7, 2020. Finally, an earlier lockdown would have saved approximately 26,900,000,000 euros.

prohibited all in-class teaching in their regions over the following 3 days. Local outbreaks forced the Government of Cataluña to quarantine four Catalan municipalities on 12 March. The Spanish government declared a national lockdown of the population (or state of alarm) and prohibited public events on 14 March in an attempt to combat COVID-19. All shops except pharmacies and stores selling basic necessities were also forced to close.
As the pandemic continued to spread after this date, it is germane to assess the effectiveness of this dramatic public intervention as well as the impact of other control measures. How human mobility explains the initial spread of COVID-19 is also an interesting issue worthy of close examination as it might prove helpful in understanding the propagation of the pandemic as well as limiting the impact of future waves. 1 This paper aims to shed some light on the above issues using a spatial econometric analysis of the coronavirus propagation in Spain. Our empirical model aims to explain the daily evolution of the confirmed cases in the Spanish mainland provinces during the period between the onset of the pandemic in each province and April 4, 2020. In line with Giuliani et al. (2020) , we distinguish between the propagation of the virus within a neighborhood, city or province and the propagation of COVID-19 across provinces. The origin of said spatial dimension of propagation is the high mobility of people across provinces. This feature enables us to test whether the lockdown was effective in both preventing the propagation of the coronavirus between provinces and in attenuating the propagation of the virus within each province. The added value of this study is the following. This is the first paper that examines the effectiveness of the control measures in Spain, as well as being one of the first in the recent literature that achieves this objective by controlling for spatial propagation effects, an issue that is treated only marginally in the epidemiological literature. Noteworthy exceptions are Giuliani et al. (2020) , Gross et al. (2020) , and Dickson et al. (2020) . While most of the previous literature aims to estimate reproductive numbers, mortality, and other epidemic features, we apply more standard econometric techniques used in economics to carry out our empirical exercise. 
We show how such a model can be adapted to resemble the popular reproduction-based models used in the epidemiological literature, which often ignore the existence of spatial propagation effects and unobserved local conditions. In addition, this paper includes a second major contribution, given that we also examine the economic impact of the Spanish lockdown implemented on March 14, 2020 in terms of Gross domestic product (GDP) losses at regional level. Based on the annual GDP growth rate forecasts per week of lockdown provided by BBVA Research (2020), using a counterfactual exercise we compute the economic effect of the actual lockdown and the GDP gains of bringing forward the date of the Spanish Lockdown. The paper is structured as follows. Section 2 summarizes the empirical strategy used in this paper to assess the effectiveness of sizable public control measures implemented nationwide in Spain aimed at containing the outbreak, controlling for (and measuring) expected propagation effects across the Spanish mainland provinces. Section 3 briefly describes the data used in the empirical analysis and its sources. Section 4 provides the parameter estimates and discusses the main results. Finally, Section 5 presents the conclusions. This is an expected result due to the gap, which exists between when a person becomes infected and when they might subsequently infect another person, which is on average about 6 or 7 days (see, Flaxman et al., 2020, p. 18) . Moreover, as pointed out by a referee, this result might also be caused by the lag between infection and the onset of symptoms and the existence of a large proportion of under-reported cases due to testing in March being saved and prioritized for only the most severe hospital cases. Notice that our model specification looks like a Difference-in-Difference (DiD) model where we compare an outcome variable before and after treatment (a policy measure), having controlled for unobserved differences across units (provinces). 
Although the lockdown of the population in Spain was implemented in all provinces on March 14, 2020, the advance of the pandemic in each province was rather different at that time. Therefore, our identification strategy is based on the relatively large dispersion of pandemic developments (i.e., onset dates) across provinces, and that the onset dates are orthogonal to the lockdown implementation date. We estimate the above model after taking natural logarithms to make it linear. Once we take natural logarithms, and a traditional noise term is added, the model to be estimated is: v is a mean-zero error term capturing random shocks, measurement or specification errors, and other unobservable variables not correlated with the rates of growth determinants. We used the logarithm transformation of the growth rates because it can be estimated using the standard linear Fixed-Effect (FE) estimator, which is equivalent to a linear panel data DiD estimation (Lechner, 2010, p. 189) . This estimator ensures obtaining consistent causal effects attributable to a given policy measure, even in those cases where the time-invariant unobservable variables are correlated with the treatment variable (the lockdown dummy variable in our case). For instance, in our application, we might think that the centrality of Madrid and the greater mobility of the people living in Madrid and other populated cities/provinces were responsible for triggering the implementation of the Spanish lockdown. It is also worth mentioning that in our paper we are not examining causal epidemiological effects in the sense that, for instance, infected individuals in period t cause secondary infections in period t + 1, and so on. This type of causal effect cannot be examined using a reduced-form model that simply aims to fit the observed epidemic curve of cumulative cases. 
However, the DiD specification of our reduced-form model is able to measure causal effects of a different nature, that is, those attributable to the public control measures implemented nationwide in Spain around March 14, 2020 aimed at containing the coronavirus outbreak during the first wave of the pandemic.

There is an extensive literature that uses human mobility to measure the spread of infectious diseases. In this sense, it is worth mentioning the articles by Belik et al. (2011) and Bajardi et al. (2011), among others, which provide computational and theoretical models seeking to address the effect of human mobility and mobility restrictions on containing outbreaks of infectious diseases. Findlater and Bogoch (2018) find that the increasing volume of passenger travel, especially by air, has enabled global epidemic transmission. More recently, the use of new technologies such as mobile phones has facilitated the measurement of human mobility and its effects on disease connectivity (Lai et al., 2019). Recent studies focus specifically on severe acute respiratory syndrome coronavirus 2, concluding that human mobility predicts the spread and size of the epidemic and that travel restrictions are particularly useful in the early stage of the outbreak (see, e.g., Kraemer et al., 2020). This literature also demonstrates that viruses can spread through human contact patterns, given that human mobility promotes social interaction (Mollgaard et al., 2017). Several studies corroborate these findings for Europe (see, e.g., Iacus et al., 2020; Lemey et al., 2021).

We use a Spatial Lag Model (SLX) specification to examine the role of human mobility in spreading the virus across the Spanish provinces. Inter-provincial mobility is captured using the spatial weight matrix W = (W_1, ..., W_N). This spatial matrix can be computed in different ways. We follow Giuliani et al. (2020) and Gross et al.
(2020) and use a contiguity or binary W matrix, where the weights equal one for adjacent units and zero for non-bordering units. In their spatial analysis of the spread of COVID-19 in Italy, Bourdin et al. (2021) performed several tests to select the best spatial weight matrix and selected, like us, the first-order contiguity matrix. We select the epidemic time of neighboring provinces (i.e., X_it = ln K_it) in order to capture the potential propagation effects between provinces for two reasons. First, this variable is exogenous by construction. In a Spatial Autoregressive (SAR) model specification, X_it is replaced with (a transformation of) the dependent variable, which is endogenous and should thus be instrumented, provided that good instruments are available. Second, Vega and Elhorst (2015, p. 342) suggest taking the SLX model as a point of departure because it is not only the simplest specification but is also more flexible in modeling spatial spillover effects than other specifications.

Three drawbacks of our empirical strategy are worth noting. First, although the linear FE model (3) has some features that are very appealing for our application, estimating the above logged linear model implies dealing with the zero growth rates of cumulative cases that often appear at the beginning of outbreaks. We could address this issue by dropping such observations from the sample. As this approach might generate sample selection bias if the missing observations are not random, we instead replace the zero values with a tiny but positive number before taking logs and keep the adjusted zero-value observations in our sample. We include a dummy variable controlling for (adjusted) zero values as an additional explanatory variable. This variable not only allows us to control for potential measurement issues but also prevents the sharp declines in growth rates caused by zero values from distorting the third-order parametric function of epidemic times.
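The two data-preparation steps described above (the zero-value adjustment before taking logs, and the spatial lag of the epidemic time under a binary contiguity matrix) can be illustrated with a minimal sketch. The function names, the value of `eps`, and the definition of the growth rate as K_t / K_{t-1} - 1 are assumptions made for illustration only; the paper states merely that zeros are replaced with "a tiny but positive number".

```python
import numpy as np

def log_growth_with_zero_dummy(cum_cases, eps=1e-6):
    """Log growth rates of cumulative cases plus a zero-value dummy.

    Assumes the series starts at the onset date, so every value is >= 1.
    `eps` is a hypothetical choice for the tiny positive replacement value.
    """
    cum = np.asarray(cum_cases, dtype=float)
    growth = cum[1:] / cum[:-1] - 1.0           # assumed growth-rate definition
    zero_dummy = (growth <= 0.0).astype(int)    # flags days with no new cases
    adjusted = np.where(zero_dummy == 1, eps, growth)
    return np.log(adjusted), zero_dummy

def spatial_lag(W, x):
    """Spatial lag W @ x for a binary contiguity matrix W (not row-standardized)."""
    return np.asarray(W, dtype=float) @ np.asarray(x, dtype=float)

# Three hypothetical provinces on a line: 0 borders 1, 1 borders 2.
W = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])
ln_K = np.log(np.array([10.0, 20.0, 5.0]))      # log epidemic times
lagged = spatial_lag(W, ln_K)                   # neighbors' log epidemic time

log_g, zero = log_growth_with_zero_dummy([1, 2, 2, 4])
```

The dummy returned alongside the logged growth rates would enter the regression as an additional explanatory variable, so that the adjusted observations stay in the sample without distorting the polynomial in epidemic time.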
Second, in the first wave of the pandemic, no European country had sufficient testing capacity so that reported cases are a small fraction of the true number of infections. We can discuss whether this issue matters in our empirical application using the preliminary results of Orea et al. (2021) , an ongoing study that complements the current paper as it tries to account for the prevalence of undocumented cases. In this paper we propose a stochastic frontier analysis approach for estimating epidemic curves, where the unobserved cases are proxied using a one-sided random term in the same fashion as firms' inefficiency in production economics. We find that the average reporting rate is around 42%. Despite this, we obtain very similar effects due to lockdown on the growth rates of coronavirus cases (6.8 percentage points [pp] on average) compared to our non-frontier application. So, our results would seem to be quite robust in terms of this issue. Another but related matter has to do with the onset date of the pandemic used in our paper. Our epidemic time variable is defined as the number of days relative to the observed onset date of the pandemic, which relies on reported cases. Therefore, it is not a necessary circumstance that a single reported case on a certain date seeded the pandemic in a particular province due to underreporting of cases. In order to see whether in practice the gap between observed and true onset dates is an important issue, we have modified the simulation of Chudik et al. (2020) and simulated several scenarios with different observed onset dates due to underreporting. 3 Two results of the simulation are worth mentioning. First, the goodness-of-fit of our model does not deteriorate when underreporting increases if the level of underreporting is common to all provinces. Second, the goodness-of-fit of the model does deteriorate when underreporting is large and the gap between observed and true onset dates varies notably across provinces. 
In this case, however, a linear model with fixed effects allowed us to retrieve the predictive capabilities of the model. In this sub-section we discuss the nature of the spillovers generated by the SLX spatial specification of our epidemic curve. The spillovers induced by an SLX model are local in the sense that once the virus is transmitted from a province to another neighboring province, the transmission does not feedback and does not reverberate to other provinces. In this case, only adjacent neighbors are involved, but not higher-order neighbors. In contrast, the SAR model yields a more global spillover effect because it assumes that an impact on neighboring provinces reverberates to the neighbors of the neighboring provinces, neighbors to the neighbors, and so on, thus generating endogenous interaction and feedback effects (see LeSage, 2014) . In this case, the propagation of an original outbreak involves more spatial observations. The epidemiology literature focusing on the spatial propagation of COVID-19 highlights the contribution to the spread of the virus of both cross-border travel (Lemey et al., 2021) and local transmission (du Plessis et al., 2021). However, these papers do not discuss explicitly whether their transmission channels do have feedback effects between geographical units. This is the key issue that should guide the selection of a spatial econometric model. Although we believe that most of the inter-provincial mobility is local in nature due to regular commuting, we cannot rule out the possibility of more global effects caused by the transportation of goods or by business and leisure travelers. As we do not have a theoretical justification for the selected spatial specification, we will proceed as follows with our empirical application. First, we will verify that the SLX model is able to capture all the spatial dependence in the dependent variable through a set of spatial autocorrelation tests on the model's residuals. 
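The text does not name the spatial autocorrelation tests applied to the residuals. As an illustration only, the sketch below computes Moran's I, one widely used statistic for this purpose, on a single cross-section of residuals. The function name and the toy data are hypothetical, and the choice of Moran's I is an assumption rather than the authors' documented procedure.

```python
import numpy as np

def morans_i(residuals, W):
    """Moran's I for one cross-section of residuals and a spatial weight matrix W.

    I = (n / S0) * (z' W z) / (z' z), where z are demeaned residuals and
    S0 is the sum of all weights. Values near zero suggest no spatial
    correlation left in the residuals.
    """
    e = np.asarray(residuals, dtype=float)
    W = np.asarray(W, dtype=float)
    z = e - e.mean()
    s0 = W.sum()
    return (len(z) / s0) * (z @ W @ z) / (z @ z)

# Four hypothetical provinces on a line with binary contiguity weights.
W = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]])
clustered = morans_i([1.0, 1.0, -1.0, -1.0], W)    # similar neighbors
alternating = morans_i([1.0, -1.0, 1.0, -1.0], W)  # dissimilar neighbors
```

In practice, inference would require a reference distribution (for example, a permutation test), which this sketch omits.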
We will next provide the parameter estimates of an SLX model that uses a W matrix defined using information on human mobility across all Spanish provinces, that is, not only between adjacent provinces. In this case, more spatial observations are involved, as occurs in the SAR and spatial Durbin models. In this sub-section we discuss the advantages of using a linear model instead of a count model. Both models mainly differ in their dependent (outcome) variables and distributional assumptions. 4 Despite these differences, the parameter estimates in our linear model can be interpreted as a semi-elasticity of the number of new cases with respect to an explanatory variable, in the same fashion as in count regression models. 5 Although the interpretation of the estimated parameters is the same, the linear specification has some features that are critical in our application in order to measure the effectiveness of the Spanish lockdown in containing the propagation of COVID-19. First, running a linear model allows us to estimate a DiD model using the traditional FEs estimator. Estimating a DiD model using a count regression model is contentious as different empirical strategies exist for incorporating fixed effects into a count regression model, and some of them are not true FEs models (see Allison & Waterman, 2002) . Moreover, Lechner (2010, p. 196) shows that estimating a DiD model with the standard specification of a count regression models (and other popular nonlinear models) would usually lead to an inconsistent estimator. Second, as the growth rate of cumulative cases is much less volatile than the number of new cases (or its growth rate), our linear model provides more accurate predictions than a count model. This is a feature of the model that is important in our application because we use predicted values to carry out our counterfactual analyses aimed at examining the effect of the Spanish lockdown. 
Despite the fact that the FE linear model has some features that are very appealing for our application, we also provide the parameter estimates of a Negative Binomial (NB) model for robustness analyses. The NB model is also estimated using two different W matrices, in the same fashion as the linear models. Whereas the contiguity-based W matrix is computed using binary values indicating adjacent provinces, the so-called mobility-based W matrix is computed using information on human mobility across all the Spanish provinces. We have used several sources in order to collect a province-based dataset of coronavirus cases that permits the use of spatial econometric techniques in order to capture spatial propagation effects across Spain. As most control measures began on the days of March 13, 2020 and March 14, 2020, we analyze data on coronavirus cases 2 weeks before and 2 weeks after those dates. In particular, our dataset covers the period between the onset of the pandemic in each province and April 4, 2020. The daily evolution of laboratory-confirmed COVID-19 cases in Spanish mainland provinces was collected manually by the authors from the official press releases of the Spanish regional governments, the Ministry of Health and Wikipedia. In particular, we had to consult these information sources to extend backward the provincial data published by Datadista in GitHub under a free License since March 13, 2020, 6 the latter source extracting their data from a variety of documents published by the Ministry of Health. From March 28, 2020 onward, we collected the data directly using RTVE Flourish. 7 We used the regional online data released by the Ministry of Health 8 and the province-level data released by the Spanish regional governments in order to correct typos and the lack of information on coronavirus cases in some provinces (e.g., in Galicia). It should be noted that we were unable to obtain province-level data for the Cataluña region. 
For this reason, the whole region is treated as a single province. We do not show the temporal evolution of reported coronavirus cases in each province due to space limitations, but it can be found in Orea and Álvarez (2020). We instead show the pandemic onset dates for each province in Figure 2, as these dates determine the values of the epidemic times. A feature worth highlighting is the relatively large dispersion of onset dates across provinces. This feature is crucial for the estimation of Equation (3). Figure 3 shows the box-plots of the growth rates of cumulative cases by epidemic time. This figure clearly reveals two relevant features. First, the growth rates are much larger at the beginning of the pandemic than when the epidemic had progressed. That is, our dependent variable tends to decrease over the epidemic time. Second, the volatility is much larger when K_it is small and much smaller when K_it increases. This calls for using heteroskedasticity-robust standard errors when estimating our models. Both linear and NB models are also estimated using a spatial W matrix that is computed using information on human mobility across all the Spanish provinces. Data on mobility flows are obtained from the Spanish National Statistics Institute (INE), which in November 2019 initiated an ambitious project aimed at measuring daily mobility based on tracking spatial-temporal mobile position data.

Table 1 shows the parameter estimates of several epidemic curves. The dependent variable in the linear models is the growth rate of cumulative cases, whereas the dependent variable in the NB models is the number of new cases per day. All models have been estimated using the FE estimator because the traditional Hausman test rejects, at any significance level, the null hypothesis of no correlation between the province-specific effects and the regressors. All specifications in Table 1 provide very similar results, indicating that our empirical strategy is quite robust.
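The FE estimator referred to above can be illustrated with the within transformation: each variable is demeaned by province, which removes the province-specific effects, and OLS is applied to the demeaned data. The sketch below uses hypothetical names and a tiny noiseless synthetic panel; it is not the authors' estimation code and omits the robust standard errors mentioned in the text.

```python
import numpy as np

def within_estimator(y, X, groups):
    """Fixed-effects (within) estimator: demean y and X by group, then run OLS."""
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float)
    groups = np.asarray(groups)
    y_dm, X_dm = y.copy(), X.copy()
    for g in np.unique(groups):
        idx = groups == g
        y_dm[idx] -= y[idx].mean()
        X_dm[idx] -= X[idx].mean(axis=0)
    beta, *_ = np.linalg.lstsq(X_dm, y_dm, rcond=None)
    return beta

# Two hypothetical provinces observed for three days each.
province = np.array([0, 0, 0, 1, 1, 1])
x = np.array([1.0, 2.0, 3.0, 1.0, 2.0, 4.0])    # e.g., a log epidemic time term
lockdown = np.array([0.0, 0.0, 1.0, 0.0, 1.0, 1.0])
alpha = np.array([5.0, -2.0])                   # province fixed effects
y = alpha[province] + 0.5 * x - 1.0 * lockdown  # noiseless outcome

beta_hat = within_estimator(y, np.column_stack([x, lockdown]), province)
```

Because the fixed effects are differenced out, the slope coefficients are recovered even though the province effects are correlated with the regressors, which is the property that motivates the FE choice after the Hausman test.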
The coefficients of the third-order function of ln K_it are all statistically significant. This is an expected result as the traditional epidemic curve is S-shaped and this form requires estimating up to a third-order function of the epidemic time. The coefficients of M14, M21 and M28 allow us to test whether the Spanish lockdown and the previous control measures enacted by regional governments were successful in attenuating the spread of the virus within each province. As social distancing was encouraged on 9 March and in the following 3 days several regional governments prohibited in-class teaching and forced local quarantines, we find a statistically significant coefficient for M14. We also find a statistically significant coefficient for M28, an expected result due to the national lockdown of the population. Figure 4 shows the new cases per day over time (horizontal axis: epidemic time) for the different provinces, sorted by regions. This explains why each plot in Figure 4 includes multiple lines. There are two vertical red lines. Whilst the left one identifies the implementation of the Spanish lockdown (i.e., March 14, 2020), the right one marks March 28, 2020. Notice that the daily incidence peaked around March 28, 2020 (i.e., when t = 37 or so) in many of the Spanish provinces. Therefore, this figure seems to support the idea that the Spanish lockdown started to have a significant effect on new cases and, hence, on cumulative cases 2 weeks after its implementation. All models in Table 1 include two spatially lagged epidemic time variables. Our SLX spatial specification seems to capture all the spatial dependence in the dependent variable as we cannot reject the null hypothesis that the SLX residuals are not spatially correlated.
A key result of our empirical exercise is the positive and statistically significant coefficient found for the spatially lagged variable, W_i ln K. The coefficient of W_i ln K·M indicates that the lockdown has been more effective in provinces that are either close to the epicenters of the coronavirus or adjacent to provinces at a more advanced stage of the pandemic. As in our paper, Dickson et al. (2020) find that in the northern Italian provinces the Government containment measures not only succeeded in drastically reducing the transmission of COVID-19 amongst individuals within the Italian provinces, but also avoided contagions between neighboring areas.

Abbreviation: SLX, Spatial Lag Model. *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively.

To conclude this section, it is germane to mention that we also regressed the estimated province fixed effects against a set of covariates in order to identify province-specific factors which intensify the pandemic's development in each province. This information can be very useful for policy makers and health authorities when planning the relaxation of future lockdowns. We find that the most-populated provinces have suffered more acutely from COVID-19, probably due to the agglomeration of individuals and the more frequent use of public transport in these provinces. The coronavirus outbreak also proved more intense in those provinces with a relatively large share of highly educated workers. This result is most probably linked to provincial international connectivity and the probability of traveling abroad and/or importing cases of COVID-19 from other countries. We also found that the COVID-19 pandemic proved more severe in those provinces with a relatively large share of service sector workers. In contrast, the pandemic was less harsh in provinces with a relatively large share of workers in the agriculture and construction sectors.
The risk of contagion in the service sector is, unsurprisingly, much higher than in the construction and agricultural sectors because many service jobs are indoors, while most tasks in the other two sectors are mainly outdoors. Although in Table 1 all the models appear to provide similar parameter estimates, we next discuss some subtle but interesting differences. First, whereas the estimated parameters are very similar regardless of whether we use linear or NB models, the goodness-of-fit of the linear models is fairly high (over 80%) compared to the NB models (around 20%), an expected result due to the large volatility of daily new cases. Second, both linear and NB models are estimated using contiguity and mobility-based W matrices. While the first one is computed using binary values indicating adjacent provinces, the second one is computed using information on human mobility across all the Spanish provinces. The results of these two competing spatial specifications are very similar because 77% of the variation of the weights of the mobility-based W matrix is explained by the binary values of the weights of the contiguity W matrix. For instance, the mobility-based SLX model only attributes a slightly smaller effect to the Spanish lockdown than our preferred model. As the goodness-of-fit is slightly larger using the contiguity linkages, the latter model is used to carry out our simulation exercises. Interestingly enough, we find that the effect attributable to the Spanish lockdown in the NB models varies considerably when we change the W matrix. This seems to indicate that the linear models are, in our application, more robust to the definition of the W matrix. Our preferred model indicates that on average the growth rate of cumulative cases increases by 5.1 pp, from 17.1% to 22.2%, due to the spatial propagation between provinces. The spatial spillover varies over time.
For instance, while the growth rate of cumulative cases attributable to inter-provincial propagation is on average about 8.8 pp before March 14, 2020, it decreases to 3.3 pp after the implementation of the Spanish lockdown. This result again suggests that the lockdown was effective in preventing the propagation of the coronavirus between provinces. We also performed a counterfactual exercise using the parameter estimates of our preferred model in order to simulate what would have happened on April 4, 2020 in the case of no spatial propagation between provinces. Table 2 provides the results of this simulation exercise. This table shows remarkable reductions in cumulative cases in the absence of spatial spillovers between provinces. The number of reported cases in the mainland Spanish provinces on April 4, 2020 was 126,859. This number would have decreased to 37,557 in the absence of propagation between provinces. Therefore, the number of COVID-19 cases would have been reduced by 70.4% in the absence of spatial spillovers between the Spanish provinces. Figure 5 allows an examination of the geographical distribution of the estimated spatial effects if we compare the actual distribution of cases on April 4, 2020 (top map) with the distribution of cases that would have been observed on April 4, 2020 in our hypothetical scenario (bottom map). This figure suggests that the spatial effect varies across provinces. Indeed, we found that while the reduction of cases in the event of no spatial propagation is much larger in provinces that are either close to the epicenters of the coronavirus or adjacent to provinces at a more advanced stage of the pandemic, it is smaller in the other provinces.

Using the parameter estimates of our preferred model, we find that the growth rate of coronavirus cases decreases, on average, by 6.4 pp (from 28.6% to 22.2%) due to the Spanish lockdown.
As mentioned above, the reduction in the growth rate of cumulative cases attributable to the lockdown in provinces that are close to (far from) the epicenters of COVID-19 or adjacent to provinces at more advanced stages of the pandemic is much larger (smaller) than this average value. To provide information about the effectiveness of the lockdown by province, we have carried out two new counterfactual exercises that simulate what would have happened in two different hypothetical scenarios. We first simulate the number of coronavirus cases if the lockdown had not been implemented around March 14, 2020. The counterfactual values were simulated from March 14, 2020 onward by adding the difference between simulated and predicted growth rates to the observed growth rates of cumulative cases. The counterfactual values are then used to compute reductions in the number of coronavirus cases for each province and not only for the whole country as in Flaxman et al. (2020). The second counterfactual exercise examines what would have happened if the lockdown had been implemented on March 7, 2020. This information can be very useful for policy makers and health authorities in the event of new outbreaks of COVID-19 in Spain. The counterfactual values were simulated here from March 7, 2020 onward by subtracting the province-specific average of differences between simulated and predicted growth rates from the observed growth rates of confirmed cases. Table 3 provides the results of these two simulation exercises. Figure 6 compares the actual geographical distribution of coronavirus cases (shown in the middle map) with the counterfactual geographical distributions in the case of non-intervention (bottom map) and in the case of a hypothetical lockdown implemented on March 7, 2020 (top map). The number of reported cases in Spanish mainland provinces on April 4, 2020 was 126,859. This number would have increased to 737,663 in the absence of lockdowns.
Therefore, the lockdown implemented on March 14, 2020 reduced the number of potential COVID-19 cases by 82.8%. Similar numbers are found by Nussbaumer-Streit et al. (2020) in their rapid review of the literature related to COVID-19. They find that quarantine measures reduce the number of people with the disease by up to 81%. Using a similar approach, Cho (2020) recently found that the infection cases in Sweden would have been reduced by almost 75% had its policy makers followed stricter containment policies. The largest reductions in coronavirus cases attributable to the Spanish lockdown are found again in provinces that are either close to the epicenters of the coronavirus or adjacent to provinces at more advanced stages of the pandemic, as the two last maps in Figure 6 suggest.

FIGURE 5 Spatial spillovers: geographical distribution of cumulative cases on April 4, 2020. (a) Actual cases with spatial spillovers. (b) Counterfactual cases with no spatial spillovers.

We next discuss what would have happened if the lockdown had begun on March 7, 2020. If the lockdown had been brought forward to March 7, 2020, the number of coronavirus cases would have been reduced from 126,859 to 41,318 in the Spanish Peninsula. Taking both counterfactual analyses together, a lockdown implemented on March 7, 2020 would have reduced the number of potential COVID-19 cases by 94.4%. Therefore, the number of coronavirus cases would have been reduced by an additional 11.6% (94.4% versus 82.8%) if the lockdown had been brought forward to March 7, 2020, a reduction that potentially would have prevented the collapse of many hospitals in Spain. We finally examine the GDP gains of bringing forward the date of the Spanish lockdown. The second counterfactual exercise carried out in the previous section shows that many provinces would have had fewer than 28 new confirmed cases per 100,000 inhabitants during the 2 weeks prior to April 4, 2020.
This in turn implies that on April 4, 2020 many regions would have already met one of the conditions stipulated by the Spanish Government to initiate the relaxation of the lockdown measures. The easing of the lockdown restrictions in Spain began on May 11, except in Castilla-León, Cataluña and Madrid, where it started on May 25. Table 4 shows the duration of the actual lockdown in the Spanish mainland regions (see the second column). The strictest part of the confinement lasted 8.3 weeks in all regions, except in the three aforementioned regions, where the confinement was extended by 2 weeks. The first column in Table 4 shows the annual GDP growth rate forecasts per week of lockdown provided by BBVA Research (2020). Based on this information, the third and fourth columns compute the economic effect of the actual lockdown in terms of GDP growth rates. Table 4 also provides an estimate of the economic disruption corresponding to a hypothetical lockdown implemented on March 7, 2020. The lockdown of a province is assumed to start easing on April 11, 2020 if two conditions are satisfied. The first is that it meets the criterion mentioned above of having fewer than 28 new confirmed cases per 100,000 inhabitants during the 2 weeks prior to April 4, 2020. The second condition concerns the (relative) capacity of its health services to deal with new cases of COVID-19, which a province meets if it has fewer confirmed cases per capita than the median province on April 4, 2020. The lockdown would have lasted only 4 weeks if a province had met these two conditions on April 4, 2020. If only one condition is met, the easing of lockdown restrictions is assumed to start on April 25, 2020, in which case the lockdown would have lasted 6 weeks. Finally, if neither of the conditions is met, the easing of lockdown is assumed to start on May 9, that is, 2 weeks later.
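The easing rule above, together with the GDP-weighted regional aggregation described next, can be sketched as follows. This is an illustrative sketch rather than the authors' implementation: the function names and the example inputs are hypothetical, and the 8-week duration for provinces meeting neither condition is inferred from the statement that easing starts 2 weeks after the 6-week case.

```python
INCIDENCE_THRESHOLD = 28.0  # new confirmed cases per 100,000 inhabitants over 2 weeks


def lockdown_weeks(new_cases_14d_per_100k, cases_per_capita, median_cases_per_capita):
    """Simulated lockdown duration (in weeks) for one province."""
    low_incidence = new_cases_14d_per_100k < INCIDENCE_THRESHOLD
    below_median = cases_per_capita < median_cases_per_capita
    conditions_met = int(low_incidence) + int(below_median)
    # Both conditions: 4 weeks; one: 6 weeks; none: 8 weeks (inferred).
    return {2: 4, 1: 6, 0: 8}[conditions_met]


def regional_weeks(provinces):
    """GDP-weighted average duration; provinces is a list of (gdp, weeks) pairs."""
    total_gdp = sum(gdp for gdp, _ in provinces)
    return sum(gdp * weeks for gdp, weeks in provinces) / total_gdp


# Hypothetical region with two provinces: (GDP, simulated weeks of lockdown).
region = [
    (3.0, lockdown_weeks(10.0, 0.001, 0.002)),  # meets both conditions
    (1.0, lockdown_weeks(40.0, 0.003, 0.002)),  # meets neither condition
]
print(regional_weeks(region))
```

Weighting by GDP means that the lockdown duration of economically larger provinces dominates the regional figure used to compute GDP losses.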
Once the duration of the lockdown has been simulated for each province, a weighted average is computed for the whole region using the relative GDP of each province as weights. The simulated regional lockdown durations are shown in the fifth column. The next two columns show the simulated annual GDP growth rate and GDP losses using the annual GDP growth rates per week of lockdown shown in the first column. Given the BBVA forecasts, our simulated lockdown implemented on March 7, 2020 would have reduced Spanish GDP by 69,700 million euros. Finally, the last column in Table 4 shows the difference in GDP losses between the simulated and real lockdown. Summing across all regions, the estimated difference in GDP losses is around 26,900 million euros. Therefore, the simple economic analysis in Table 4 suggests that the final economic consequences of the confinement of the population would have been much less severe if the Spanish lockdown had been brought forward to March 7, 2020.

This paper examines the propagation of COVID-19 across the Spanish provinces and assesses the effectiveness of the Spanish lockdown of the population implemented on March 14, 2020 to combat the pandemic. To achieve these objectives, we use a spatial econometric model that to some extent mimics the popular reproduction-based models used in the epidemiological literature. The main findings of the paper are the following. We provide evidence that human mobility spread the virus across the country, given that the growth rate of COVID-19 cases in one province depends on the development of the pandemic in other provinces. We also find that the lockdown has been effective both in attenuating the propagation of the virus within each province and in preventing the propagation of the coronavirus between provinces.
Our counterfactual analyses show that local and national lockdowns of the population are effective measures to combat COVID-19 in the absence of both pharmaceutical measures (e.g., vaccines) and other non-pharmaceutical interventions (e.g., massive testing, face masks available for the whole population, etc.). However, they should be implemented at the very early stages of the pandemic. On the one hand, our analyses suggest that carrying out a gradual relaxation of the control measures in Spain, both across provinces and sectors, is preferable. On the other hand, we find that the GDP losses attributable to the confinement of the population would have been reduced by 26,900 million euros if the Spanish lockdown had been brought forward to March 7, 2020. As such, we find that a rapid institutional response to the COVID-19 outbreak would not only have saved lives but would also have attenuated the economic impact of the Spanish coronavirus pandemic.

FIGURE A1 Growth rates of cumulative cases in a Susceptible, Infected and Recovered model

REFERENCES
Fixed-effects negative binomial regression models
Human mobility networks, travel restrictions and the global spread of 2009 H1N1 pandemic
Regional analysis Spain. Second
The econometric analysis of non-stationary spatial panel data
Natural human mobility patterns and spatial spread of infectious diseases
Does lockdown work? A spatial analysis of the spread and concentration of Covid-19 in Italy
Quantifying the impact of nonpharmaceutical interventions during the COVID-19 outbreak: The case of Sweden
Voluntary and mandatory social distancing: Evidence on COVID-19 exposure rates from Chinese provinces and selected countries
Assessing the effect of containment measures on the spatio-temporal dynamic of COVID-19 in Italy
Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK
Human mobility and the global spread of infectious diseases: A focus on air travel
Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe
Modelling and predicting the spatio-temporal spread of COVID-19 in Italy
Spatio-temporal propagation of COVID-19 pandemics. medRxiv preprint
Human mobility and COVID-19 initial dynamics
The effect of human mobility and control measures on the COVID-19 epidemic in China
Measuring mobility, disease connectivity and individual risk: A review of using mobile phone data and mHealth for travel medicine
The estimation of causal effects by difference-in-difference methods
SARS-CoV-2 European resurgence foretold: Interplay of introductions and persistence by leveraging genomic and mobility data. This preprint is under consideration at a
What regional scientists need to know about spatial econometrics
What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization
Correlations between human mobility and social interaction reveal general activity patterns
Quarantine alone or in combination with other public health measures to control COVID-19: A rapid review
How effective has the Spanish lockdown been to battle COVID-19? A spatial analysis of the coronavirus propagation across provinces
Estimating the propagation of the COVID-19 virus with a stochastic frontier approximation of epidemiological models: A panel data econometric model with an application to Spain. Efficiency Series Paper, 01/2021
The SLX model

The authors want to express their heartfelt personal thanks to all frontline staff in the Spanish health and social care system who battled and continue to battle COVID-19, as well as to those workers and volunteers who were key in supporting basic services during the Spanish lockdown. This manuscript was written during our visiting scholarship at Loughborough University. The authors gratefully acknowledge the two "Salvador de Madariaga" grants obtained from the Spanish Ministry of Science, Innovation and Universities (Grants PRX19/00596 and PRX19/00589). The authors thank Alan Wall (University of Oviedo) and Ángel de la Fuente (Universitat Autònoma de Barcelona), who read a preliminary version of this article and provided several valuable comments and suggestions. Both authors also wish to acknowledge the financial support from the project INNJOBMAD-CM (Ref. H2019/HUM-5761) and the Ministry of Science and Innovation (Project PID2020-113076GB-I00), and to thank Joanna Bashford, assistant researcher of the aforementioned project, for editing the English grammar of the text. We also gratefully acknowledge the data provided by Datadista in GitHub and Flourish in RTVE.

Luis Orea and Inmaculada C. Álvarez declare that they have no conflict of interest.

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Inmaculada C. Álvarez https://orcid.org/0000-0003-0495-623X

ENDNOTES
1 We thank the reviewers for pointing out that human mobility is a proxy for what we really think affects the spread of a contagious disease, that is, human contact patterns.
2 Separate analyses or more flexible models must be implemented in order to account for more than one contagion wave (see, e.g., Dickson et al., 2020, who use B-spline regressors to model complex nonlinear spatio-temporal dynamics in the propagation of the virus).
3 We have modified the simulation of Chudik et al.
(2020) as follows. We first generate the true evolution of total coronavirus cases in a representative province using the discrete-time SIR model developed by Chudik et al. (2020). Observed values for each province are then obtained by adjusting the theoretical values with simulated values for a one-sided (half-normal) random term capturing the proportion of undocumented cases. We replicate this procedure for different levels of underreporting. In all replications, the onset of the pandemic is associated with the day on which we observe the first case. Although all provinces have the same true onset date, the observed onset date of each province might differ due to underreporting.
4 When the outcome is considered to be continuous, the data are frequently assumed to be normally distributed and linear least squares regression techniques are applied. Count regression models provide an alternative approach for the analysis of discrete data, provided that the outcome follows, for example, a Poisson distribution, that overdispersion is correctly specified, and that the model adequately fits the data.
10 According to Beenstock and Felsenstein (2019), the lack of spatial correlation in the residuals should be tested for each time period. The set of performed Moran's I and Geary's tests is available from the authors upon request.
11 The parameter estimates are available from the authors upon request.
12 Although the accumulated effect of this reduction is remarkable (see our results in Table 3), the epidemic did not stop growing by April 4, 2020. Using our model, we can only conclude that the Spanish lockdown helped to attenuate the COVID-19 propagation during the first wave of contagion.