key: cord-0750473-ly3fx3u4 authors: Ciapanna, Emanuela; Rovigatti, Gabriele title: The Grocery Trolley Race in Times of Covid-19: Evidence from Italy date: 2022-03-03 journal: Ital Econ J DOI: 10.1007/s40797-022-00189-0 sha: 26541ed05e8feda0f71e7c152c10d2bc859cef28 doc_id: 750473 cord_uid: ly3fx3u4 We study the effects of the containment measures imposed by the Italian Government during the first wave of the Covid-19 pandemic on sales of the retail trade sector, focusing on different types of grocery chain stores. We document a sustained growth in revenues for storable products, beginning right before restrictions on mobility were introduced, and lasting throughout the whole lockdown period. The surge has been driven by the dynamics of smaller outlets, located in urban areas and closer to the city centre, while hypermarkets experienced a drop, probably relating to their more peripheral position. Thanks to the remarkable granularity of the Nielsen scanner data and the staggered implementation of restrictions across regions, we causally identify the short-term effects of mobility constraints on outlets’ sales. According to our estimates, large grocery stores in areas subject to lockdown measures earned revenues around 10% lower than their control group did. In view of the protraction of the pandemic and the need for the Government to continue managing the containment of infections, our study helps quantifying the costs for the economy of mobility restrictions, also highlighting possible distortions arising among different categories of outlets. Italy has been among the first European countries to experience a widespread diffusion of the Covid-19, as well as one of the most hit in terms of deaths during the first wave. Since early March 2020, in order to contain the pressure on the overloaded healthcare system, several measures have been put in place by the Government under the guiding principle of social distancing to limit individuals' exposure to the virus in the workplace and in public spaces. Productive activities deemed 'non-essential' and suspended by the Prime Minister's Decree (DPCM) of 22 March 2020 until May 4th represented about one third of total value added, with percentages of up to around two thirds for the accommodation and catering services component and almost 100% for recreational activities. The impact of the pandemics on the various economic sectors, also due to unprecedented fall in demand, has been uneven. The immediate effects have been particularly severe in manufacturing, transports, catering, accommodation, recreation and culture, personal services, and in large swathes of retail trade. In March and April, the latter experienced a drop in sales of 23% compared to the corresponding period of 2019, driven by the severe decrease in the non-food compartment (− 45%). Instead, the food sub-sector outperformed, registering an overall increase of 5% in sales year on year (Fig. 1) . The present analysis focuses on the effects of the Government restrictions to mobility, imposed in Italy between the end of February and the beginning of May 2020, on sales of fast moving consumer goods (FMCGs) in grocery non specialized stores, namely hypermarkets, supermarkets, superettes and discounts. 1 We document the dynamics of revenues in the period immediately before and during the lockdown, distinguishing between different types of stores, products and by geographical breakdowns. Our findings point to a sustained growth in sales in the 2-month period March-April 2020 of more than 16% with respect to the corresponding period in 2019, a considerable increase, comparable in magnitude to Christmas' sales peaks in "normal" times. The revenue surge during the restrictions is ascribable to both quantity and price increase, once accounting for the large composition effects at play. As of product categories, the most sustained dynamics concerned medical, pharmaceutical and food products. The type of outlet also represents an interesting dimension of heterogeneity throughout the spreading of the pandemics: restrained mobility favoured proximity stores of smaller size (superettes and smaller supermarkets), while penalizing larger ones (e.g. hypermarkets), generally located in more peripheral areas. Along these lines, we propose an attempt to causally identify the effects of mobility restrictions on sales dynamics, by disentangling them from the general impact of fear of contagion risks. To this aim, we perform a difference-in-difference estimation, exploiting the time lapse between the institution of the first partial red zone, involving 10 municipalities within the Lodi province on February 23rd (treated group), and the total national lockdown, starting on March 9th (control group). Our results indicate that during the partial lockdown week, large grocery chain stores (super and hypermarkets) in the treated province registered revenues around 10% lower than those of their control group. The effect is stronger for larger stores (e.g. hypermarkets), more exposed to mobility restrictions, due to their peripheral location, and for "less essential goods", such as cosmetics and alcoholic beverages (− 19 and − 12% respectively). The temporary nature of such shocks-which peaked for all categories at the onset of the lockdown, just to slowly fade away when restrictions were lifted-traces their source back to Government restrictions, rather than being the signal of structural changes in the market structure. In view of the protraction of the pandemic and the need for the authorities to continue managing the containment of infections, our study has important policy implications: it provides an estimate of the costs (in terms of lost revenues) of anti-contagion interventions for a large sector of the economy, also highlighting possible distortions among different categories of outlets. Our results prove robust to different specifications. In particular, by exploiting the quasi-random location of the first Covid-19 clusters and the diffusion that ensued, we are able to split our sample according to the levels of contagion risk, and show that the latter does not bias our estimates. 2 Our work is related to several recent empirical research contributions examining many aspects of the Covid-19 pandemic, and using different data sources. A first strand of literature analyses the effects of lockdown measures and contagion on consumers' behavior. Three notable studies for Denmark, Spain, and the US present evidence on the impact of Covid-19 on consumer spending. Andersen et al. (2020) use transaction-level bank account data from a large Danish bank to find a decline in spending-following the Covid-19 outbreak-which varies across product categories and correlates with Government's restrictions. Carvalho et al. (2020) utilise a large high-frequency point-of-sale transaction dataset from a major Spanish commercial bank to find large overall spending declines across various product categories, following a Government lockdown. Baker et al. (2020a, b) use transaction-level household financial data from a personal-finance website to observe a substantial increase in consumer spending as Covid-19 cases increase, followed by a significant decline in general spending. The authors also observe heterogeneity in spending responses across states (depending on the severity of the virus outbreak). Chronopoulos et al. (2020) examine consumer-spending responses to the onset and spread of the pandemics and the subsequent government-imposed lockdown in Great Britain. Based on data from a personal finance app, which aggregates all transactions from linked bank accounts and credit or debit cards, they find that overall consumer spending declined as the UK government lockdown became imminent and has continued to decline since, although with some heterogeneity by age, gender, and income level. Our work is especially related to two other recent contributions. The first one is an article by O'Connell et al. (2020) , where the authors use household-level scanner data for the UK to analyse purchase dynamics during the pandemics. They document large spikes in spending on storable products in the 4 weeks preceding the lockdown, particularly visible for FMCGs. The second piece of literature is the work by Goolsbee and Syverson (2020) , which proposes a causal estimate of the effects of the shutdown policy conditions at the county and city level in the US on foot traffic, a proxy for economic activity. The results indicate that legal shutdown orders have accounted for a modest share of the massive overall changes in consumer movements. However, Goolsbee and Syverson (2020) consider how often people visit shops rather than how much do they actually spend, which makes their findings perfectly compatible with a surge in grocery stores' revenues overall. Compared to the the aforementioned contributions, the present article adopts a somewhat different perspective. In fact, our focus is on outlets' performance (revenues and sold quantities) rather than on consumers' preferences and spending behavior. The latter is clearly a fundamental driving force within the analysis, yet our main objective is to disentangle the effect on the observed dynamics of Government restrictions (imposed by decree with different timing) from those associated to contagion risks fear. The original contribution of our analysis is twofold: on the one hand, with respect to the aforementioned literature, we propose an attempt of causal identification of the lockdown effect on sales, on the other hand, our estimation method presents some elements of novelty in the designing of distance-depending control groups, borrowed from the spatial econometrics techniques (Diao et al. 2017) . The paper is structured as follows. In Sect. 2 we present the institutional setting and the timing of the Covid-19 restriction measures in the different Italian regions, in Sect. 3 we outline the dataset and document revenues dynamics across broad sets of outlet types, regions and products. In Sect. 4 we propose our identification analysis of the the effects of restricted mobility on revenues dynamics, and discuss our results. A final section concludes. Robustness checks, additional tables and figures are relegated to the Appendices. In this section we provide a general description of the relative weight of the grocery chain-store sector in Italy, with particular reference to market share dynamics of the different types of outlets in the last decade with a geographical detail. Successively, we present the timing of the Covid-19 crisis from its onset in late February, until the end of the restrictions period and we show how the containment measures have differently affected the grocery retail sector, depending on store type as well as on geographical location. In the food sector alone, the market share of modern distribution (grocery retail chain stores) represents 74.5%, in Italy, corresponding to about 88 billion of food consumption. The number of outlets has decreased of around 11% within the last decade, from around 29 thousand in 2009 to 26 thousand in 2018, due to the sharp reduction in smaller-size shops, having a sales area between 100 and 400 m 2 (− 26.6%), partially offset by the expansion of discounters (+ 24.2%). Supermarkets are also growing, albeit to a less extent (1%), while hypermarkets are downsizing (Fig. 2) . Grocery large-scale retail in Italy remains one of the most atomised markets in Europe. The top five operators represent just over half of the market (51.8%) while in countries such as Germany, France and the United Kingdom the share is between 75 and 80%. A closer look, however, unveils local as well as structural differences, which characterize both the market structure and its evolution-including different types of dynamic responses to shocks. The most striking difference between geographical areas concerns the distribution of outlet types. In Fig. 3a , we report the relative market shares of store categories by NUTS1 in 2019. 3 While Supermarkets are the best-selling format in most of the country, the market share of large-scale stores strictly depends on the macro-area, being very popular in the North West, while representing only the 10% of the market in Southern regions-where consumers seem to prefer smaller-scale stores. The wide gap in Discount market shares-more than 10% between the South and the North West-reflects crucial differences in market structure and consumer habits toward the low price policies followed in those stores. In panel (b), we report the dynamics of the average store size between 2003 and 2017: in line with the (static) market share figures, in the Northwest stores are bigger, and have been growing at a higher rate than those located in other areas of the country, since the early 2000s. Not surprisingly, there is a very strong and positive association between the number of outlets and population density (ρ = 0.95) on the one hand, and total sales area (ρ = 0.45) on the other hand. We also find a negative correlation between population density and the presence of large sales surfaces (ρ = −0.13). In fact, hypermarkets arise mostly in peripheral areas, which implies the need for customers to drive to the store location (for a detailed discussion on the topic, see Viviano et al. 2012 As we show in the next sections, this feature helps us to explain the more modest dynamics in sales registered by this category of stores compared to the others during the period of mobility restrictions, which we exploit for identification purposes later in the discussion. Figure 4 depicts the main phases of the Covid-19 crisis. The first acknowledgment of the existence of a suspect "viral pneumonia" cluster of cases in Wuhan, China, dates back to December 31, 2019, and was closely followed by the first reported cases in the EU, and by the lockdown imposed to the Hubei region in mid-January. In Italy, the first reported cases, not directly linked to the Wuhan cluster, date back to mid-February and were detected in Lombardy. On February 23, 2020, the Government declared the first-partial-lockdown, involving 10 municipalities in the Lodi province, which were quarantined to prevent the spread of contagion. Soon after, the sudden increase in the number of infected patients, the pressure on the Intensive Care Units (ICUs) in the most affected areas, and the rise in the count of Covid-19-related deaths prompted the authorities to take unprecedented actions. On March 7 2020, all of Lombardy and 14 additional provinces were quarantined. 4 After only 48 h, on March 9 2020, the whole Italian territory was placed under lockdown. Moreover, the closure of all "non-essential" activities (including bars, cinemas, restaurants) was imposed, in order to ensure social distancing, and reduce the rate of contagion. Covid-19 cases started to decline in May 2020, thanks to the 2-months lockdown. Freedom of movements was re-established on May 4, and other not essential activities re-opened later in the month, putting an end to the so-called "phase 1" of the first wave to enter "phase 2", or Respite period (for further details regarding the evolution of the pandemics and the relative containment measures, see Appendix A). After the summer lull, the virus regained strength in late September, giving rise to the second wave. Non-Alcoholic COICOP 2 DIGITS Notes: Nielsen/ECR scanner data structure, split in information available (black font) and missing (red font) in our dataset. This section presents the data sources that are employed throughout the analysis and the first descriptive evidence regarding the dynamics of grocery stores' revenues following the introduction of Government restrictions in place between early March and the first week of May 2020 in Italy. We employ NUTS1 and NUTS3 level scanner data on revenues and sold quantities, collected by Nielsen and covering about 85% of total grocery chain stores in Italy. 5 The Nielsen scanner data are structured according to the general classification provided by the Efficient Consumer Response (ECR) community, used for category management by both industrial and distribution companies. The ECR classification involves a hierarchical, multi-layer structure, with increasingly granular and more specific clusters of products. In the analysis, we match its broader categories (Product Area) with corresponding Classification of Individual Consumption by Purpose (COICOP, UN Statistical Division, 2018) classes. Despite the differences in the classification scope-distributive and consumption-based, respectively-the two overlap pretty well. In Fig. 5 , we report an example of the Nielsen data structure alongside the general nomenclature of the ECR classification. In particular, out of the 6 layers available, our data is aggregated at L3 (product group, black font)-as a result, we are able to track down the dynamics of the granular products, but we miss potentially interesting information related to the marketability and packaging (L4 and L5, respectively), as well as individual product information like brand, producer, etc. (red font layers), which all appear aggregated and not distinguishable. We employ weekly data for 462 product categories, sold in Italy from January 1, 2018 to October 30, 2020. The NUTS1-level data (North-West, North-East, Center and Sardinia, and South and Sicily) further identify the type of store, whereas the NUTS3level one (province) only accounts for total sales in larger outlets (i.e. all the super and hypermarkets). In Table 1 we describe the two datasets (Channels for NUTS1-level, and Provinces for NUTS3) in terms of coverage, granularity, and variables reported. Based on the COICOP, we identify 19 products at the most granular 4-digits level, Approximate* Prices (P) Prices (P) * The computation of weekly pricesP = P * Q/Q is influenced by (i) changes in the composition of product baskets, (ii) discounts and offers, and (iii) the product formats purchased. Available Nielsen dataset description and main characteristics Table 2 Revenues per product Elaborations on Nielsen data. Descriptive statistics: average weekly revenues per product group and provinces groups-treatment (i.e., Lodi, in column 1), all other provinces (2), and other Lombardy provinces (3). In columns 4 and 5 we report the results of t-tests on the difference of the means between the control and treatment municipalities further aggregated into seven 2-digits clusters: food-which is the main segment, soft drinks, alcoholics, household equipment, medical appliances, and personal care supplies (see Table 2 ). For exposition purposes, in what follows we distinguish three sub-periods along the first wave of the Covid-19 crisis: Anticipation (February 1 until March 8 2020), Lockdown (March 8 until May 4 2020), and finally the Respite, from the end of the lockdown to October 30 2020. In Fig. 6 we report the overall sum of revenues per week in 2018 (maroon dashed), 2019 (green dotted), and 2020 (yellow solid) for the whole sample in the period January-October. The graph highlights the presence of a structural break in the 2020 series, coinciding with the first wave of the pandemics, starting around the last week of February (anticipation period), until the beginning of May, which gradually vanishes throughout the Summer (respite period). The peak is registered on March 8th, i.e. the first week of national lockdown. Looking at revenues dynamics, the year on year growth rate of weekly revenues 6 at the national or NUTS1regional level (and/or by product categories), writes: In Fig. 7a , we report the time series of r (y w ) from October, 2019 and October, 2020, for the whole country. The series presents two spikes: the first coincides with the national lockdown on March 8, and the second with the end of the lockdown-i.e., the first week of May, 2020. The plot, however, suggests that the lockdown regulatory measures did not fully channel the increase in revenues: the "excess" demand (i.e., precautionary spending) appeared already on early February, as the first news of the pandemics outbreak started spreading (consumption hoarding), and abnormal revenues ceased only in June, when mobility across regions was fully re-established, just to reemerge with the onset of second wave in September. Despite the acceleration in revenue growth registered in aggregate terms, the lockdown effect appears to be composite, depending on store types. In panel (b) we report the same r (y w ) series by outlet category: while the anticipation effect generates common upward trends, these diverge as soon as the restrictions to the mobility take place. While medium-and small-scale stores, including Discounts, reach unprecedented levels of revenues, with + 20 to + 40% increases with respect to the same week of 2019, hypermarkets experienced negative fluctuations for the whole lockdown subperiod. We argue that a similar diverging pattern, far from being the result of changes in consumption habits, is rather attributable to the more peripheral position of large retail surfaces, less easily accessible throughout the time lapse of mobility restrictions. Indeed, especially in smaller municipalities, hypermarket customers were unable to drive beyond municipal boundaries and were induced to shop in vicinity outlets (generally superettes or specialized stores), within few hundreds meters from their homes. This intuition finds further confirmation in the sudden reversion of the trend to the old habits immediately following the end of lockdown (see the positive spike since May 4). Apparently, as shown in panel (c), the same revenue patterns have been recorded all over the country, with only minor differences across NUTS1 regions. Finally, in panel (d), we exploit the 2-digit COICOP classification to show how rather diverging trends emerge across products: while relatively important product clusters (in terms of total market share), such as food and household appliances experienced sizeable increases, the personal products were negatively affected by the lockdown. Unsurprisingly, medical equipments (including masks, pharmaceuticals, etc.) recorded unprecedented peaks both during and after the lockdown period (further details about the lockdown effects, distinguishing between geographical areas, product/outlet types and price/quantity impact are relegated to Appendix B). In this section, we attempt to identify the causal impact of the mobility restriction measures adopted during the first wave on sales of large grocery stores in Italy. We first describe our empirical strategy, which exploits the gradual entry into force of the provisions by different geographical areas; then we report our results and discuss their significance and limitation. Finally, we conduct a battery of robustness checks to corroborate our evidence. The present analysis is based on NUTS3 data, which are available for the universe of large stores only (as stated in Sect. 3), so all our results in this section solely apply to this sub-category. The partial lockdown imposed on ten municipalities within the province of Lodi on February 23, 2020, provides an exogenous variation in the treatment status that we use to estimate the causal lockdown effects on grocery stores revenues. More specifically, we exploit the fact that the red zone-nearly the whole province-had been isolated from the rest of Lombardy with the prohibition for its citizens of moving from home, except for reasons of emergency and for the supply of essential and basic necessities. We propose the following difference-in-difference specification: where our outcome variable y is the log of revenues for product i in province p at week t; treatment i pt is an indicator function for products sold in the Lodi province after February 23, 2020, while τ t and γ i p are time and province/product fixed effects, respectively. 7 In Fig. 8a , we plot revenues dynamics (index February 5, 2020 = 1) for the treatment (solid blue line) and the control group (dashed maroon line)-the latter including all Italian provinces but Lodi. 8 The plot confirms that there are no detectable pre-trends that differently affected the treatment and the control group. Moreover, despite the upward dynamics, the graph highlights how the treated units "catch-up" the level of spending of control units right after the national lockdown date (black dotted line). Figure 8b , depicts the within-province mobility for the treatment and the control group: in the time lapse between the treatment date (maroon vertical line) and the national lockdown (black vertical line), the mobility within the treated area decreased dramatically, lying well below the average level in control provinces. The gap was closed only when the national lockdown extended the same measures to the whole country. The proposed model faces a few potential threats to the identification of the correct treatment effect, which we discuss below and address with a series of robustness checks-reported in the body of the paper below or in Appendix C. First, and foremost, one could argue that our results are driven by the heterogeneous perceived risk of contagion in the two groups rather than by the different restrictions applied to treated and controls. The descriptive evidence shown in Fig. 8a , could induce to believe that the treatment affects provinces belonging to the control group(s) as well as Lodi, which would violate the main assumption for the validity of the differencein-differences identification-i.e., absent the treatment, the outcome of treatment and control groups would hold parallel. We argue that the regulatory intervention imposed on February 23, 2020-the lockdown-had two main effects: on the one hand, it generated a wave of concern about contagion risk (signalling effect) that invested the whole country, and caused a spike of consumption hoarding for precautionary reasons (O'Connell et al. 2020 ). On the other hand, for quarantined municipalities only, it imposed strict constraints to personal mobility (mobility effect). As long as the extent of the signalling effect is the same for treated and control units, the proposed Revenues -Index 01feb2020 16feb2020 02mar2020 17mar2020 01apr2020 Date Control -All Distance-based controls Notes: (a) Total Covid-19 identified cases between March and September, 2020, per 100,000 inhabitants at the province level (February/August, 2020). (b) Revenues indexed at February 1, 2020, reported for the treatment group (Lodi, solid blue), averaged across the control group provinces for the baseline control (Control -All, dashed maroon), and across all distance-based control groups (dashed grey). identification captures an unbiased estimate of the mobility effect. In other terms, we assume the lockdown effect to be a combination of two components, β lock = β mob + β sign , one affecting, by definition, the treated group only (β mob treatment > 0 = β mob control ) and captured by (2), whilst the other assumed equal for both groups (β sign treatment = β sign control ), irrespective of the contagion risk faced. The latter assumption, though, has clear observable implications-i.e., control provinces facing higher contagion risk must not show significantly higher levels of revenues than control provinces with fewer contagions-that we empirically test in Sect. 4.3. In order to correctly identify the impact of mobility restriction measures on total revenues, as distinct from the effect of perceived contagion risk, we also propose an exercise with distance-dependent control groups, whose size in terms of included provinces is an increasing function of the distance from the epicenter of contagion (i.e. the province of Lodi). Following the methodology proposed in Diao et al. (2017) , we exploit the heterogeneous geography of the first wave of infections-which we report, in terms of total cases per 100,000 inhabitants within the province, in Fig. 9a . Most heavily affected provinces are located in the North-West, and in particular around Lodi (which is by far the worst case, with an incidence of around 175). The "quasiexogenous" geographical distribution of the starting clusters allows us to partition provinces into gradually less exposed units, depending on their physical distance from the epicenter. More specifically, we compute the distance between each province and Lodi, 9 and use the deciles of the distance distribution to generate gradually narrower balls around the treated province. For each decile, all provinces whose centroid lies within the relative ball belong to the control group. In Fig. 9b , we report the trends for each distance-based control group: all of them confirm the absence of any significant pre-trend. In addition, we run the baseline model excluding all provinces which share any boundary with Lodi: in this way, we are able to exclude from the analysis the units facing the highest (perceived) contagion risk. Second, our analysis suffers from data limitations. In fact, the maximum degree of geographical detail available in the Nielsen data-set is NUTS3, i.e. the province level. On the other hand, the quarantine was imposed at the municipal level, for a subset of municipalities within the Lodi province. The latter account for 22% of the whole province population, and are home to more than 40% of sales area within the grocery retail sector, respectively. Provided that the data points in our treatment group include both the treated and a subset of control units, the estimated β is downward biased by construction and attains a lower bound of the actual effect. Third, the province of Lodi may have unique features possibly driving our results, and our model could be capturing them, rather than the lockdown effect. Italian provinces are in fact characterized by an extreme degree of heterogeneity. In this context, the very small size of the treatment group relative to the control raises similar concerns. In order to address both issues, we propose (i) an inverse propensity scoreweighted version of model (2)-which corrects for possible selection of treated units into the treatment, and (ii) a synthetic control group analysis, with province level data. Both exercises yield very robust, and statistically significant results (see Appendix C). Table 3 reports the results of the difference-in-differences estimation on total revenues (column 1, Baseline). Our results point to an impact of lockdown measures of around 10% lower revenues for grocery stores located in the red zone with respect to those located in areas not subject to mobility restrictions. These findings are remarkably robust to the choice of the model: columns 2 to 10 report the estimates using the distance-dependent control groups, increasingly more restrictive-from the 9th (corresponding to a radius of around 750 km around Lodi) to the 1st decile (which corresponds to a very narrow 64 km radius)-with qualitatively the same estimates. These results provide a strong support to the correctness of our identification: irrespective of the level of contagion risk considered in the control group, in fact, the point estimate is relatively unaffected. looseness-1In Table 4 we test the robustness of our results to different compositions of the control group, and for both total revenues (panel a) and within-province mobility (panel b). Column 2 reports the results obtained keeping in the control group only the provinces within the first decile of distance from Lodi, whereas in column 3 we implement a somewhat opposite approach and, focusing on the contiguity rather than the distance between provinces, we exclude from the sample all Lodi's neighbor provinces in order to avoid possible confounding effects coming from the contagion risk component (see next section for a more detailed analysis on the point). Finally, in columns 4 and 5 we split the sample according to macroareas (Center-South and Table 3 Baseline results and distance-based robustness checks Baseline Difference-in-differences results, as in Eq. (2). In column 1 (Baseline) we use all available data and keep the control group as broad as possible. In columns 2 to 10 we use increasingly restrictive definitions of control groups, using the deciles of distance from the epicenter. In all models, we cluster the standard errors at the province level (2), on log revenues (a) and mobility (b). In column 1 (Baseline) we use all available data and keep the control group as broad as possible. In column 2 we focus on a restrictive definitions of control group, using the nearest decile of distance from the epicenter. In column 3 (No Neighbors), we exclude from the sample the provinces contiguous to Lodi, while in columns 4 to 6 we compare Lodi to Southern and Central, or Northern provinces. In all models, we cluster the standard errors at the province and date level. Avg log(PQ) and Avg mob report the average (log) revenues and the average mobility measure in the sample, respectively North), which we use as control group: all models yield very robust results for both outcomes. In Table 5 , we run the baseline exercise by product group, both for revenues (panel A), and for its main component, quantity (panel B). As expected, the product dimension provides an interesting source of heterogeneity: for instance, the estimated parameters for Food and Medical equipment are significantly lower in absolute value than the aggregate model ones, indicating a lower elasticity of demand for goods deemed as "more essential", even in the quarantined zones. Consistently, coefficients referred to Soft Drinks and Personal Appliances are higher in absolute value (− 12 and − 19%, respectively): when forced to limit their shopping trips, consumers choose to reduce their consumption of those products. Finally, the quantity channel is the one driving the estimates: apart from the Household products, the estimates virtually don't change when using Q i pt instead of P Q i pt . The estimate of the lockdown effect does not provide information on the objective risk of contagion. Even in the absence of direct restrictions to personal mobility, in fact, customers may decide to loose their shopping habits and avoid social contacts while shopping the stores. On the other hand, a higher contagion risk may lead to higher levels of precautionary spending and stockpile shopping, because of the uncertainty (2), run per product group. Column 1 reports the baseline estimates, whereas in columns 2 to 7 we restrict the estimation sample to the individual product groups. Panel b) Difference-in-differences results, as in Eq. (2), run per product group and on quantity demanded. In all models, we cluster the standard errors at the province level related to the evolution of the pandemics, to supply capacity of stores, and to the regulatory measures. In this context, the lockdown imposed on February, 23rd in Lodi signalled the urgency and magnitude of the sanitary crisis across the country, irrespective of the objective contagion risk faced (i.e., β sign high Risk = β sign low Risk ). We test this conjecture by both exploiting the geographical variation of infection rate and the regulatory interventions that followed the partial lockdown. First, we exclude Lodi from the analysis (because subject to mobility restrictions, too) and we partition the provinces into treated and control units depending on the physical distance from the epicenter. More specifically, we use the provinces neighboring Lodi as the treatment group, given that they faced the highest contagion risk. The second approach follows from the definition of Red Zone provided by the government 2 days before declaring the nationwide lockdown: we define the treatment group accordingly. We then compare treatment and control group(s) by applying model (2) to obtain estimates of the contagion risk effect only, as long as neither group was subject to any restriction to the mobility, and all differences in revenues must be attributed to the differential contagion risk that informs the definition of treatment. The results are shown in Table 6 , for both Neighbor provinces (columns 1 to 4) and Red Zone provinces (columns 5 to 8). There is no statistically significant difference in any of the group. More specifically, β sign high Risk -β sign low Risk = 0 independently of the control group we choose, whether it be the whole sample (Baseline), the provinces belonging to the first decile of distance from the epicenter (Nearest Neighbors, marked by the solid blue line in Fig. 10 ), all the Southern and Central provinces (Treated vs. Center-South) or the sole Northern provinces (Treated vs. North). This robust evidence also serves as (2). The treatment group are the provinces contiguous to Lodi (Neighbor Provinces, columns 1 to 4) or the Red Zone provinces (columns 5 to 8)-Lodi is excluded from the sample. Column 1 (5) reports the baseline estimates. In column 2 (6) we focus on a restrictive definitions of control group, using the nearest decile of distance from the epicenter, while in columns 3 and 4 (7 and 8) we compare Lodi to Southern and Central, or Northern provinces, respectively. In all models, we cluster the standard errors at the province and date level. Avg log(PQ) reports the average (log) revenues in the sample a placebo test to supports our identifying assumption by confirming that there is no detectable difference in the signalling effect across the country. The grocery chain stores have played a crucial role during the first wave of the Covid-19 pandemics. The sector faced a huge positive demand shock for most product categories-which means, in turn, being able to cope with shortages, adjust the logistics, increase hours and working shifts, all of it right in the middle of an unprecedented crisis, which raised uncertainty and blocked most economic activities. As the Covid 19 is becoming endemic, it is likely to require recurrent restrictions in order to prevent the spreading of contagion: hence, understanding the effects of containment measures on new consumption habits and therefore on retailers' performance has become of first-order importance. This paper contributes by showing that even in sectors not subject to extreme shut down policies, as it has been the case for grocery retailers (i) lockdown measures curb revenue increases by a considerable amount; in particular, they do so through the mobility channel, which mainly affects larger peripheral outlets, whose revenues were around 10% lower than those in their control group; (ii) the contagion risk plays a minor role, its effects being homogeneous across provinces; (iii) changes in quantity purchased, rather than price movements, drive most of the revenue dynamics; (iv) there exist huge variations in both quantity demanded, prices charged and dynamic response across areas and outlet types. Notes: Total Covid-19 identified cases between March and September, 2020, per 100,000 inhabitants at the province level (February/August, 2020). The province of Lodi is in white; the solid blue lines mark all provinces within the first quartile of dsitance from Lodi. The evolution of the "first wave" of Covid-19 pandemics in Italy is reported in Fig. 11a . In particular, we plot the total number of cases-dashed blue line-and the number of new cases recorded-solid red line. The dashed vertical lines mark the starting (red) and ending (black) dates of the national lockdown. The number of cases peaked a few days after the regulatory measures, and slowly decreased afterwards: during the summer, the "respite" period has been characterized by a very low number of new infections. Panel (b) reports the geographical dispersion of the incidence across provinces-i.e., the total number of cases over total population-during the first wave. The two plots highlight the main features of the pandemics in Italy: first, its impact peaked during the lockdown, whereas the rate of infections was sensibly lower from May on, second, it was characterized by a strong local variation: while all provinces recorded at least a few cases, by far the most hit places were located in the northwestern regions (Lodi, Bergamo, Milano) which experienced the worst consequences in terms of lost lives and healthcare systems collapses. The descriptive evidence shown in Fig. 7 , although extremely informative, fails to address more specific questions related to the quantification of the lockdown effect, and does not account for the differentials between Northern and Southern regions. In order to shed further light on these features, we propose a simple fixed effects linear regression model, aimed at capturing the impact of the pandemics on product revenues for each sub-period. More specifically, we run a fixed-effects regression of product revenues (in logarithms) of the following form where i, w, a, k, and s represent the product, the week, the region and the store type, respectively. L w is an indicator function for the weeks from March 9 to May 4, 2020, and captures the gross effect of the lockdown period on product revenues, net of all factors related to the specific period of the year (τ t = week t + month t + year t , which controls for week of the year, month of the year, and year fixed effects), macro-region characteristics (α r ), store type (σ s ) and product group specificity (γ k in the baseline model, replaced by ψ i -i.e., product fixed effects-in a more saturated specification). We augment model (3) in several ways. First, we add indicator functions to capture the anticipation effect ( A i,w,r ,s , active between Feb 1 and Mar 8, 2020) and the respite period (R i,w,r ,s , since the lockdown end)-i.e., All Periods model: (1) Estimated results of model (3) with increasing number of fixed effects: week (column 2), area (column 3), the product class and the store type (column 4), and the individual products (column 5). (2) Estimated results of model (4) with increasing number of fixed effects Table 7 reports the results: the estimated βs are all positive and statistically significant, although less and less as we add fixed effects. Notice that in the All periods specification, the estimated lockdown effect gets reduced by about a half when controlling for each individual product instead of product group (column Prod). Thus, the extent of within-product group variation is an important driver of revenue growth. Despite the generally positive trend, we retrieve a negative Anticipation parameter (δ). The latter seems to indicate that, conditioning on product, area, and outlet type fixed effects, revenues decreased in the weeks immediately preceding the restrictions. Accordingly, this result could hinge on a pattern of spending substitution towards products already highly represented within the consumption basket (e.g., bottled water, canned goods), at the expenses of products unaffected by precautionary stockpiling (e.g. cosmetics and perishables), which experienced a significant reduction. Finally, the Respite parameter (η) is never statistically significant, in line with the descriptive evidence. In order to further investigate how the heterogeneous within-group patterns and the difference across store types drive the lockdown effects, we conduct an analysis at the individual product level and implement two sequential exercises. First, we descriptively quantify the lockdown effect on sold quantities by regressing their (log) value on an indicator for the lockdown period. The estimated model, run product by product, writes (5)). Estimated parameters are sorted in ascending order. The horizontal line is set atβ Q = 0 (b) Density distributions of the lockdown effect on quantity purchased per store type (β Q as defined in (5)). The vertical line is set atβ Q = 0. where q t is the log of total quantity sold at time t, summed across NUTS1 regions and distributive channels, and L t is an indicator function equal to 1 for the lockdown weeks and zero otherwise. Thus, the coefficient β q captures the dynamics observed during the restriction period. It is crucial to stress that such effect is averaged across all areas and distributive channels, and provides an indication of the overall "appeal" of individual products between March and May. In Fig. 12a , we plot all the 450 estimated β qwith their confidence intervals-divided by product category. Results, as expected, show a high degree of heterogeneity across and within categories-e.g., on average, Food products show marked increases, whereas Personal Care goods have usually recorded less sustained growth, except for hygiene-related products, whose request surged, following recommendation on hygiene protocols to limit contagion. Second, we further investigate the differences in quantity demanded among products, conditional on store types, by looking at the distribution of the product-store fixed effects interacted with the lockdown period indicator. In formulas, we estimate a fixed effects model of the form where lockdown × ψ i × σ s is an interaction term that captures the combined (fixed) effects of store types, individual products, and the lockdown period. Such a saturated model yields 462 × 4 = 1848 parameters for the baseline estimate, and likewise for the lockdown period. In order to find the lockdown net effect for each product/store couple, we subtract one another, and in Fig. 12b , we plot the resulting densities. The results confirm a massive detrimental effect of the lockdown on hypermarkets, where most products show negative fluctuations (i.e., estimates below zero) during the mobility restriction period, whereas supermarkets, superettes and mostly discounts faced positive demand shocks. Sum of r (y) (columns 1 and 3) and r 2 (y) (column 2) per period, in million euros. The Full period figure is reported in % change Interacted estimated parameters of (3) for area (column 1), outlet type (column 2), product cluster (column 3) and all categories (column 4) indicators. We only report the parameter estimated for the lockdown period (i.e., lockdown = 1) (3) per store type. All estimated models include week, area, and individual product fixed effects. (b) Lockdown parameter estimates interacted with store type fixed effects: we report the difference between the lockdown and the no-lockdown periods estimates the control series regards the "Respite" period: even after the end of the lockdown, abnormal revenues amount to more than ×33 times those in the 2018-2019 period. Finally, in Table 9 we report the estimated marginal effects of the full Eq. (3). More specifically, we interact the lockdown indicator function with indicator functions for areas (column 1), stores (column 2), product clusters (column 3), or full model (column 4), and report all estimates obtained for the lockdown period. Note that, in order to retrieve the full effect per category, one should subtract from the reported parameter the one obtained for lockdown = 0 (see e.g. panel b of Table 10 ). Propensity score matching and weighting In order to relax the concern that our estimates are driven by unique features of the Lodi province, we both run a propensity score matching model and reweight the difference-in-difference model by the inverse of the propensity score: this way, we make sure that the estimation is driven more by provinces that are, ex-ante, more similar to Lodi, according to the chosen observables. More specifically, to estimate the propensity score we use measures related to both the physical/demographic characteristics and to the market structure whose relative weight might likely influence grocery chain stores revenues (data source in parenthesis): Propensity score-weighted models using kernel-based weights for ATT (column 1) and ATE (column 2), inverse propensity score weighted (column 3) and propensity score matching difference-in-differences (column 4) Using the above variables, we are able to compute the propensity score at the province/product level, and run a full battery of robustness checks. The results, reported in Table 11 , include propensity score-weighted models (using kernel-based weights for ATT, column 1, and ATE, column 2), the inverse propensity score weighted model (column 3) and a classical propensity score matching difference-in-differences (column 4). All of them are extremely robust with respect to the baseline results. Synthetic control group analysis We use a similar approach to overcome the concerns related to the small number of treated units with respect to the control group product/provinces-in fact, a 1/86 ratio. To address this, we run a synthetic control group analysis: in a nutshell, we build a "synthetic Lodi province" using all provinces within the control group, reweighted in order to mimic the observables of the treated province. Once obtained the optimal weights, we confront the (untreated) synthetic Lodi with the actual Lodi, and estimate the effect of the treatment-with extremely robust results. We also repeat the same approach with all untreated provinces, and we use these placebo experiments to yield a sort of inferential analysis on the goodness of the results. Figure 14 reports the very satisfactory results: the counterfactual Lodi leads to a bigger drop in revenues (red line) with respect to all its placebo counterparts (grey lines). Consumer responses to the COVID-19 crisis: evidence from bank account transaction data Yannelis C (2020a) How does household spending respond to an epidemic? consumption during the 2020 COVID-19 pandemic Income, liquidity, and the consumption response to the 2020 economic stimulus payments Tracking the COVID-19 crisis with high-resolution transaction data Consumer spending responses to the COVID-19 pandemic: an assessment of Great Britain Spatial-difference-in-differences models for impact of new mass rapid transit line on private housing values Fear, lockdown, and diversion: comparing drivers of pandemic economic decline 2020. NBER Working Papers 27432 COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown The retail trade sector and the food industry in Italy Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations The length of the available data allows us to check the robustness of the year-onyear changes reported in Fig. 7a . Indeed, a similar picture can be retrieved by using data on 2018 revenues (Fig. 13a) , which supports the hypothesis that the 2020 revenue dynamics were actually abnormally high with respect to previous periods. A further test for robustness is provided in panel (b), where we plot r (y) referring to the 2019 series ("control" sample): aside from increases due to Easter dates, there is no detectable trend in the series, as expected.In Table 8 we analyze the three time intervals individually, for each adjusted series (i.e., r (y), r 2 (y) and r (y) 2019 vs. 2018). We report their sum, period by period, in levels (first three rows, in million e) and in percentage change for the full period (fourth row). Results confirm that the pandemic period dramatically affected total revenues of the GDO. Despite a very modest increase between 2019 and 2018-amounting to about 2% for the whole period, column 3-the effect of the pandemics is clearly identified both by contrasting the 2020 revenues with those in 2019 (+ 11.3%), and with those in 2018 (+ 13.6%). The most striking difference between the 2020 and