key: cord-0708369-2fokjcjr authors: Chernozhukov, V.; Kasahara, H.; Schrimpf, P. title: Causal Impact of Masks, Policies, Behavior on Early Covid-19 Pandemic in the U.S. date: 2020-05-29 journal: nan DOI: 10.1101/2020.05.27.20115139 sha: de6d37298368b4f1c1b217f9093ac17ff32551dc doc_id: 708369 cord_uid: 2fokjcjr This paper evaluates the dynamic impact of various policies, such as school, business, and restaurant closures, adopted by the US states on the growth rates of confirmed Covid-19 cases and social distancing behavior measured by Google Mobility Reports, where we take into consideration of people's voluntarily behavioral response to new information of transmission risks. Using the US state-level data, our analysis finds that both policies and information on transmission risks are important determinants of people's social distancing behavior, and shows that a change in policies explains a large fraction of observed changes in social distancing behavior. Our counterfactual experiments indicate that removing all policies on April 1st of 2020 would have lead to 30 to 200 times more additional cases by late May. Removing only the non-essential businesses closures (while maintaining restrictions on movie theaters and restaurants) would have increased the weekly growth rate of cases between -0.02 and 0.06 and would have lead to -10% to 40% more cases by late May. Finally, nationally mandating face masks for employees on April 1st would have reduced the case growth rate by 0.1-0.25. This leads to 30% to 57% fewer reported cases by late May, which translates into, roughly, 30-57 thousand saved lives. Accumulating evidence suggests that various policies in the US have reduced social interactions, and have slowed down the growth of Covid-19 infections. 1 An important outstanding issue is, however, how much of the observed slow down in the spread is attributable to the effect of policies per se relative to a voluntarily change in people's behavior out of fear of being infected. This question is critical to evaluate the effectiveness of restrictive policies in the US relative to an alternative policy of just providing recommendations and information such as the one adopted by Sweden. More generally, understanding people's dynamic behavioral response to policies and information is indispensable for properly evaluating the effect of policies on the spread of Covid-19. This paper quantitatively assesses the impact of various policies adopted by the US states on the spread of Covid-19, such as non-essential businesses closure and mandatory face masks, paying a particular attention to how people adjust their behavior in response to policies as well as new information on cases. We present a conceptual framework that spells out the causal structure on how the Covid-19 spread is dynamically determined by policies and human behavior. Our approach explicitly recognizes that policies not only directly affect the spread of Covid-19 (e.g., mask requirement) but also indirectly affect its spread by changing people's behavior (e.g., stay at home order). It also recognizes that people react to new information on Covid-19 cases and voluntarily adjust their behavior (e.g., voluntary social distancing and hand washing) even without any policy in place. Our casual model provides a framework to quantitatively decompose the growth of Covid-19 cases into three components: (1) direct policy effect, (2) policy effect through behavior, and (3) direct behavior effect in response to new information. 2 Guided by the causal model, our empirical analysis examines how the weekly growth rates of confirmed Covid-19 cases are determined by policies and behavior using the US statelevel data. To examine how policies and information affect people's behavior, we also regress social distancing measures on policy and information variables. Our regression specification for case growth is explicitly driven from a SIR model and the estimated regression coefficients can be interpreted from the viewpoint of the effective reproduction number although our causal approach does not hinge on validity of a SIR model. As policy variables, we consider state of emergency, mandatory face masks for employees in public business, stay at home order, closure of K12, closure of restaurants except take out, closure of movie theaters, and closure of non-essential business. Our behavior variables are four mobility measures that capture the intensity of visits to "transit," "grocery," "retail," and "workplaces" from Google Mobility Reports. We take time trend, the lagged growth rate of cases, and the log of lagged cases as our measures of information on infection risks that affects people's behavior. We also consider the growth rate of tests, the log number of tests, and the state-level characteristics (e.g., population size and total area) as the confounders that has to be controlled for in order to identify the causal relationship between policy/behavior and the growth rate of cases. Our key findings from regression analysis are as follows. We find that both policies and information on past cases are important determinants of people's social distancing behavior, where policy effects explain at around 80% or more of observed decline in four behavior variables. We also find that people dynamically adjust their behavior in response to policies and information with time lag. There are both large policy effects and large behavior effects on case growth. The estimates from our lag cases information model imply that all policies combined would reduce the weekly growth rate of cases by around 1. Taking into account of a change in the number of tests, the weekly growth rates of infections arguably declined from around 2 in Figure 1 shows how removing all policies would have changed the average growth rate of cases as well as the number of cases across states. The effect of removing policies on case growth peaks at 0.6 after two to three weeks, and then gradually declines. This decline happens because people voluntarily adjust their behavior in response to information on a higher number of cases. The estimate implies that there would have been 30-200 times more cases (10 to 100 million additional cases) by late May if all policies had never been implemented. Figure 2 illustrates how allowing non-essential businesses open would have affected the case growth and cases. Keeping non-essential businesses open (other than movie theaters, gyms, and keeping restaurants in the "take-out" mode) would have increased the cases by −10% to 40%. These estimates contribute to the ongoing efforts of evaluating various reopening approaches (Stock, 2020a) . Figure 3 show that implementing mandatory face masks on April 1st would have reduced the case growth rate by 0.1-0.25, leading to reductions of −30% to −57% in reported cases in late May. This roughly implies that 30-57 thousand lives would have been saved. 3 This finding is significant: given this potentially large benefit for reducing the spread of Covid-19, mask mandates is an attractive policy instrument especially because it involves relatively little economic disruption. Again, these estimates contribute to the ongoing efforts of designing approaches to minimize risks from reopening. A growing number of other papers have examined the link between non pharmaceutical interventions and Covid-19 cases. 4 Hsiang et al. (2020) estimate the effect of policies on Covid-19 case growth rate using data from the United States, China, Iran, Italy, France, and South Korea. In the United States, they find that the combined effect on the growth rate of all policies they consider is −0.347 (0.061). Courtemanche et al. (2020) use US county level data to analyze the effect of interventions on case growth rates. They find that the combination of policies they study reduced growth rates by 9.1 percentage points 16-20 days after implementation, out of which 5.9 percentage points is attributed to shelter in place orders. Both Hsiang et al. (2020) and Courtemanche et al. (2020) adopted "reducedform" approach to estimate the total policy effect on case growth without using any social HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME SC SC SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT UT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT VT HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX Figure 3 . Average change in case growth and and relative change in cases from mandating masks for employees on April 1st Existing evidence for the impact of social distancing policies on behavior in the US is mixed. Abouk and Heydari (2020) employ a difference-in-differences methodology to find that statewide stay-at-home orders has strong causal impact on reducing social interactions. In contrast, using the data from Google Mobility Reports, Maloney and Taskin (2020) find that the increase in social distancing is largely voluntary, and driven by information. 5 Another study by Gupta et al. (2020) also found little evidence that stay-at-home mandates induced distancing using mobility measures from PlaceIQ and SafeGraph. Using data from SafeGraph, Andersen (2020) show that there has been substantial voluntary social distancing but also provide evidence that mandatory measures such as stay at home order have been effective at reducing the frequency of visits outside of one's home. Pei, Kandula, and Shaman (2020) use county-level observations of reported infections and deaths in conjunction with mobility data from SafeGraph to estimate how effective reproductive numbers in major metropolitan areas change over time. They conduct simulation of implementing all policies 1-2 weeks earlier and found that it would have resulted in reducing the number of cases and deaths by more than half. Without using any policy variables, however, their study does not explicitly analyze how policies are related to the effective reproduction numbers. Epidemiologists use model simulations to predict how cases and deaths evolve for the purpose of policy recommendation. As reviewed by Avery et al. (2020) , there exist substantial uncertainty in parameters associated with the determinants of infection rates and 5 Specifically, they find that of the 60 percentage point drop in workplace intensity, 40 percentage points can be explained by changes in information as proxied by case numbers, while roughly 8 percentage points can be explained by policy changes. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . 6 VICTOR CHERNOZHUKOV, HIROYUKI KASAHARA, AND PAUL SCHRIMPF asymptomatic rates (Stock, 2020b) . Simulations are often done under strong assumptions about the impact of social distancing policies without connecting to the relevant data (e.g., Ferguson et al., 2020) . Furthermore, simulated models do not take into account that people may limit their contact with other people in response to higher transmission risks. When such a voluntary behavioral response is ignored, simulations would produce exponential spread of disease and would over-predict cases and deaths. Our counterfactual experiments illustrate the importance of this voluntary behavioral change. Whether wearing masks in public place should be mandatory or not has been one of the most contested policy issues, where health authorities of different countries provide contradicting recommendations. Reviewing evidence, Greenhalgh et al. (2020) recognizes that there is no randomized controlled trial evidence for effectiveness of face masks but they state "indirect evidence exists to support the argument for the public wearing masks in the Covid-19 pandemic." 6 Howard et al. (2020) also review available medical evidence and conclude that "mask wearing reduces the transmissibility per contact by reducing transmission of infected droplets in both laboratory and clinical contexts." Given the lack of experimental evidence, conducting observational studies is useful and important. To the best of our knowledge, our paper is the first empirical study that shows the effectiveness of mask mandates on reducing the spread of Covid-19 by analyzing the US state-level data. This finding corroborates and is complementary to the medical observational evidence in Howard et al. (2020) . . We introduce our approach through the Wright-style causal diagram shown in Figure 4 . The diagram describes how policies, behavior, and information interact together: • The forward health outcome, Y it , is determined last, after all other variables have been determined; • The adopted policies, P it , affect Y it either directly, or indirectly by altering human behavior B it ; • Information variables, I it , such as lagged valued of outcomes can affect both human behavior, adopted policies , and future outcome; • The confounding factors W it , which could vary across states and time, affect all other variables. The index i describes observational unit, the state, and t describes the time. We begin to introduce more context by noting that our main outcome of interest is the forward growth rate in the reported new Covid-19 cases; behavioral variables include 6 The virus remains viable in the air for several hours, for which surgical masks may be effective. Also, a substantial fraction of individual who are infected become infectious before showing symptom onset. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. proportion of time spent in transit or shopping and others; policy variables include stayat-home order and school and business closure variables; the information variables include lagged values of outcome. We provide detailed description and time of these variables in the next section. The causal structure allows for the effect of the policy to be either direct or indirectthrough-behavior or through dynamics; and all of these effects are not mutually exclusive. The structure allows for changes in behavior to be brought by change in policies and information. These are all realistic properties that we expect from the contextual knowledge of the problem. Policy, such as closures of schools, non-essential business, restaurants, alter and constrain behavior in strong ways. In contrast, policies, such as mandating employees to wear masks, can potentially affect the Covid-19 transmission directly. The information variables, such as recent growth in the number of cases, can cause people to spend more time at home, regardless of adopted state policies; these changes in behavior in turn affect the transmission of Covid-19. The causal ordering induced by this directed acyclical graph is determined by the following timing sequence within a period : (1) confounders and information get determined, (2) policies are set in place, given information and confounders; (3) behavior is realized, given policies, information and confounders; and (4) outcomes get realized, given policies, behavior, and confounders. The model also allows for direct dynamic effect of the information variables on the outcome through autoregressive structures, that capture persistence in the growth patterns. As further highlighted below, realized outcomes may become new information for future periods, inducing dynamics over multiple periods. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . Our quantitative model for causal structure in Figure 4 is given by the following econometric structural equation model: which is a collection of functional relations with stochastic shocks, decomposed into observable part δW and unobservable part ε. The terms ε y it and ε b it are the centered stochastic shocks that obey the orthogonality restrictions posed in these equations. (We say that The policies can be modeled via a linear form as well, although our identification and inference strategies do not rely on the linearity. 7 . The observed variables are generated by setting ι = I it and propagating the system from the last equation to the first: The system above implies the following collection of stochastic equations for realized variables: These equations form the basis of our empirical analysis. Identification and Parameter Estimation. The orthogonality equations imply that these are all projection equations, and the parameters of SEM are identified by the parameters of these regression equation, provided the latter are identified by sufficient joint variation of these variables across states and time. The last point can be stated formally as follows. Consider the previous system of equations, after partialling out the confounders: denotes the residual after removing the orthogonal projection of V it on W it . The residualization is a linear operator, implying that (1) follows immediately from above. The parameters of (1) are identified as projection coefficients in these equations, provided that residualized vectors appearing in each of the equations have non-singular variance, that is Our main estimation method is the standard correlated random effects estimator, where the random effects are parameterized as functions of observable characteristic. The statelevel random effects are modeled as a function of state level characteristics, and the time random effects are modeled by including a smooth trend and its interaction with state level characteristics. The stochastic shocks {ε it } T t=1 are treated as independent across states i and can be arbitrarily dependent across time t within a state. A secondary estimation method is the fixed effects estimator, where W it includes latent (unobserved) state level effects W i and and time level effects W t , which must be estimated from the data. This approach is much more demanding on the data and relies on long cross-sectional and time histories. When histories are relatively short, large biases emerge and they need to be removed using debiasing methods. In our context debiasing materially change the estimates, often changing the sign. 8 However, we find the debiased fixed effect estimates are qualitatively and quantitatively similar to the correlated random effects estimates. Given this finding, we chose to focus on the latter, as a more standard and familiar method, and report the former estimates in the supplementary materials for this paper. (I2) Information variable is lagged value of outcome: 8 This is cautionary message for other researchers intending to use fixed effects estimator in the context of Covid-19 analysis. Only debiased fixed effects estimators must be used. 9 The similarity of the debiased fixed effects and correlated random effects served as a useful specification check. Moreover, using the fixed effects estimators only yielded minor gains in predictive performances, as judging by the adjusted R 2 's, providing another useful specification check. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . https://doi.org/10.1101/2020.05.27.20115139 doi: medRxiv preprint 10 VICTOR CHERNOZHUKOV, HIROYUKI KASAHARA, AND PAUL SCHRIMPF (I3) Information variables include trend, lagged and integrated values of outcome: The first information structure captures the basic idea that, as individuals discover more information about covid over time, they adapt safer modes of behavior (stay at home, wear masks, wash hands). Under this structure, information is common across states and exogenously evolves over time, independent of the number of cases. The second structure arises for considering autoregressive components and captures people's behavioral response to information on cases in the state they reside. Specifically, we model persistence in growth Y it rate through AR(1) model, which leads to I it = Y i,t−7 . This provides useful local, statespecific, information about the forward growth rate and people may adjust their behavior to safer modes when they see a high value. We model this adjustment via the term γ I t in the behavior equation. The third information structure is the combination of the first two structures plus an additional term representing the (log of) total number of new cases in the state. We use this information structure in our empirical specification. In this structure people respond to both global information, captured by trend, and local information sources, captured by the local growth rate and the total number of cases. The last element of the information set can be thought of as local stochastic trend in cases. All of these examples fold into a specification of the form: with the initialization I i0 = 0 and Y i0 = 0. 10 With any structure of this form, realized outcomes may become new information for future periods, inducing a dynamical system over multiple periods. We show the resulting dynamical system in a causal diagram of Figure 5 . Specification of this system is useful for studying delayed effects of policies and behaviors, and in considering the counterfactual policy analysis. Letting C it denotes cumulative number of confirmed cases in state i at time t, our outcome approximates the weekly growth rate in new cases from t − 7 to t. 11 Here ∆ denotes the differencing operator over 7 days from t to t − 7, so that ∆C it := C it − C i,t−7 is the number of new confirmed cases in the past 7 days. 10 The initialization is appropriate in our context for analyzing pandemics from the very beginning, but other initializations could be appropriate in other contexts. 11 We may show that log(∆Cit) − log(∆Ci,t−7) approximates the average growth rate of cases from t − 7 to t. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . We chose this metric as this is the key metric for policy makers deciding when to relax Covid mitigation policies. The U.S. government's guidelines for state reopening recommend that states display a "downward trajectory of documented cases within a 14-day period" (White House, 2020). A negative value of Y it is an indication of meeting this criteria for reopening. By focusing on weekly cases rather than daily cases, we smooth idiosyncratic daily fluctuations as well as periodic fluctuations associated with days of the week. Our measurement equation for estimating equations (BPI→Y) and (PI→Y) will take the form: where i is state, t is day, C it is cumulative confirmed cases, D it is deaths, T it is the number of tests over 7 days, ∆ is a 7-days differencing operator, it is an unobserved error term, where X it collects other behavioral, policy, and confounding variables, depending on whether we estimate (BPI→Y) or (PI→Y). Here are the key confounding variables, derived from considering the SIR model below. We describe other confounders in the empirical section. Our main estimating equation (M) is motivated by a variant of a SIR model, where we incorporate a delay between infection and death instead of a constant death rate, and we add confirmed cases to the model. Let S, I, R, and D denote the number of susceptible, infected, recovered, and dead individuals in a given state or province. Each of these variables All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . VICTOR CHERNOZHUKOV, HIROYUKI KASAHARA, AND PAUL SCHRIMPF are a function of time. We model them as evolving aṡ where N is the population, β(t) is the rate of infection spread, is the duration between infection and death, and π is the probability of death conditional on infection. 12 Confirmed cases, C(t), evolve asĊ where τ (t) is the rate that infections are detected. Our goal is to examine how the rate of infection β(t) varies with observed policies and measures of social distancing behavior. A key challenge is that we only observed C(t) and D(t), but not I(t). The unobserved I(t) can be eliminated by differentiating (8) and using (5) and (7) asC We consider a discrete-time analogue of equation (9) to motivate our empirical specification by relating the detection rate τ (t) to the number of tests T it while specifying S(t) N β(t) as a linear function of variables X it . This results in , which is equation (M), where X it captures a vector of variables related to β(t). Structural Interpretation. In the early pandemics, when the number of susceptibles is approximately the same as the entire population, i.e. S it /N it ≈ 1, the component X it θ is the projection of infection rate β i (t) on X it (policy, behavioral, information, and confounders other than testing rate), provided the stochastic component it is orthogonal to X it and the testing variables: 12 The model can easily be extended to allow a non-deterministic disease duration. This leaves our main estimating equation (9) unchanged, as long as the distribution of duration between infection and death is equal to the distribution of duration between infection and recovery. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 3.1. Total Change Decomposition. Given the SEM formulation above, we can carry out the following decomposition analysis, after removing the effects of confounders. For example, we can decompose the total change EỸ it −EỸ io in the expected outcome, measured at two different time points t and o = t − into sum of three components: where the first two components are immediate effect and the third is delayed or dynamic effect. In the three examples of information structure given earlier, we have the following form for the dynamic effect: The effects can be decomposed into (a) delayed policy effects via behavior by summing terms containing P EB, (b) delayed policy effects via direct impact by summing terms containing DP E, and (c) pure behavior effects, and (d) pure dynamic feedback effects. 3.2. Counterfactuals. We also consider simple counterfactual exercises, where we examine the effects of setting a sequence of counterfactual policies for each state: All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . We assume that the SEM remains invariant, except of course for the policy equation. 13 Given the policies we propagate the dynamic equations: In stating this counterfactual system of equations, we make the following invariance assumption Invariance Assumption. The equations of (CF-SEM) remain exactly of the same form as in the (SEM) Let PY it and PY it denote the predicted values, produced by working with the counterfactual system (CEF-SEM) and the factual system (SEM): In generating these predictions we make the assumption of invariance stated above. Then we can write the difference into the sum of three components: 13 It is possible to consider counterfactual exercises in which policy responds to information through the policy equation if we are interested in endogenous policy responses to information. Although this is beyond the scope of the current paper, counterfactual experiments with endogenous government policy would be important, for example, to understand the issues related to the lagged response of government policies to higher infection rates due to incomplete information. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . https://doi.org/10.1101 /2020.05.27.20115139 doi: medRxiv preprint Similarly to what we had before, the counterfactual dynamic effects take the form: The effects can be decomposed into (a) delayed policy effects via behavior by summing terms containing P EB, (b) delayed policy effects via direct impact by summing terms containing DP E, and (c) pure behavior effects, and (d) pure dynamic feedback effects. 4.1. Data. Our baseline measures for daily Covid-19 cases and deaths are from The New York Times (NYT). When there are missing values in NYT, we use reported cases and deaths from JHU CSSE, and then the Covid Tracking Project. The number of tests for each state is from Covid Tracking Project. As shown in Figure 20 in the appendix, there was a rapid increase in testing in the second half of March and then the number of tests increased very slowly in each state in April. We use the database on US state policies created by Raifman et al. (2020) . In our analysis, we focus on 7 policies: state of emergency, stay at home, closed nonessential businesses, closed K-12 schools, closed restaurants except takeout, close movie theaters, and mandate face mask by employees in public facing businesses. We believe that the first five of these policies are the most widespread and important. Closed movie theaters is included because it captures common bans on gatherings of more than a handful of people. We also include mandatory face mask use by employees because its effectiveness on slowing down Covid-19 spread is a controversial policy issue (Howard et al., 2020; Greenhalgh et al., 2020) . Table 1 provides summary statistics, where N is the number of states that have ever implemented the policy. We also obtain information on state-level covariates from Raifman et al. (2020) , which include population size, total area, unemployment rate, poverty rate, and a percentage of people who are subject to illness. We obtain social distancing behavior measures from"Google COVID-19 Community Mobility Reports" (LLC, 2020) . The dataset provides six measures of "mobility trends" that report a percentage change in visits and length of stay at different places relative to a baseline computed by their median values of the same day of the week from January 3 to February 6, 2020. Our analysis focuses on the following four measures: "Grocery & pharmacy," "Transit stations," "Retail & recreation," and "Workplaces." 14 14 The other two measures are "Residential" and "Parks." We drop "Residential" because it is highly correlated with "Workplaces" and "Retail & recreation" at correlation coefficients of -0.99 and -0.98 as shown in Table 2 . We also drop "Parks" because it does not have clear implication on the spread of Covid-19. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . Median Max State of emergency 51 2020-02-29 2020-03-11 2020-03-16 Date closed K 12 schools 51 2020-03-13 2020-03-17 2020-04-03 Stay at home shelter in place 42 2020-03-19 2020-03-28 2020-04-07 Closed movie theaters 49 2020-03-16 2020-03-21 2020-04-06 Closed restaurants except take out 48 2020-03-15 2020-03-17 2020-04-03 Closed non essential businesses 43 2020-03-19 2020-03-25 2020-04-06 Mandate face mask use by employees 36 2020-04-03 2020-05-01 2020-05-11 Table 1 . State Policies In our empirical analysis, we use weekly measures for cases, deaths, and tests by summing up their daily measures from day t to t − 6. We focus on weekly cases because daily new cases are affected by the timing of reporting and testing, and are quite volatile as shown in Figure 17 in the appendix. Similarly, reported daily deaths and tests are subject to high volatility. Aggregating to weekly new cases/deaths/tests smooths out idiosyncratic daily noises as well as periodic fluctuations associated with days of the week. We also construct weekly policy and behavior variables by taking 7 day moving averages from day t − 14 to t − 21, where the delay reflects the time lag between infection and case confirmation. The four weekly behavior variables are referred as "Transit Intensity," "Workplace Intensity," "Retail Intensity," and "Grocery Intensity." Consequently, our empirical analysis uses 7 days moving averages of all variables recorded at daily frequencies. Table 2 reports that weekly policy and behavior variables are highly correlated with each other, except for the"masks for employees" policy. High correlations may cause multicolinearity problem and potentially limits our ability to separately identify the effect of each policy or behavior variable on case growth but this does not prevent us from identifying the aggregate effect of all policies and behavior variables on case growth. 4.2. The Effect of Policies and Information on Behavior. We first examine how policies and information affect social distancing behaviors by estimating a version of (PI→B): where B j it represents behavior variable j in state i at t − 14. P it collects the Covid related policies in state i at t − 14. Confounders, W it , include state-level covariates, the growth rate of tests, the past number of tests, and one week lagged dependent variable which may capture unobserved differences in policies across states. I it is a set of information variables that affect people's behaviors at t − 14. Here, and throughout our empirical analysis, we focus on two specifications for information. In the first, we assume that information is captured by a trend and a trend interacted with statelevel covariates, which is a version of information structure (I1). We think of this trend as summarizing national aggregate information on Covid-19. The second information specification is based on information structure (I3), where we additionally allow information to vary with each state's lagged growth of new cases, lag(∆ log ∆C, 7), and lagged log new cases, lag(log ∆C, 14). All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Each off-diagonal entry reports a correlation coefficient of a pair of policy and behavior variables. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Dependent variables are "Transit Intensity," "Workplace Intensity," "Retail Intensity," and "Grocery Intensity" defined as 7 days moving averages of corresponding daily measures obtained from Google Mobility Reports. Columns (1)-(4) use specifications with the information structure (I1) while columns (5)-(8) use those with the information structure (I3). All specifications include state-level characteristics (population, area, unemployment rate, poverty rate, and a percentage of people subject to illness) as well as their interactions with the log of days since Jan 15, 2020. The row " j Policy j " reports the sum of six policy coefficients. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Table 3 reports the estimates. Across different specifications, our results imply that policies have large effects on behavior. School closures appear to have the largest effect, followed by restaurant closures. Somewhat surprisingly, stay at home and closure of non essential businesses appear to have modest effects on behavior. The effect of masks for employees is also small. However, these results should be interpreted with caution. Differences between policy effects are generally statistically insignificant; except for masks for employees, the policies are highly correlated and it is difficult to separate their effects. In columns (1)-(4), the estimated sum effect of all policies are substantial and can account for a large fraction of observed declines in behavior variables. For example, retail intensity was approximately -40% at its lowest point in mid April, and the estimated policy coefficients imply that imposing all six policies would lead to 38.2/40 ≈ 95% of the observed decline. Columns (5)-(8) reports the estimated coefficients for the model with lagged dependent variables. Except for transit, the estimated coefficients of lagged growth of new cases and lagged new cases are negative and often significant at conventional levels. This suggests that both the growth and the number of past cases reduce social interactions, perhaps because people are increasingly aware of prevalence of Covid-19 (Maloney and Taskin, 2020) . The estimated aggregate effects of policies in columns (5)-(8) are largely consistent with those reported in columns (1)-(4). Columns (5)-(8) include lagged dependent variables, so to compare the magnitude of estimates between columns (1)-(4) and (5)-(8), we should look at a "long-run" aggregate effect in (5)-(8) by dividing by the 1 −ρ j whereρ j is the estimated coefficient of lagged dependent variable. We put quotes on "long-run" because this is a long-run effect holding information (i.e. lagged growth and lagged log cases) constant. For retail, the implied information constant long-run aggregate effect of policies is estimated at −16.14/(1 − 0.581) = −38.5%, among which −16.1% is attributable to the short-run effect. 4.3. The Direct Effect of Policies and Behavior on Case Growth. We now analyze how behavior and policies together influence case growth rates by estimating equation (BPI→Y) . where B it = (B 1 it , ..., B 4 it ) is a vector of four behavior variables in state i at t − 14, where a lag length of 14 days is chosen to reflect the delay between infection and confirmation. 15 P it includes the Covid related policies in state i at t − 14 that directly affect the spread of Covid-19 after controlling for behavior variables (e.g., masks for employees). We include information variables, I it , as covariates because some information variables such as the past confirmed cases may be correlated with (latent) government policies or people's behaviors that are not fully captured by our observed policy and behavior variables. W X,it consists of state-level covariates. As discussed in section 2.3, equation (12) corresponds to (M) 15 As we review in the Appendix A.6, a lag length of 14 days is broadly consistent with currently available evidence. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . https://doi.org/10.1101/2020.05.27.20115139 doi: medRxiv preprint 20 VICTOR CHERNOZHUKOV, HIROYUKI KASAHARA, AND PAUL SCHRIMPF derived from the SIR model, and the terms δ T ∆ log(T ) it and δ D T it ∆D it ∆C it capture the effect of changing test rates on confirmed cases. Table 4 shows the results of estimating (12). In columns (1) and (3), we use "state of emergency" policy as a proxy for all other policies because a "state of emergency" order can be viewed as preparation for implementing a variety of policies subsequently. The results indicate that mandatory face masks reduce infections while holding behavior constant, suggesting that requiring masks may be an effective preventive measure. Except for mask requirements, policies appear to have little direct effect on confirmed case growth when behavior is held constant. This supports a directed causal model where causality flows from policies to behavior to infections. A useful practical implication of these results are that Google Mobility Reports and similar data might be useful as a leading indicator of potential case growth. This should be done cautiously, however, because other changes in the environment might alter the relationship between behavior and infections. Preventative measures, including mandatory face masks, and changes in habit that are not captured in our data might alter the future relationship between Google Mobility Reports and case growth. Table 5 reports the sum of short run policy and behavior effects. Policy effects are simply the sum of coefficients. The behavior effect is the sum of coefficients weighted by the average of the behavioral variables from April 1st-10th. In the trend information model, the short run and long run effects coincide, abstracting from any feedback mechanism. On the other hand, if policies are enacted and behavior changes, then future cases and information will change. The lag cases information model based on information structure (I3) can capture such feedback effects. However, since the lag cases information model includes lags of both case growth and log cases, computing a long-run effect is not completely straightforward. We investigate dynamic effects that incorporate feedback mechanism through a change in information in section 5. In this section, we focus our analysis of policy effects when we hold information constant. The estimates of the effect of policy on behavior in table 4 and of the effect of policies and behavior and case growth in table 3 can be combined to calculate the total effect of policy as well as its decomposition into the direct and the indirect effects. Table 7 show the direct (holding behavior constant) and indirect (through behavior changes) effects of policy on case growth for the lag cases information model, where the standard errors are computed by bootstrap and clustered on state. The estimates imply that all policies combined would reduce the growth rate of cases by 0.41, out of which 0.21 is attributable to direct effect while 0.20 is attributable to indirect effect through their impact on behavior, although we need to interpret these numbers carefully given that information is held constant. The estimate also indicates that the effect of "masks for employees" policy is mostly direct. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Dependent variable is the weekly growth rate of confirmed cases as defined in equation (3). The covariates include 14 days lagged policy and behavior variables, which are constructed as 7 days moving averages between t − 14 to t − 21 of corresponding daily measures. Columns (1)-(2) use specifications with the information structure (I1) while columns (3)-(4) use those with the information structure (I3). All specifications include state-level characteristics (population, area, unemployment rate, poverty rate, and a percentage of people subject to illness) and their interactions with the log of days since Jan 15, 2020. The row " policies except for masks" reports the sum of five policy coefficients except for "masks for employees" while the row " k w k behavior k " reports the sum of four coefficients of behavior variables weighted by the average of each behavioral variable from April 1st-10th. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The reported numbers are from columns (2) and (4) of Table 4 . The row "masks" reports the estimated coefficients of "masks for employees" on weekly case growth. The row " policies except for masks" reports the sum of five policy coefficients except for "masks for employees" while the row " k w k behavior k " reports the sum of four coefficients of behavior variables weighted by the average of each behavioral variable from April 1st-10th. We can also examine the total effect of policies and information on case growth, by estimating (PI→Y). The coefficients on policy in this regression combine both the direct and indirect effects. As show in the final two columns of table 7, the policy effects from estimating (PI→Y) are about twice as large as the policy effect from combining (BPI→Y) and (PI→B). Table 6 shows the full set of coefficient estimates for (PI→Y). The results are broadly consistent with what we found above. As in tables 3 and 7, school closures appear to have the largest effect on growth rates, followed by restaurant closures. However, as mentioned previously, the policies are highly correlated, so it is difficult to separately identify their impacts except mask mandates. The coefficient of mandatory face masks is estimated at -0.174 and significant at 5 percent level. As reported in column "PI→Y Coefficient" of Table 7 , the estimates imply that all policies combined would reduce ∆ log ∆C by 0.101 for the lag information model. For comparison, the daily average of ∆ log ∆C reached its peak in mid-March of about three. Since then it has declined to near 0. This peak of 3 is arguably somewhat overstated because there was also a rapid increase in testing at that time. If we adjust ∆ log ∆C for increasing in testing by subtracting 0.166∆ log T , its peak is around 2. Thus, the estimate of -0.101 roughly imply that policy changes can account for one half of the observed decrease in case growth. The remainder of the decline is likely due to changes in behavior from information. We now turn our focus to the specification with information as lagged dependent information. This allows dynamic feedback effects. Policy and behavior changes that reduce case growth today can lead to a more optimistic outlook in the future, attenuating longer run effects. To simplify and focus on dynamics, in this section we work with the estimates of the total policy effects (equation PI→Y and table 6), rather than decomposing into direct and via-behavior effects. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. . Dependent variable is the weekly growth rate of confirmed cases as defined in equation (3). The covariates include 14 days lagged policy variables, which are constructed as 7 days moving averages between t − 14 to t − 21 of corresponding daily measures. Columns (1)-(2) use specifications with the information structure (I1) while columns (3)-(4) use those with the information structure (I3). All specifications include state-level characteristics (population, area, unemployment rate, poverty rate, and a percentage of people subject to illness) and their interactions with the log of days. The row " j Policy j " reports the sum of six policy coefficients. We first consider the impact of changing from the observed policies to none. For illustrative purpose, we begin by focusing on Washington. Figure 6 shows the observed, estimated average, and counterfactual without policies average of ∆ log ∆C in Washington. To compute the estimated and counterfactual paths we use the estimates from column 2 of table 6. We set initial ∆ log ∆C and log ∆C to their values first observed in the state we are simulating. We hold all other regressors at their observed values. Error terms are drawn with replacement from the residuals. We do this many times All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Direct effects capture the effect of policy on case growth holding behavior, information, and confounders constant. Direct effects are given by π in equation (BPI→Y). Indirect effects capture how policy changes behavior and behavior shift case growth. They are given by α from (BPI→Y) times β from (PI→B). The total effect is π + βα. Column "PI→Y Coefficients" shows the coefficient estimates from PI→Y. Column "Difference" are the differences between the estimates from (PI→Y) and the combination of (BPI→Y) and (PI→B). Standard errors are computed by bootstrap and clustered on state. and report the average over draws of the residuals. To calculate confidence intervals, we repeat the above with coefficients drawn randomly from their asymptotic distribution. The shaded region is a point-wise 90% confidence interval. We can see that fit of the estimated and observed ∆ log ∆C is quite good. Figure 6 shows the change in case growth from removing policies. Since there is a two week lag between policy implementation and their effect on recorded cases, the counterfactual without policy average begins differing from the average with policies two weeks after each policy was implemented. The effect of removing policies on case growth peaks at about 0.75 and then slowly declines. This decline is due to a feedback effect of information. Without policies, cases grow more rapidly. With more dire case numbers, people adjust their behavior to reduce infection. This somewhat offsets the increase in infection from removing policies. Given the delay between infection and reported case numbers, this offsetting effect takes some time to appear. Even small differences in case growth can lead to very large differences in case numbers. Removing policies in Washington increases case growth from around 0 to above 0.5 throughout most of April. Figure 7 shows that this increase in growth implies a tremendous increase in cases. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. The effect of removing cases in other states is broadly similar to that in Washington. Figure 8 shows the change in growth rate and cases from removing policy interventions in Illinois and Massachusetts. Figure 9 shows the average across states of the change in case growth. Similar to what we saw in Washington, removing policies leads to an increase of about 0.6 in case growth throughout April. This increase in growth implies a very large increase in cases by late All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. May. 16 Figure 10 displays the national increase in aggregate cases without any policy intervention. The estimates imply 10 to 100 million additional cases by late-May, or a 30-200 fold increase. 16 This estimate should be interpreted somewhat cautiously because it involves some extrapolation. The information variables, in particular log C, soon reach levels not seen in the data. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI HI ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME SC SC SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IN IN IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX Relative effect of removing policies on cases 5.2. Non essential Business Closures. A particularly controversial policy is the closure of non essential businesses. We now examine a counterfactual where non essential businesses are never closed. Figure 11 shows the effect of leaving non essential businesses open in Washington. Non essential businesses have a modest impact on cases, with an average increase of 1000 by late May, with a 90% confidence interval from roughly -1000 to 4000. Figure 13 shows the national aggregate effect of leaving non essential businesses open. The estimates imply that with non essential businesses open, cases would be about -10 to 50% higher in late May. 5.3. Mask Mandate. As discussed earlier, we find that mask mandates reduce case growth even holding behavior constant. In other words, mask mandates may reduce infections with relatively little economic disruption. This makes mask mandates a particularly attractive policy instrument. In this section we examine what would have happened to cases if all states had imposed a mask mandate on April 1st. Figure 14 shows the changes in case growth and cases in Washington. In Figure 15 , implementing mandatory mask policy on April 1st would have reduced the total number of confirmed cases by 2,500-12,500 in Illinois and 2,000-11,000 in Massachusetts by late-May. Figure 16 shows the national change in case growth and cases. Mandating masks on April 1st leads to 30 to 57% fewer cases nationally in late May. This paper assesses the effects of policies on the spread of Covid-19 in the US using the US state-level data on cases, tests, policies, and social distancing behavior measures from Google Mobility Reports. Our findings are summarized as follows. First, our empirical analysis indicates that mandating face masks have reduced the spread of Covid-19 without affecting people's social distancing behavior measured by Google Mobility Reports. Our counterfactual experiment based on the estimated model indicates that, if all states had have adopted mandatory face mask policy on April 1st of 2020, then the All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020 . . https://doi.org/10.1101 Second, we find that the impact of all policies combined on case growth is quite large, reducing the growth rate of cases by 60 percent. Except for mandatory face mask policy, the effect of all other policies on case growth is realized through their impact on social distancing behavior. Our counterfactual experiment shows that, had all policies have been removed from all states on April 1st of 2020, there would have been 10 to 100 million additional cases ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX Third, we find that people voluntarily reduce their visits to workplace, transit, grocery, and transits when they receive information on a higher number of new cases. This suggests that individuals make decisions to voluntarily limit their contact with others in response to greater transmission risks, leading to an important feedback mechanism to future cases. As our counterfactual experiments illustrate, model simulations that ignore this people's All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. voluntary behavioral response to information on transmission risks would over-predict the future number of cases. Beyond these findings, our paper presents a useful conceptual framework to investigate the relative roles of policies and information on determining the spread of Covid-19 through their impact on people's behavior. Our causal model allows us to explicitly define counterfactual scenarios to properly evaluate the effect of alternative policies on the spread of All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD SD TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TN TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX TX Table 9 . Portion of cumulative cases that are equal between data sets Figure 17 shows the evolution of new cases in each of these three datasets. In all cases, daily changes in cumulative cases displays some excessive volatility. This is likely due to delays and bunching in testing and reporting of results. Table 10 shows the variance of log new cases in each data set, as well as their correlations. As shown, the correlations are approximately 0.9. Also of note, log of The New York Times new case numbers has the lowest variance. 17 In our subsequent results, we will primarily use the case numbers from The New York Times. NYT JHU CTP NYT 1.00 0.88 0.87 JHU 0.88 1.00 0.81 CTP 0.87 0.81 1.00 Variance 5.84 7.17 6.78 Table 10 . Correlation and variance of log daily new cases For most of our results, we focus on new cases in a week instead of in a day. We do this for two reasons as discussed in the main text. First, a decline of new cases over two weeks has become a key metric for decision makers. Secondly, aggregating to weekly new cases smooths out the noise associated with the timing of reporting and testing. Table 11 reports the correlation and variance of weekly log new cases across the three data sets. Figure 18 shows the evolution of weekly new cases in each state over time. 17 This comparison is somewhat sensitive to how you handle negative and zero cases when taking logs. Here, we replaced log(0) with log(0.1). In our main results, we work with weekly new cases, which are very rarely zero. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Table 12 reports the correlation and variance of weekly deaths in the three data sets. Figure 19 shows the evolution of weekly deaths in each state. As with cases, we use death data from The New York Times in our main results. Figure 20 show the evolution of tests over time. A.4. Social Distancing Measures. In measuring social distancing, we focus on Google Mobility Reports. This data has international coverage and is publicly available. We also All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. compare Google Mobility Reports to other social distancing measures from SafeGraph, Unacast, and PlaceIQ. Each of these other data sources are limited to the United States. SafeGraph and Unacast data is not publicly available (both companies make the data available to researchers at no cost). All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 29, 2020. These figures use the "total test results" reported by The Covid Tracking Project. This is meant to reflect the number of people tested (as opposed to the number of specimens tested). The Immediate Effect of COVID-19 Policies on Social Distancing Behavior in the United States Early Evidence on Social Distancing in Response to COVID-19 in the United States Policy Implications of Models of the Spread of Coronavirus: Perspectives and Opportunities for Economists The early phase of the COVID-19 outbreak in Strong Social Distancing Measures In The United States Reduced The COVID-19 Growth Rate Impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand Face masks for the public during the covid-19 crisis Causal diagrams for epidemiologic research Tracking Public and Private Responses to the COVID-19 Epidemic: Evidence from State and Local Government Actions The probability approach in econometrics Econometric Evaluation of Social Programs, Part I: Causal Models, Structural Models and Econometric Policy Evaluation Face Masks Against COVID-19: An Evidence Review The Effect of Large-Scale Anti-Contagion Policies on the Coronavirus (COVID-19) Pandemic Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction Bo Yang, Gabriel M. Leung, and Zijian Feng. 2020. "Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Google COVID-19 Community Mobility Reports Determinants of Social Distancing and Economic Activity during COVID-19: A Global View World Bank Group MIDAS 2019 Novel Coronavirus Repository: Parameter Estimates On the application of probability theory to agricultural experiments. Essay on principles. Section 9 Differential Effects of Intervention Timing on COVID-19 Spread in the United States COVID-19 US state policy database Estimating causal effects of treatments in randomized and nonrandomized studies Epidemiology and clinical features of COVID-19: A review of current literature Reopening the Coronavirus-Closed Economy Data Gaps and the Policy Response to the Novel Coronavirus Determination and interpretation of supply curves: an example Guidelines for Opening Up America Again Tariff on animal and vegetable oils All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity As discussed in the main text, our analysis focuses on seven policies. For stay at home, closed nonessential businesses, closed K-12 schools, closed restaurants except takeout, and close movie theaters, we double-checked any state for which MIDAS (2020) maintain a list of estimates of the duration of various stages of Covid-19 infections. The incubation period, the time from infection to symptom onset, is widely believed to be 5 days. For example, using data from Wuhan, Li et al. (2020) the time between symptom onset and case reporting are less common ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ME ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID ID IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IL IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA IA