key: cord-1010429-p5urgt02 authors: Chen, K. L.; Henneman, L. R.; Nethery, R. C. title: Differential Impacts of COVID-19 Lockdowns on PM2.5 across the United States date: 2021-03-12 journal: nan DOI: 10.1101/2021.03.10.21253284 sha: f189a5cca26eec3401ec1088a88b41438f171908 doc_id: 1010429 cord_uid: p5urgt02 The COVID-19 pandemic has induced large-scale social, economic, and behavioral changes, presenting a unique opportunity to study how air pollution is affected by unprecedented societal shifts. At each of 455 PM2.5 monitoring sites across the United States, we conduct a causal inference analysis to determine the impacts of COVID-19 interventions and behavioral changes ("lockdowns") on PM2.5 concentrations. Our approach allows for rigorous confounding adjustment and provides highly spatio-temporally resolved effect estimates. We find that, with the exception of the Southwest, most of the US experienced increases in PM2.5 during lockdown, compared to the concentrations expected under business-as-usual. To investigate possible drivers of this phenomenon, we use regression to characterize the relationship of many environmental, geographical, meteorological, mobility, and socioeconomic factors with the lockdown-attributable changes in PM2.5. Our findings have immense environmental policy relevance, suggesting that large-scale mobility and economic activity reductions may be insufficient to substantially and uniformly reduce PM2.5. Acute widespread social, behavioral, and economic changes have occurred across the United States in the wake of the COVID-19 pandemic. Most notably, mobility levels decreased substantially 1-3 nationwide during the initial "lockdown period" in March and April 2020 as a result of changes in human behavior and other nonpharmaceutical interventions, such as government-imposed stay-at-home orders, in response to the rapid spread of the virus. The unprecedented actions taken to curb the spread of COVID-19 have created a unique "quasi-experiment" that can be leveraged to study the effect of large-scale behavioral change on air quality. Exposure to fine particle matter, PM 2.5 , has been shown to have significant adverse health effects, including respiratory and cardiovascular morbidity and mortality [4] [5] [6] . To create effective policy to limit PM 2.5 exposure, it is crucial to understand the impact of reductions in certain emission-generating human behaviors on ambient concentrations. Studies of other, smaller scale quasi-experiments have provided some of the strongest evidence for the health impacts of air pollution and effective reduction strategies, e.g., the ban of bituminous coal in Dublin 7 , restrictions on transportation and industrial activities during the 1996 Atlanta 8 and 2008 Beijing Olympic Games 9, 10 . Several recent studies, some of which are in pre-print at the time of writing, conduct analyses of the lockdown effects on air quality using data from a single or a few monitors 11 or using nationwide and/or statewide averages of monitored values 12 . However, due to the vast differences across space in source-specific contributions to PM 2.5 , the crudeness of these aggregated analyses limits their policy relevance. Additionally, a study of 122 US counties by Berman and Ebisu 13 reports a decline in PM 2.5 , on average, during the lockdown period. However, this study did not report or analyze trends over space, nor did they consider the environmental policy implications of their results. In our study, we estimate the effects of the lockdown on PM 2.5 concentrations at each of 455 individual monitors in the Environmental Protection Agency (EPA) monitoring network located throughout the contiguous US. The high spatial resolution of our effect estimates provides more specific policy insights and enables deeper investigation into the factors influencing lockdown-related changes in PM 2.5 . Even in the context of quasi-experimental conditions, isolating the effects of COVID-19 interventions on air pollution in general cannot be achieved through simple before-and-after comparisons nor comparison to a concentrations in the previous year, strategies that are used in much of the existing literature and oft-cited in news media. Numerous time-varying factors that influence air pollution levels, both observable and unobservable, are unaccounted for with such approaches, e.g., meteorology, year-to-year trends, and seasonality. For example, PM 2.5 levels have decreased 44% in the US since 2000 14 . More rigorous approaches are needed to account for pollution trends and time-varying confounders as a means of truly characterizing the causal effects of pandemic-related 2/16 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint behavioral changes. The aims of our study are best represented as a two-stage approach: first, we estimate the lockdown-attributable daily changes in PM 2.5 concentrations at 455 monitoring sites using a causal inference approach that can adjust for both observed and unobserved time-varying contributors to PM 2.5 . Formally, at each monitor, we estimate the PM 2.5 concentrations that would have been expected each day in the absence of the COVID-19 pandemic and subsequent interventions (the "counterfactual") and compare these to the corresponding observed levels at the monitor during the lockdown. Second, we use these effect estimates to identify environmental, geographical, meteorological, mobility, and socioeconomic factors that are associated with changes in PM 2.5 during the lockdown. We also quantify the impact of these short-term PM 2.5 changes on respiratory and cardiovascular disease hospitalizations using the EPA's Core Particulate Matter Health Impact Functions. We define the "lockdown period" for each ground monitor site as beginning on the day of the corresponding state's state-of-emergency declaration and ending on the earlier of April 30th, 2020 or the day businesses began to reopen in that state (see Extended Data Table E1 for the state-specific dates). We estimate the lockdown-attributable changes in PM 2.5 at each of 455 EPA ground monitor sites using a causal inference approach called the Synthetic Control Method (SCM) [15] [16] [17] , which leverages a pre/post-intervention study design to estimate intervention effects adjusted for both time-varying and time-invariant unmeasured confounders under mild assumptions. We use SCM to estimate the "counterfactual PM 2.5 concentrations" during the lockdown period, i.e., the daily concentrations that would have been expected during the 2020 lockdown period in the absence of the lockdown or any non-mandated personal behavioral changes that took place due to COVID-19 during same period. Briefly, for a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint period for the specified monitor. For consistency with the causal inference literature, we refer to these estimates as "average treatment effects on the treated" (ATTs). smoothed across the US using inverse distance weighting. We find that, during the COVID-19 lockdowns, PM 2.5 increased across most of the US compared to what would have been expected under a business-as-usual scenario. We henceforth refer to smaller lockdown-attributable increases or larger lockdown-attributable decreases as "smaller increases", and larger lockdown-attributable increases or smaller lockdown-attributable decreases as "larger increases". The maps suggest that any lockdown-attributable decreases in PM 2.5 are generally limited to areas of the Western and Southwestern United States. However, substantial lockdown-attributable increases were observed in much of the South, Midwest, and Pacific Northwest. Stratifying by US Census-defined regions, we find average region-wide increases of 1.19 µg/m 3 , 1.11 µg/m 3 , 1.94 µg/m 3 , and 0.80 µg/m 3 for the Northeast, South, Midwest, and West, respectively. Over the entire country, we observe an average increase of 1.36 µg/m 3 attributable to pandemic interventions. The formation of PM 2.5 is known to be a very complex process-along with being directly emitted, much of it is formed secondarily in the atmosphere from other pollutants. To better understand factors that may have contributed to the heterogeneity in lockdown-attributable PM 2.5 changes detected by our study (Figure 1 ), we investigate associations between the monitor-level effect estimates and the environmental, geographical, regional, meteorological, mobility, and socioeconomic conditions in the area surrounding the monitor. County-level measures of each feature are obtained and each monitor is assigned the features of the county in which it lies. Using the . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint estimated lockdown-attributable PM 2.5 changes as the outcome, we fit a linear regression model including all of the following features as predictors: 2017 primary PM 2.5 emissions from various sources (residential, industrial processes, industrial boilers, dust, agriculture, and mobile) 18 ; socioeconomic and demographic variables (e.g. poverty rate, proportion of population age 65+, population density) 19 ; county's average relative mobility change during the lockdown period 20 ; level of urbanization 21 ; and indicators for region. Using this model, we find that industrial boiler emissions and population density have significant positive associations with lockdown-attributable increases in PM 2.5 , while a significant negative association was observed for relative mobility decrease, proportion of population age 65+, and the West region. In addition, areas not classified as "large central metro", the most urban characterization, have significantly positive associations with lockdown-attributable increases in PM 2.5 . A summary of model estimates is provided in Figure 2 . Although areas with larger decreases in mobility tended to experience smaller increases in PM 2.5 , our results suggest that any positive air quality impacts of these large-scale mobility decreases were insufficient to offset other non-meteorological factors promoting PM 2.5 formation. Supporting this finding is Table 1 , which shows the amount of PM 2.5 emissions for each US Census region stratified by source 22 . Regions with higher emissions from stationary sources (i.e. fuel combustion sources such as power plants, solvents, agriculture, and other industrial processes) correspond to areas in which we found higher propensity for lockdown-attributable increases in PM 2.5 during the lockdown. Additionally, mobile sources have a comparatively much smaller impact on PM 2.5 emissions (Table 1 ). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) We also investigate these lockdown-attributable PM 2.5 changes relative to a "baseline" We also assess the sensitivity of our findings to a 1-2 day spike in PM 2.5 of unknown origin that we observed during the lockdown period for many monitors in the Midwest. To do so, we identify and remove these outlier days, defined as days on which PM 2.5 levels spiked to 35 µg/m 3 or higher, from our ATT estimates for all monitors. However, the removal of these outliers did not significantly affect the results (see Extended Data Figure E2 ). We calculate the expected impacts of the estimated lockdown-attributable changes in PM 2.5 on respiratory and cardiovascular disease hospitalizations in the age 65+ population for each county in the US, using the EPA's Core Health Impact Functions for Particulate Matter 23 . We sum these changes across counties within each US region to obtain the results displayed in Table 2 . While these figures may not be truly representative of health impacts during the lockdown due to the numerous changes in the public health sphere catalyzed by the pandemic, they are included in order to quantify the scope of the potential impacts under such changes in air pollution. These estimates are small relative to the overall PM 2.5 burden in the US 24 , as expected given the short time-scale and the varying directions of the PM 2.5 changes. There are a number of limitations of our study. First, due to sporadic data collection at certain monitor sites, there is a considerable amount of missing PM 2.5 data, particularly in prior years that are used to establish expected PM 2.5 in the absence of the lockdown. However, much of the missingness is by-design due to EPA monitoring procedures, and we have set fairly stringent monitor inclusion criteria for our analyses to ensure that observed data are sufficient to capture important trends. In addition, our models are flexible enough to account for some missing data and we have visually inspected our results to ensure suitable model fit (Supplementary Figure 1) . Furthermore, the heterogeneity of lockdown procedures, stringency, and adherence to advisories creates difficulty in a clear definition of the lockdown period. However, the time period we have defined was the period of the most acute lockdowns and . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021 we have further mitigated this issue by using consistent criteria for each state. Our study has found that the impacts of the 2020 COVID-19 lockdowns on PM 2.5 levels vary dramatically across the US, with strong regional trends. In the Midwestern and Southern regions, we unexpectedly observe consistent increases in PM 2.5 compared to expected levels absent the lockdowns, while some areas in the Southwest experienced substantial decreases. The results of this important quasi-experiment provide evidence that policies to limit individual-level emissions-generating behaviors like mobility may not, alone, reduce PM 2.5 levels, particularly in certain regions and when accompanied by other social and economic changes. Regulation of stationary emissions sources may be necessary to meaningfully and uniformly reduce PM 2.5 and improve human health. All data preparation and analyses are conducted in R statistical software version 4.0.2. The state-level lockdown period dates used throughout this study are given in Extended Data . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint monitors (1) with no PM 2.5 measurements during the defined lockdown period for their respective state, roughly mid-March to late-April of 2020 (918 monitors); (2) with less than 30 days of data during the lockdown period (23 monitors); (3) with no data prior to 2016 (117 monitors); or (4) with data entirely missing for five or more total years 2010-2019 (67 monitors). After applying these exclusion criteria, 455 monitors remain to be used in our analyses. We rely solely on observed data for our analysis. Our goal is to estimate the daily PM 2.5 concentrations that would have been expected at each monitoring site during the lockdown period under a business-as-usual scenario (i.e., without the lockdowns or the non-mandated personal behavioral changes that took place due to COVID-19 during the same period). Hereafter we refer to these as the "counterfactual PM 2.5 concentrations". As a result of long-term, seasonal and daily trends in PM 2.5 , complex meteorological variability, and other potential unobserved confounding factors, PM 2.5 concentrations from a single previous year cannot be directly utilized to infer the counterfactual PM 2.5 concentrations, nor can a simple average of PM 2.5 concentrations over multiple years of historical data. We therefore use a causal inference approach often referred to as the "synthetic control method" (SCM) to estimate counterfactual PM 2.5 concentrations [15] [16] [17] . SCM was created to analyze the effects of a policy or intervention on an outcome using a case study. It leverages time series containing pre-and post-intervention outcome data from (1) a single unit that received an intervention (the "treated" unit) and (2) a set of control units that did not receive the intervention. Conceptually, using the pre-intervention data from both treated and control units, it creates a weighted average of the time series from the control units that best captures the pre-intervention trends in the time series for the treated unit. Then that same weighted average of the control units' outcomes is used to estimate the outcome that would have been expected in the treated unit during the post-treatment period, in the absence of the intervention (the counterfactual outcome). This weighted average created by SCM is called a "synthetic control". Formally, the optimal weights are identified by obtaining a latent factor representation of the multivariate time series data. SCM is flexible enough to account for both time-varying and time-invariant confounders of the intervention effect under mild assumptions. In addition, it accounts for any remaining pre-lockdown missing data by imputation. In the classic SCM framework, for a given monitor, we consider each year 2010-2020 to be a "unit", and the time series of PM 2.5 concentrations for each year are the outcomes. 2020 is considered the treated unit and all other years are controls. Thus, SCM will create a weighted average of daily 2010-2019 PM 2.5 concentrations at that monitor, and use that weighted average to estimate the counterfactual daily PM 2.5 concentrations during the lockdown period. To create a proper synthetic control, we must ensure that the time series from each year are aligned so that the day represented at a given position in the time series is comparable across years. Because PM 2.5 exhibits weekday and weekend trends that must be accounted for when creating the synthetic control, we align the time series based . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint on day-of-week rather than day-of-year. In particular, we let the time series for each year start on the first Monday of the year, so that aligning entries in the time series represent the same day-of-week and only a few days difference in day-of-year. We implement SCM separately on the data from each of the 455 monitors using the gsynth package in R 16 with a matrix completion estimator 17 . In addition to the default latent factor representation used by SCM, we include in the model fixed effects for year and time series position and adjust for both weather (maximum temperature, maximum relative humidity, precipitation, wind speed, and wind direction) and seasonality (month of year) as time-varying covariates. See Supplementary Figure 1 for monitor-level model fit diagnostic plots. At each monitor, we take the simple difference of the observed PM 2.5 concentrations during the lockdown and the SCM-estimated counterfactual PM 2.5 concentrations to obtain the lockdown-attributable changes in PM 2.5 for each day. We then average these daily effect estimates over the entire lockdown period to obtain the ATT estimate at each monitor, which is shown in Figure 1 . In the second stage of modeling, we use a linear regression model to identify features associated with the is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint including interaction effects between relative mobility change and population age 65+, are provided in Extended Data Table E1 . However, the addition of these interaction effects did not significantly affect the model coefficients. We use exposure-response functions from the EPA's Core Health Impact Functions for Particulate Matter and Hospital Admissions 23 to characterize associations between short-term PM 2.5 exposure and (1) respiratory disease hospitalization risk and (2) cardiovascular disease hospitalization risk 28, 29 . To estimate the health impacts across the entire continental US, we first interpolate the estimated lockdown-attributable PM 2.5 changes to obtain an estimate for each county (including those without an included monitor). Inverse distance weighting is used to interpolate the estimated effects at the monitor sites to each county's centroid, and the resulting interpolated values are treated as each county's lockdown-attributable change in PM 2.5 . We insert each county's PM 2.5 changes into the pre-existing short-term exposure-response functions for each hospitalization type for individuals 65+ (a log-linear model with parameter estimates taken from Kloog et al. 28 and Bell et al. 29 ) to obtain an estimate of its change in hospitalization incidence rate. Baseline incidence rates are calculated using a weighted average of hospitalization incidence rates for people age 65+ (3.352 respiratory hospitalizations/100 people per year; 5.385 cardiovascular hospitalizations/100 people per year). The change in incidence rate is then used to obtain the county's absolute change in hospitalizations for each of the two health outcomes in people age 65+. These values are summed across each US census region and scaled to account for the number of days in the state's lockdown period, i.e., the number of days of the specified change in PM 2.5 exposure. We also estimate the changes in hospitalization incidence that would have occurred if these PM 2.5 changes had been sustained for an entire year. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint Association between mobility patterns and COVID-19 transmission in the USA: a mathematical modelling study Timing of Community Mitigation and Changes in Reported COVID-19 and Community Mobility-Four U.S. Metropolitan Areas Statewide COVID-19 Stay-at-Home Orders and Population Mobility in the United States Epidemiology of fine particulate air pollution and human health: biologic mechanisms and who's at risk? Environ. Heal The impact of PM2.5 on the human respiratory system Fine Particulate Air Pollution and Hospital Admission for Cardiovascular and Respiratory Diseases Effect of air-pollution control on death rates in Dublin, Ireland: an intervention study Impact of Changes in Transportation and Commuting Behaviors During the 1996 Summer Olympic Games in Atlanta on Air Quality and Childhood Asthma An evaluation of resident exposure to respirable particulate matter and health economic loss in Beijing during Beijing Air quality and outpatient visits for asthma in adults during the Nonuniform impacts of COVID-19 lockdown on air quality over the United States the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted Changes in U.S. air pollution during the COVID-19 pandemic Environmental Protection Agency. Our Nation's Air Synthetic control methods for comparative case studies: Estimating the effect of California's tobacco control program Generalized Synthetic Control Method: Causal Inference with Interactive Fixed Effects Models Matrix Completion Methods for Causal Panel Data Models National Emissions Inventory American Community Survey 5-Year Estimates Facebook Data for Good Movement Range Data NCHS urban-rural classification scheme for counties National Emissions Inventory Environmental Protection Agency. Environmental Benefits Mapping and Analysis Program -Community Edition Global burden of 87 risk factors in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019 Air Quality System Data Mart Google Earth Engine: Planetary-scale geospatial analysis for everyone Protecting privacy in Facebook mobility data during the COVID-19 response Acute and Chronic Effects of Particles on Hospital Admissions Assessment of the health impacts of particulate matter characteristics District of Columbia North Carolina Rhode Island South Carolina The authors gratefully acknowledge funding from NIH grants 5T32ES007142, 1K01ES032458-01, and R01AG060232. KLC led all data preparation, conducted all statistical analyses, and prepared a first draft of the manuscript. LRFH provided expertise in air pollution and atmospheric chemistry and assisted with manuscript preparation. RCN assisted with idea generation and study design, aided in data acquisition and preparation, provided statistical input, and assisted with manuscript preparation. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03. 10 .21253284 doi: medRxiv preprint The authors declare no competing interests. Supplementary Information is available for this paper. Correspondence and requests for materials should be addressed to Rachel Nethery at rnethery@hsph.harvard.edu. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)The copyright holder for this preprint this version posted March 12, 2021 . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted March 12, 2021 Table E2 . List of variables included in regression model, with units of measure and abbreviations used throughout. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (Right) ATTs (in µg/m 3 ) smoothed across the US using inverse distance weighting. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)The copyright holder for this preprint this version posted March 12, 2021. ; https://doi.org/10.1101/2021.03.10.21253284 doi: medRxiv preprint