key: cord-0065006-ase67oa5 authors: Chan, Jeff title: The Geography of Social Distancing in Canada: Evidence from Facebook date: 2020-07-01 journal: nan DOI: 10.3138/cpp.2020-050 sha: 31e8821958e372cecf7c7708b7c3d486899b490d doc_id: 65006 cord_uid: ase67oa5 This article analyzes which characteristics are correlated with mobility reductions during the COVID-19 pandemic, using census-division-level mobility data for Canada from Facebook. There is significant variation in the extent to which social distancing was applied in April, relative to a preperiod of February. I find that the population and population density of a census division are strongly correlated with larger mobility reductions. Conversely, I find that areas with a larger share of dwellings that are apartments exhibit smaller mobility reductions, suggesting that those in tighter living conditions may find it less possible to stay at home during the pandemic. Finally, I examine the persistence of mobility reductions into May and show that areas with a larger apartment dwelling share are more likely to maintain their social distancing over time. The COVID-19 pandemic has radically altered the lives of Canadians in a very short span of time. In the absence of a vaccine, the primary tactic of public health offi cials to combat this virus has been to encourage social distancing and staying at home. In a limited window around the middle of March, all provinces enacted states of emergency, encouraging or mandating staying at home and shutting down large swathes of the economy. Preliminary data suggest that mobility reductions around the world have been steep since the declaration of the World Health Organization that COVID-19 was a pandemic on 11 March 2020. Cases of COVID-19 and deaths have continued to climb in many countries around the world, including Canada. This only reinforces the importance of maintaining social distancing while medical researchers work to develop a vaccine. Unfortunately, the extent to which different regions, even within the same country, have adhered to social distancing has varied. Researchers around the world have examined the determinants of why some areas reduce their movements more than others, but Canada has lagged because of a lack of fi nely disaggregated data on mobility. Anecdotal evidence suggests, however, that social distancing has not been a primary concern for everybody. News reports have abounded of people fl outing such guidelines across Canada. 1 Despite this clear anecdotal evidence that social distancing is not uniformly important to all Canadians, no academic research has been done in Canada on the determinants of social distancing using fi nely disaggregated mobility data at the subprovincial level. This article presents this important fi rst evidence, using recently released CDs; in other words, CDs with more people living more densely reduce their mobility by more. I also show that areas in which health occupations are a larger proportion of employment have reduced their mobility less, likely refl ecting continued employment as essential workers. I then examine the decay of mobility reductions over time by comparing April mobility reductions with May mobility reductions. I show that, although mobility has started increasing across Canada in aggregate, this is not occurring differentially by most CD-level characteristics. The one exception is apartment dwelling share, which is correlated with continued mobility reduction into May. These fi ndings highlight the fact that lockdown fatigue, which can cause decreased social distancing adherence, does not appear to be correlated with a host of likely characteristics such as income, age, and education. My article has two main contributions. First, I provide fi rst academic evidence for the determinants of social distancing in Canada, particularly using data below the mobility data from Facebook. Importantly, these data are available at the census division (CD) level, which allows me to utilize signifi cant geographical variation in various characteristics to understand why social distancing varies across Canada. Figure 1 below presents the variation in mobility reductions in April relative to February, across all Census Divisions in my sample. The map, whose colours are broken into seven quantiles, with darker colours representing greater mobility reductions, clearly illustrates the variation that I aim to exploit. 2 Put simply, the extent of mobility reductions post-COVID vary across Canada, and this study will increase our understanding of why this is the case. I present two key results. First, I estimate a series of regressions estimating how mobility reductions (in April, relative to February) are correlated with a series of CDlevel work, demographic, and geographic variables. I fi nd that population and population density consistently and signifi cantly increase the mobility reductions across measure of tiles visited is calculated, with some noise added for privacy. This tile-based measure is then compared with a baseline average number of tiles visited in February (on that same day of the week) to calculate a relative measure of movement change based on these tile visits for all users in each region. 6 I take the daily level data and take the mean for each CD and month after March, resulting in a data set that measures the mean levels of mobility reduction in April and May relative to February. Readers interested in the specifi c methodology used in the construction of this data can refer to a working paper by Facebook ( Maas et al. 2020 ) . Much like any other mobility-based dataset based on cell phone or location tracking, one might be concerned about the representativeness of the Facebook data. Unfortunately, there is little that can be done to verify that Facebook data are representative of the Canadian population for each census division. An overview of Canadian social media in 2017 by Gruzd et al. (2018 ) estimates using online survey data that 84% of online Canadians have Facebook accounts, with 79% of Facebook users reporting daily usage. Importantly, 75% of respondents aged 55 and above reported having a Facebook account, suggesting that a bias towards younger Canadians is not likely to be a major concern with the Facebook data. The prevalence of Facebook use and regular daily use among Canadians is also reassuring for external validity purposes. I pair the Facebook data with CD-level variables from census profi le data based on the 2016 Census of Population. 7 The variables I use are taken from these profi les or are constructed from them. The variables used can be divided into three categories: (a) work (log median household income, share of total income made up by government transfers, the unemployment rate, share of employed workers in health, share of employed workers in sales and service occupations), (b) geographic/population (log population, log population density, share of occupied private dwellings that are apartments), and (c) demographic (share of immigrants, log median age, married population share, share of couples with children, share of the population with no postsecondary education). 8 I calculate CD-level weather for February, April, and May using monthly climate summaries at the weather station level. 9 For each CD, I fi nd the weather station that is closest to its centroid and use that station's data to represent weather for that CD. I make use of the mean temperature and the total precipitation in that month to capture weather in a given CD month. I also make use of GIS data for CD shapefi les from GADM to help facilitate matching the Facebook data and the census data. 10 After merging all data together, I am left with 260 CDs with non-missing data. 11 For reference, there are 293 CDs across Canada. The 260 CDs contain 34,763,738 out of provincial level. I show that denser areas are more likely to reduce their movements, but areas where apartments are a larger share of dwellings are less likely to do so. I also provide evidence on how social distancing has changed over time, a topic that is becoming increasingly important as so-called lockdown fatigue chips away at stay-at-home policies' effectiveness. This paper relates to a small amount of prior work that has attempted to understand how mobility in Canada has changed post-COVID. Chan (2020 ) uses Google Mobility Trends data to provide some preliminary evidence at the provincial level of how mobility has declined in Canada. Cardoso and Wang (2020 ) used data from Environics Analytics, a private company, to provide some descriptive maps of mobility reductions in Canada at a subprovincial level. Environics Analytics also published a blog post looking at mobility reductions in relation to their marketing consumer categories. My article greatly expands on this existing analysis by using subprovincial data that are now freely available to analyze the determinants of social distancing academically, using applied econometric methods. My article also relates to work from other countries that have studied the determinants of mobility reductions. Anderson et al. (2020 ) , Chiou and Tucker (2020 ) , and Gupta et al. (2020 ) examine the determinants of US sub-state-level social distancing. 3 While the prevalence of immediately available data from companies such as SafeGraph, Google, and Descartes Labs led to a large part of the early mobility literature being done on the United States, other countries' researchers have also started expanding this research to other countries. Dahlberg et al. (2020 ) , for example, conduct an analysis of mobility reduction in Sweden, while Egorov et al. (2020 ) study how mobility has declined across cities in Russia; Egorov et al. (2020 ) , in particular, focus on ethnic fractionalization. Similar to these studies, I provide fi rst subprovincial academic evidence of the determinants of social mobility reductions in Canada. The data used in this paper come from two main sources. First, I make use of Facebook's Movement Range Maps data, which provide subprovincial information on mobility reductions freely and at no cost for the fi rst time. The data are gathered from users of the mobile app version of Facebook with Location History enabled. Data are aggregated to the census division level 4 for data privacy, and mobility reduction indices have been constructed by Facebook showing the percentage of relative change in mobility at a daily level starting from 1 March 2020 relative to February movement levels. 5 The relative mobility change variable is constructed by Facebook using small tiles of roughly several hundred square meters and calculating, for all people in their data, the number of tiles each person is observed in each day; using this, an deviation. Consistent with the map-based evidence in Figure 1 , this suggests that social distancing was not uniform across Canada. Another notable statistic is the mean level of mobility reduction in May relative to February. This mean is notably lower, suggesting that movement was only about 20 percent lower on the average in May, relative to February. While some of this may be due to seasonal trends, it motivates my analysis of changes in movement reductions across April and May across CDs, to test whether the same determinants of movement reductions are also correlated with the decay in social distancing adherence over time. I next list the top and bottom fi ve CDs by their mean change in mobility in April relative to February in Table 2 . The CDs with the largest mobility reductions are all in Quebec and have approximately 50% mobility reduction in April relative to February. In contrast, the bottom 5 CDs in terms of movement reduction are spread over a variety of provinces and have reduced their movement by signifi cantly less, by 5-13% relative to the baseline February period. This again suggests that there is considerable heterogeneity in social distancing across Canada. In the fi nal descriptive result, I ill ustrate the correlations between the various dependent and independent variables of interest in the regression analysis. These correlations are calculated by weighting each CD by its population in 2016, as in the regressions. These are 35,151,728 total people in Canada. Importantly, as data on the territories are not consistently available, I omit them from the analysis and focus on the provinces. In Figure 2 , I fi rst display a population-weighted aggregated time series of movement reductions in Canada from 1 March onwards. The fi gure clearly shows that movement remained relatively stable in early March, until approximately 11 March, when the WHO fi rst declared a pandemic. Shortly thereafter, provincial governments declared states of emergency. By late March, mobility reductions had reached their greatest extent; although movement reductions started trending upwards starting in April, movement was still notably low relative to the February baseline. In Table 1 , I display summary statistics at the CD level for my key variables, with statistics calculated by weighting each CD by its 2016 population. Importantly, the relative change variable from Facebook measures signifi cant mobility reductions in April relative to February. On average, mobility declined by over 38 percent at the CD level. This suggests, consistent with Figure 2 , that movements of Canadians were much lower after the onset of the pandemic and the subsequent policies associated with it. In addition, changes in mobility seem to vary substantially across CDs, as can be seen in its standard Date presented in Table 3 . Many of these correlations display the expected relationships between various variables, such as the strong positive correlation (0.74) between log population density and the share of dwellings that are apartments. The table also suggests that there exists some strong correlations, both positive and negative, between the change in mobility during April (in the fi rst column of the matrix) and various CD characteristics such as population, population density, education, immigrant share, and apartment dwelling share. In Table A .1 in the Appendix, I also present unweighted Spearman rank correlation coeffi cients between all left-and right-hand-side variables. 12 I test what types of characteristics at the CD level help determine mobility reductions by estimating the following regression specifi cation: . m c,Apr , the dependent variable, is the mean level of mobility reduction in April for CD c . I regress this against a vector of CD-level characteristics, X c , which are described in the data section. I control for CD-specifi c weather changes using a vector of control variables W c Apr Feb , , which captures the change in the mean temperature and total precipitation experienced by CD c . 13 Finally, I include province fi xed effects; this removes the effect of provincewide policy changes such as province-specifi c restrictions on work or movement. In the analysis, standard errors are clustered by province. Observations are weighted by each CD's population in 2016. Table 4 shows the results from these regressions. In the fi rst three columns, I estimate the work, geographic, and demographic variables separately in sets. Column 1 estimates the impact of the work variables. I fi nd that, in contrast to American results in Chiou and Tucker (2020 ), higher-income CDs actually reduce their mobility by less. In addition, I fi nd that CDs that rely more on government transfers also reduce their mobility by less. Finally, the size of the coeffi cients suggests that CDs with a higher service and sales occupation employment share (which likely had their employment adversely impacted) were more likely to reduce their mobility, while health-employment-intensive CDs (where employment likely continued) were less likely to reduce their mobility, although the health coeffi cient is not statistically signifi cant. I then move on to column 2 of Table 4 , which shows the impact of geographic/population variables on mobility reductions. I fi nd that CDs with a higher population and a higher density are associated with larger mobility reductions. This suggests that larger, more dense urban areas may have reduced their mobility by more than their non-urban counterparts. The share of dwellings that are apartments is positive and statistically signifi cant, indicating that apartment dwellers stay at home less. Column 3, using demographic variables, shows that CDs with higher population shares of immigrants have larger mobility reductions. I also show that the share of the population without postsecondary education is also correlated with lower mobility reduction, staying at home less than those living in more educated CDs; it is possible that this is being driven by omitted work variables, though, which I test in the following set of results. I also show that the share of couples with children is negatively correlated with mobility changes, suggesting that areas where couples are more likely to have children stayed at home more in April than other CDs. Finally, I estimate a regression with all variables and report results in column 4. Many prior variables that were statistically signifi cant are now no longer so. For example, log income and service employment share is of much smaller magnitude and no longer signifi cant in this last column, suggesting that the prior results were partly driven by correlation with omitted variables. I fi nd that the mobility reductions by population and population density remain statistically signifi cant and of similar magnitude, further verifying that these area characteristics have a strong relationship to social distancing adherence. Similarly, the apartment dwelling share and the health employment share remain statistically signifi cant and are still associated with less mobility reduction (with its positive coeffi cient). These four variables therefore seem to have the most robust association with changes in mobility post-COVID, even after controlling for a host of mitigating factors. Having examined the determinants of mobility reductions across CDs, I now turn to analyzing whether the same determinants are correlated with the persistence of mobility reductions across time (specifi cally, from April to May). As Figure 2 showed, mobility reductions clearly decline over time. I provide evidence as to whether these declines in social distancing exhibit a pattern across CDs. I use the same main regressors, but the dependent variable now measures the difference in the mean mobility reduction in May and the mean mobility reduction in April. This captures the average change in mobility reduction that occurred within a given CD. I also alter the weather control variable so that it now accounts for the change in weather that occurred across April and May. The results for these regressions are displayed in Table 5 . I do not explain the bulk of the results in this table, in part for brevity but also because many of the determinants do not seem to be robustly correlated (positively or negatively) with persistence in mobility reductions once all variables are included in the fi nal column. The one exception is the apartment dwelling share, which is negatively correlated with mobility reduction persistence from April to May. In this specifi cation, this means that CDs with more apartments as dwellings persist more in their social distancing from April to May. Otherwise, none of the other determinants from Table 4 are statistically signifi cant. This implies that the declines in mobility reductions seen in April to May are not correlated with Notes: Regressions also include weather control variables and province fi xed effects. *** denotes statistical signifi cance at the 1% level, ** denotes statistical signifi cance at the 5% level, and * denotes statistical signifi cance at the 10% level. Standard errors are clustered by province. The dependent variable is the mean change in mobility in April, relative to February. Observations are weighted by each CD's population. Source: Author's calculations. any particular characteristic. Going back to the motivating example of Trinity Bellwoods Park on 24 May 2020 in the Introduction , my results imply that this is not the byproduct of certain types of areas becoming more fatigued by long-running lockdowns. This paper provides evidence on the determinants of geographical mobility reductions during the COVID-19 pandemic at the subprovincial level in Canada. One of the most striking results is that population and population density are strongly correlated with more mobility reductions. I also fi nd that areas with a higher apartment dwelling share are less likely to have reduced mobility; one potential explanation for this result could be that those living in apartments are unable to socially distance as effectively, owing to their living, social, or work situations. This suggests that more work needs to be done to examine how COVID-19 has impacted the most vulnerable segments of the Canadian population, to ensure that their economic circumstances are not translating into inequality of outcomes during the pandemic. I also highlight the fact that area characteristics are not correlated with the persistence of mobility reductions into May, with the exception of apartment dwelling shares. This highlights that, if Canadians are becoming tired of staying at home and moving around more over time, this is not differentially the case for different types of CDs. Policymakers and the general public should not therefore believe this fatigue is isolated to one particular demographic or region. Another aim of this paper was to highlight the usefulness of subprovincial data in tracking adherence to social distancing both across geographies and over time. My mandates increased staying at home in the US during the COVID-19 pandemic. Given that only 42.4% of employees in Canada have employer-provided sick leave (see Chen and Mehdi 2019 ), a similar study in Canada may fi nd different results. 4 Observations in the Facebook data nearly always match up one to one with 2016 CDs. The lone exception is for Strathcona and Comox Valley, which are presented as one unit in the Facebook data. I simply assign the aggregated value from the Facebook data to each of the two aforementioned CDs in the analysis. 5 Unfortunately, the data are only available from 1 March 2020 onwards. This precludes the use of any preperiod data to form a counterfactual level of movement. 6 There is also another measure in the data that measures the amount of "staying put" that takes place each day in each CD. This is the number of users who stay in one tile during work aggregated the timing of the data to the monthly level, but other work by researchers could take advantage of the daily frequency to examine more specifi c incidents or leverage even more variation. 1 In Toronto during Victoria Day weekend, for example, outrage focused on images and videos of a crowded Trinity Bellwoods Park. 2 Cross-hatched areas represent CDs where data are unavailable or are not in my sample. These largely belong to the territories, or are CDs where population density is very low. 3 While informative, this US-based work also highlights that more work needs to be done in the Canadian context. For example, Anderson et al. (2020 ) fi nd that sick leave Notes: Regressions also include weather control variables and province fi xed effects. *** denotes statistical signifi cance at the 1% level, ** denotes statistical signifi cance at the 5% level, and * denotes statistical signifi cance at the 10% level. Standard errors are clustered by province. The dependent variable is the difference in the mean change in mobility in May minus the mean change in mobility in April. Observations are weighted by each CD's population. Source: Author's calculations. Effect of a Federal Paid Sick Leave Mandate on Working and Staying at Home: Evidence from Cellular Device Data Monthly Cimate Summaries Canadians Sticking Closer to Home amid Shutdown, Analysis Finds Using Google Data to Understand Canadian Movement Reductions During the COVID-19 Pandemic Assessing Job Quality in Canada: A Multidimensional Approach Social Distancing, Internet Access, and Mobility Effects of the COVID-19 Pandemic on Population Mobility under Mild Policies: Causal Evidence from Sweden Divided We Stay Home: Social Distancing and Ethnic Diversity The State of Social Media in Canada Tracking Public and Private Responses to the COVID-19 Epidemic: Evidence from State and Local Government Actions Facebook Disaster Maps: Aggregate Insights for Crisis Response and Recovery Census Source: Author's calculations. Calculated without weighting CDs by population a given day, divided by all eligible users in that CD. I do not use this measure because there is no pre-COVID level of staying put that I can use as a baseline, unlike the change in movement variable, which compares movements to February baseline levels. 7 Data, as of 8 June 2020, are available at Statistics Canada (n.d.) . 8 This includes certifi cates and diplomas, not only bachelor's degrees and above. 9 Data are publicly available from Canada (n.d.) . 10 The Facebook data have GADM geographical identifi ers.Using GADM data helped to map the GADM identifi ers to the CD codes used by Statistics Canada. 11 Many of the missing data constraints come from the Facebook mobility data, likely due to data privacy issues. 12 The Spearman rank correlation coeffi cients convert all variable values to the rank that each particular observation has in terms of the value of that variable. They then correlate the ranks of two different variables, instead of the actual values. This has the benefi t of producing a correlation matrix that is invariant to monotonic transformations of variables, such as taking logs. The correlation coeffi cients are calculated without weighting for population. 13 These variables' coeffi cients are omitted for brevity.