key: cord-1018130-v47mgalr authors: Reinbold, Gary W. title: Effect of Fall 2020 K-12 Instruction Types on COVID-19 Cases, Hospital Admissions, and Deaths in Illinois Counties date: 2021-05-28 journal: Am J Infect Control DOI: 10.1016/j.ajic.2021.05.011 sha: d96df8f9955effdc3d62e54188f088d4b53c0a2e doc_id: 1018130 cord_uid: v47mgalr BACKGROUND: One of the most difficult public policy decisions associated with the COVID-19 pandemic has been about how to offer K-12 instruction. We sought to determine whether differences in instruction types at the beginning of the 2020-2021 school year were related to differences in COVID-19 cases, hospitalizations, and deaths in Illinois counties during the first three weeks of the school year. METHODS: We divided Illinois counties into three groups based on the instruction type used for a majority of K-12 students at the start of the school year: in-person, hybrid, or online-only. We used synthetic control analysis to match counties between the three groups. RESULTS: Both majority hybrid and majority online-only counties had significantly fewer new cases than majority in-person counties. There were no significant differences in new cases between majority hybrid counties and majority online-only counties or in new hospital admissions or deaths between any of the three county groups. CONCLUSIONS: This paper adds to the growing scientific consensus that at least some forms of in-person K-12 instruction have not contributed significantly to the spread of the pandemic. However, our results suggest that there may be an important difference between fully in-person instruction and hybrid instruction. pandemic in March 2020, most states ordered all public schools to close. Most of the remaining states recommended that all public schools should close, and those recommendations were followed by almost all school districts. Only in Montana and Wyoming did a significant number of public schools not close and switch to entirely online learning (Education Week, 2020a). By the time that schools were preparing to resume instruction in August 2020, the situation had changed. The number of new cases had stabilized in most parts of the country and public health experts had a better understanding of the virus. Only two states required schools to provide only online instruction at the start of the 2020-2021 school year, while four states required schools to provide at least some in-person instruction. The remaining forty-four states left decisions about how to provide K-12 instruction to individual counties, cities, districts, or schools (Education Week, 2020b ). An important question, of course, was whether students could return to classrooms in a way that was safe for students, teachers, other school employees, and their families. In July 2020, the Centers for Disease Control and Prevention (CDC, 2020) recommended that schools be reopened in Fall 2020, based on its conclusions that COVID-19 posed low risks to children and that children were not likely to be major contributors to the spread of the virus. However, the CDC's recommendations were issued after President Trump and Education Secretary DeVos had stated that schools should be reopened and had threatened to withhold federal funding from districts that did not reopen, which led some people to question the basis for the CDC's recommendations (Strauss, 2020) . Because almost all states closed their public schools and moved to entirely online instruction at about the same time in March 2020, it has been difficult to determine how effective those closings were in limiting the initial spread of the pandemic. Some researchers have attempted to estimate the effect of those early closings on the spread of the pandemic by comparing changes in the spread within states or counties before and after the closings occurred (Abouk & Heydari, 2020; Auger et al., 2020; Hsiang et al., 2020; Yehya, Venkataramani, & Harhay, in press ). However, because so many other social distancing restrictions were being imposed at the same time, it is difficult to disentangle the effect of those early closings from the effects of those other measures. Other researchers have attempted to estimate the effect of those early closings based on differences across counties in the timing of the closings (Courtemanche, Garuccio, Le, Pinkston, & Yelowitz, 2020) . That approach is also challenging, because so many other factors were changing so quickly during the first few months of the pandemic, such as other social distancing measures, testing capacity, and knowledge about the virus. During the 2020-2021 school year, there has been much greater variation in K-12 instruction types, not only across states, but also across school districts and schools within states. However, it is still difficult to determine the effects of different instruction types on the spread of the pandemic for two reasons. First, it is difficult to find accurate data for the instruction type being used by each district or school at any point in time. Second, districts and schools have changed their instruction types frequently as community metrics have changed and knowledge about the virus has increased. Two working papers have attempted to estimate the effect of inperson schooling during Fall 2020 on the spread of the pandemic using community data. Goldhaber et al. (2020) used data from monthly surveys of school districts in Michigan and Washington in Fall 2020. Harris, Ziedan, and Hassig (2021) used data from Burbio and MCH Strategic Data on instruction types at the start of the 2020-2021 school year in selected school districts nationally. Two CDC-sponsored studies have attempted to estimate the effect of inperson schooling during Fall 2020 on the spread of the pandemic using individual data. Hobbs et al. (2020) used individual data for 397 children in Mississippi who took COVID-19 tests between September 1, 2020 and November 5, 2020 to assess whether various factors, including in-person school or child-care attendance, were associated with a positive test result. Falk et al. (2021) used data from 17 K-12 schools in rural Wisconsin to assess likely transmission locations for students and staff with confirmed COVID-19 cases during 13 weeks of in-person instruction. We sought to determine whether differences in K-12 instruction types at the beginning of the 2020-2021 school year in Illinois school districts were related to differences in COVID-19 cases, hospitalizations, and deaths in Illinois counties during the three-week period from August 24, 2020 to September 13, 2020. For data on instruction types, we started with a dashboard on the Illinois State Board of Education (ISBE) website with the results of the ISBE's July 2020 survey of the 852 school districts in Illinois about their plans for instruction to start the school year (ISBE, 2020). However, we observed that some dashboard data did not accurately reflect instruction types used by districts at the start of the 2020-2021 school year. Therefore, we updated the ISBE dashboard data for the 200 largest districts in Illinois and, to the extent they were not included among the 200 largest districts, for the largest district in each county, by searching district websites and news sources. We coded the districts based on the instruction type that they were using at the beginning of the school year. When a district had different instruction types for different grades, we coded the district with the least restrictive instruction type being used (in-person is the least restrictive type, then hybrid, then online-only). Because COVID-19 case and death data are not available at the school district level, all of our analyses are at the county level. We first divided the counties into three groups: counties where a majority of the students attended districts with primarily in-person instruction, counties where a majority of the students attended districts with primarily hybrid instruction, and counties where a majority of the students attended districts with only online instruction. We excluded Cook County, because it is such an outlier among Illinois counties in so many respects that it was not possible to match it closely with other Illinois counties on the predictor variables that we used, and we excluded six counties with relatively large numbers of college students (Champaign, Coles, DeKalb, Jackson, McDonough, and McLean Counties) , because the resumption of college classes contributed to case spikes in many college towns. Of the 95 remaining counties, 41 were majority in-person counties, 32 were majority hybrid counties, and 17 were majority online-only counties; 5 counties did not have a majority of students in any of the three categories. We obtained daily COVID-19 case and death data for each county from the New York Times (2021). We obtained data on weekly confirmed and suspected COVID-19 hospital admissions from the U.S. Department of Health and Human Services (2021). The hospital admissions data did not include counties of residence for the admitted patients so we used the counties in which the hospitals are located, which is not ideal because some people may have to travel to a neighboring county for hospital access. Also, the hospital admissions data were updated only weekly, reflecting admissions from Friday to Thursday, unlike the case and death data, which were updated daily. Finally, we obtained county-level data on the demographic covariates in our models from various U.S. Census Bureau (2020a, 2020b) and U.S. Department of Agriculture (2020a, 2020b, 2020c, 2020d) datasets. All of the covariates were measured in 2018. Table 1 shows descriptive statistics for the variables included in our analyses. Many Illinois schools started a week or two later than usual in 2020, with most schools starting sometime between August 24 and September 2. Therefore, we used August 24 as the treatment date in our analyses. Although a few schools started earlier than August 24, it usually takes at least a few days for COVID-19 cases to be detected and reported. Therefore, we would not expect school reopenings to have noticeably affected reported COVID-19 case numbers until at least August 24; we would not expect school reopenings to have noticeably affected reported COVID-19 hospitalization or death numbers until at least a few more days after August 24. We first used linear regressions to identify the demographic variables that were significant predictors of new COVID-19 cases or deaths in Illinois counties, using data from August 3, 2020 through September 13, 2020. We considered 23 different demographic variables in those regressions and ultimately selected 10 predictor variables. In addition to those 10 predictor variables, which are listed in Table 1 , the other variables that we considered but that were not significant predictors were percent of population who are Asian, percent of population who are black, percent of population with less than a high school education, percent of population who are over age 17, percent of population who are over age 64, percent of population who are over age 74, percent of population who are over age 84, population, land area, population density, unemployment rate, metro code, and rural-urban code. For our main analyses, we used the synthetic control method to match counties from each group with counties from each other group on the predictor variables, including a pretreatment value of the outcome variable averaged over the 7 days prior to August 24, 2020. Synthetic control analysis constructs a synthetic control unit for each treated unit by finding a weighted combination of control units that matches the treated unit as closely as possible on the pretreatment averages of the predictor variables. An advantage to synthetic control analysis is that a weighted combination of control units can often provide a better match for a treated unit than any individual control unit or even than an average of two or more control units. However, unlike with our nearest neighbor matching that we describe below, we were not able to adjust in our synthetic control analysis for any remaining bias due to differences in predictor variable averages between the treated units and their synthetic control units. We used three other methods as robustness checks for our synthetic control results. For all of these analyses, we compared a three-week post-treatment period from August 24 to September 13, with a three-week pre-treatment period from August 3 to August 23. First, we used nearest neighbor matching to match counties from each group with the three nearest counties from each other group within a caliper of 5; counties that did not have three neighbors in the other group within a caliper of 5 were excluded from the analysis. We used a bias adjustment in our nearest neighbor matching analyses, which adjusts the difference in the outcome variable values between matched units to account for differences in the values of their predictor variables, and we used heteroscedasticity robust standard errors. Second, we used difference-in-differences to compare the post-treatment versus pre-treatment outcome variable difference for each county group with that same difference for each other county group. In the difference-in-differences analyses, we adjusted for differences in predictor variable values and again used heteroscedasticity robust standard errors. Third, we used two-stage propensity score regressions to compare each county group with each other group. In the first stage, we estimated the likelihood of each county being in the more restrictive county group in the comparison using a logistic regression; in these regressions, we used a slightly different set of 11 demographic predictor variables that were significant predictors of instruction types at the start of the school year, including the pre-treatment cases and deaths values. In the second stage, we estimated the effect of being in the more restrictive group on the post-treatment outcome variable value, controlling for the estimated propensity score and the original 11 predictor variables, including the pre-treatment outcome variable value. We conducted three synthetic control analyses each for cases, hospital admissions, and deaths: an analysis that compared majority in-person counties with majority hybrid counties, an analysis that compared majority hybrid counties with majority online-only counties, and an analysis that compared majority in-person counties with majority online-only counties. Table 2 shows the results of those nine analyses. Table 2 shows the estimated effect of being in the more restrictive group on the number of new cases, hospital admissions, or deaths per 100,000 people; negative numbers indicate that the more restrictive instruction type resulted in fewer cases, admissions, or deaths. Day Majority in-person (n = 41) vs. majority hybrid (n = 32) Majority hybrid (n = 32) vs. majority online-only (n = 17) Majority in-person (n = 41) vs. majority online-only (n = 17) Estimated effect of having 50% or more of county students in hybrid districts (as compared with having 50% or more in in-person districts) on 7-day average per 100,000 people for: Estimated effect of having 50% or more of county students in onlineonly districts (as compared with having 50% or more in hybrid districts) on 7-day average per 100,000 people for: Estimated effect of having 50% or more of county students in onlineonly districts (as compared with having 50% or more in in-person districts) on 7-day average per 100,000 people for: Table 3 summarizes the results of the additional analyses that we conducted to estimate the effects of the different county groups on average new cases during the three-week posttreatment period from August 24 to September 13, as compared with the three-week pretreatment period from August 3 to August 23; we also conducted additional analyses for hospital admissions and deaths, but those results were never statistically significant, so we do not present them. The synthetic control estimates in Table 3 differ from the estimates for September 13 in Table 2 , because the estimates in Table 2 are one-week averages while the estimates in Table 3 are three-week averages. In the synthetic control estimates in Table 3 , having a majority of students in hybrid districts resulted in 8.51 fewer new daily cases per 100,000 people over the post-treatment period than having a majority of students in primarily in-person districts, which represents a 32 percent reduction. The other methods all produced smaller estimates of that effect, with the estimates ranging from 1.69 to 6.65 fewer new daily cases per 100,000 people for the majority hybrid counties. Having a majority of students in online-only districts resulted in 0.89 fewer new daily cases per 100,000 people over the post-treatment period than having a majority of students in hybrid districts in the synthetic control estimates. The other methods all produced somewhat larger estimates of that effect, but none of the estimates was nearly statistically significant. Having a majority of students in online-only districts resulted in 10.13 fewer new daily cases per 100,000 people over the post-treatment period than having a majority of students in primarily in-person districts in the synthetic control estimates, which represents a 41 percent reduction although the effect was not quite statistically significant. The other methods all produced smaller estimates of that effect, with the estimates ranging from 5.50 to 6.21 fewer new daily cases per 100,000 people for the majority online-only counties. Majority in-person (n = 41) vs. majority hybrid (n = 32) Majority hybrid (n = 32) vs. majority onlineonly (n = 17) Majority in-person (n = 41) vs. majority onlineonly (n = 17) Estimated effect of having 50% or more of county students in hybrid districts (as compared with having 50% or more in inperson districts) on average new daily cases per 100,000 people from August 24 to September 13 Estimated effect of having 50% or more of county students in online-only districts (as compared with having 50% or more in hybrid districts) on average new daily cases per 100,000 people from August 24 to September 13 Estimated effect of having 50% or more of county students in online-only districts (as compared with having 50% or more in inperson districts) on average new daily cases per 100,000 people from August 24 to September 13 Synthetic control -8.51 † -0.89 -10.13 Nearest neighbor matching -6.65* ‡ -3.58 § -6.21 ¶ Difference-in-differences -1.69 -3.96 -5.65 † Propensity score regression -5.35 -2.06 -5.50 * p < .05; † p < .10. ‡ Two majority in-person counties and six majority hybrid counties were excluded from the analysis because they did not have three matches in the other group within the caliper of 5. § Four majority hybrid counties were excluded from the analysis because they did not have three matches in the other group within the caliper of 5. ¶ Twenty-eight majority in-person counties and five majority online-only counties were excluded from the analysis because they did not have three matches in the other group within the caliper of 5. For the comparison of cases in majority in-person counties and majority hybrid counties, the effect in Table 2 followed a U-shaped pattern, with no effect at the beginning of the period, a statistically significant negative effect in the middle of the period, and no effect at the end of the period. This U-shaped effect was not unexpected. It is not surprising that the difference between in-person and hybrid instruction did not begin to affect the number of reported cases for several days. It is also not surprising that, after several days, majority hybrid counties had significantly fewer reported cases than their synthetic control units of majority in-person counties, because inperson instruction generally involves having almost all students at school at the same time, while hybrid instruction usually means having less than half of students at school at a time. Finally, it is not surprising that the effect disappeared toward the end of the period, because many in-person districts that experienced school or community outbreaks changed quickly to hybrid or onlineonly instruction and some hybrid districts with favorable community metrics changed to inperson instruction. So, there was a significant amount of crossover between treated and control counties by the end of the period. The comparison of cases in majority in-person counties and majority online-only counties also showed a U-shaped effect pattern, although even on the first day of the period, majority online-only counties already had significantly fewer reported cases than their synthetic control units of majority in-person counties. It was also not surprising that there was not a significant effect on hospital admissions or deaths in any of the comparisons. The population of students, teachers, and other employees at K-12 schools are generally not in the high-risk groups for severe complications from the virus as some populations are, such as nursing facility residents. Although there are undoubtedly some students, teachers, and other school employees who are in high-risk groups or who have immediate family members in those groups, many of those people likely chose not to participate in in-person classes in in-person or hybrid districts. The most unexpected result was that there was not a significant effect on reported cases in the comparison of majority hybrid counties and majority online-only counties in any of our analyses. That result suggests that hybrid instruction did not contribute significantly to the spread of the pandemic. As noted above, hybrid instruction generally involves having fewer than half of the students in each class attend school at a time, which may allow the students and teachers to maintain an adequate distance to prevent significant transmission of the virus. Of course, screening and contact tracing likely also contributed, by keeping potentially infected people out of the classrooms and quickly controlling any outbreaks that did occur through quarantines. This study had some important limitations. We were able to consider only a three-week period because there was a significant amount of crossover among the three county groups by the end of that period as districts changed instruction types to respond to community conditions. For a similar reason, we were able to consider only the period at the start of the school year, when COVID-19 case rates were relatively low in Illinois compared to some other periods during the 2020-2021 school year; districts changed instruction types so frequently after the first couple of weeks of the school year that it would be difficult to find a period with stable instruction types across districts later in Fall 2020. We were not able to consider differences by grade level; a few districts had different instruction types for different grades, but there was not enough variation across grade levels within districts to estimate the effects of those differences. Although we attempted to verify and correct the data on the ISBE dashboard for the 200 largest districts in Illinois and the largest district in each county, our data on school district instruction types at the start of the school year likely still had some inaccuracies, which may have affected our results. As we discussed above, the hospital admissions data that we used were not ideal, because they were available only weekly and reflected the location of the hospital rather than the residence of the patient. Finally, we cannot explain why majority in-person counties had more cases per capita than majority in-person and majority hybrid counties; in-person schooling did not necessarily result in more transmission among students, because resuming in-person schooling also enabled more adults to return to workplaces (including schools) and other activities, which may have contributed to the higher case numbers. This paper adds to the growing scientific consensus that at least some forms of in-person K-12 instruction have not contributed significantly to the spread of the pandemic, including all four of the studies using data from Fall 2020 that we cited above (Falk et al., 2021; Goldhaber et al., 2020; Harris et al. 2021; Hobbs et al., 2020) . There is still uncertainty, however, about the specific circumstances under which in-person K-12 instruction can safely be conducted. The CDC has continued to recommend that decisions about instruction types should depend on the level of transmission in the community (CDC, 2021) and two of those four prior studies suggested that in-person schooling may contribute more to the spread of the pandemic in communities with higher baseline levels (Goldhaber et al., 2020; Harris et al., 2021) . Under the CDC's recommendations, as of March 2021, only four percent of children in the United States lived in counties with transmission levels low enough for full-time in-person learning (Keefe, 2021) . However, many experts believe that the CDC has been too slow to adjust its recommendations and that in-person schooling could safely be conducted more widely, regardless of community metrics (Allen & Jenkins, 2021) . We were not able to control for preexisting transmission levels in this study, but our results suggest that there may also be an important difference between fully in-person instruction and hybrid instruction. Illinois counties that started the 2020-2021 school year with a majority of students in hybrid districts had 32 percent fewer new daily cases during the first three weeks of the school year than their synthetic control units of counties that started the school year with a majority of students in in-person districts. The immediate effect of COVID-19 policies on social distancing behavior in the United States. Unpublished manuscript Opinion: The CDC's latest demands will keep millions of kids out of school unnecessarily. The Washington Post Association between statewide school closure and COVID-19 incidence and mortality in the US Operating schools during COVID-19 K-12 school operational strategy Strong social distancing measures in the United States reduced the COVID-19 growth rate Map: Coronavirus and school closures in 2019-2020 Map: Where are schools closed? COVID-19 cases and transmission in 17 K-12 schools -Wood County To what extent does in-person schooling contribute to the spread of COVID-19? The effects of school reopenings on COVID-19 hospitalizations Factors associated with positive SARS-CoV-2 test results in outpatient health facilities and emergency departments among children and adolescents aged < 18 years -Mississippi The effect of large-scale anti-contagion policies on the COVID-19 pandemic See how districts are providing instruction: Virtual, inperson, and blended learning dashboard Should your school be fully open? Here's what the C.D.C. says. The New York Times An ongoing repository of data on coronavirus cases and deaths in the Democratic lawmakers probing whether CDC guidelines on reopening schools were influenced by political pressure. The Washington Post Annual county resident population estimates by selected age groups and sex Annual county resident population estimates by age, sex, race, and