key: cord-0443029-u1fnrf6i authors: Horng, Steven; O'Donoghue, Ashley; Dechen, Tenzin; Rabesa, Matthew; Shammout, Ayad; Markson, Lawrence; Jegadeesan, Venkat; Tandon, Manu; Stevens, Jennifer P. title: Secondary Use of Employee COVID-19 Symptom Reporting as Syndromic Surveillance as an Early Warning Signal of Future Hospitalizations date: 2020-12-10 journal: nan DOI: nan sha: 1a46f7fea41bd05ec919b0b5b356c0dcb9c734b9 doc_id: 443029 cord_uid: u1fnrf6i Importance: Alternative methods for hospital utilization forecasting, essential information in hospital crisis planning, are necessary in a novel pandemic when traditional data sources such as disease testing are limited. Objective: Determine whether mandatory daily employee symptom attestation data can be used as syndromic surveillance to forecast COVID-19 hospitalizations in the communities where employees live. Design: Retrospective cohort study. Setting: Large academic hospital network of 10 hospitals accounting for a total of 2,384 beds and 136,000 discharges in New England. Participants: 6,841 employees working on-site of Hospital 1 from April 2, 2020 to November 4, 2020, who live in the 10 hospitals' service areas. Interventions: Mandatory, daily employee self-reported symptoms were collected using an automated text messaging system. Main Outcomes: Mean absolute error (MAE) and weighted mean absolute percentage error (WMAPE) of 7 day forecasts of daily COVID-19 hospital census at each hospital. Results: 6,841 employees, with a mean age of 40.8 (SD = 13.6), 8.8 years of service (SD = 10.4), and 74.8% were female (n = 5,120), living in the 10 hospitals' service areas. Our model has an MAE of 6.9 COVID-19 patients and a WMAPE of 1.5% for hospitalizations for the entire hospital network. The individual hospitals had an MAE that ranged from 0.9 to 4.5 patients (WMAPE ranged from 2.1% to 16.1%). At Hospital 1, a doubling of the number of employees reporting symptoms (which corresponds to 4 additional employees reporting symptoms at the mean for Hospital 1) is associated with a 5% increase in COVID-19 hospitalizations at Hospital 1 in 7 days (95% CI: (0.02, 0.07)). Conclusions: We found that a real-time employee health attestation tool used at a single hospital could be used to predict subsequent hospitalizations in 7 days at hospitals throughout a larger hospital network in New England. COVID-19 has created a challenge for many businesses --to identify and limit employees sick with COVID-19 to mitigate the spread of infections. However, most organizations have limited access to surveillance testing for the disease. As a surrogate for testing, multiple businesses, including hospitals, have required that employees report any COVID-19-associated symptoms and to direct symptomatic employees to obtain follow up testing. Symptom reporting tools may also have additional secondary benefits. The Department of Veterans Affairs described how their patients' use of a symptom monitoring tool for COVID-19 improved a sense of connection, 1 suggesting such employee attestation strategies may have additional benefits to healthcare organizations. Prior efforts to have employees self-identify have been met with varied success. 2, 3 One critical weakness of symptom-only reporting is the infectivity of asymptomatic employees 4 , who would only be identified with a larger surveillance testing strategy. As members of a community, however, hospital employees' routine reporting of symptoms could serve as a surrogate of symptom reporting for their community as a whole. Given that prior research has noted that community spread of COVID-19 makes up the bulk of the burden of new infections in healthcare settings 5 , employee attestations of symptoms may also have a secondary use as syndromic surveillance to estimate the incidence and prevalence of infections in the communities where employees live. We hypothesized that an employee symptom reporting tool at a single academic medical center could be used as syndromic surveillance and forecast subsequent hospital admissions for COVID-19 in the communities where employees live. This study is a retrospective cohort study and was determined to be exempt by our institutional review board with a waiver of informed consent. The study was performed in a large hospital system in Massachusetts, containing 10 hospitals from April 2, 2020, to November 4, 2020. The hospitals are numbered according to the number of unique employees living in that service area that are filling out the attestation during our sample period (ie: Hospital 1 has the most employees living in their service area that fill out the employee attestation form while Hospital 10 has the fewest). Hospital 1 is a tertiary, academic, teaching hospital in Boston, MA, with 719 staffed beds and 40,000 annual inpatient discharges. Hospital 2: 208 staffed beds, 13,000 annual inpatient discharges; Hospital 3: 102 staffed beds, 6,000 annual inpatient discharges; Hospital 4: 345 staffed beds, 24,000 annual inpatient discharges; Hospital 5: 164 staffed beds, 11,000 annual inpatient discharges; Hospital 6: 58 staffed beds, 2,500 annual inpatient discharges; Hospital 7: 364 staffed beds, 19,500 annual inpatient discharges; Hospital 8: 205 staffed beds, 12,000 annual inpatient discharges; Hospital 9: 140 staffed beds, 5,500 annual inpatient discharges; Hospital 10: 79 staffed beds, 2,500 annual inpatient discharges. Employees were included if they were employed at the main hospital and were working onsite that day. Employees were excluded if they lived outside of any of the hospital network's service area or if they were not working on-site that day. Attending physicians were members of a separate physicians organization and were excluded. Self-reported symptoms were collected using an automated text messaging system ( Figure 1 ). Employees received a text message each morning, asking them to complete the daily symptom monitoring assessment. Employees were first asked whether they will be working on-site that day. Only employees who report that they will be working on-site are prompted to fill out the symptom reporting form, where they are asked if they are experiencing any covid-related symptoms from a list of 12 symptoms. If yes, they are asked to report the specific symptom from the list of 12 that they are experiencing. Our primary outcome was the daily number of COVID-19 hospitalized patients at each of the 10 hospitals in a large New England hospital network. Our secondary outcome was the weekly COVID-19 positive cases within each of the 10 hospital service areas in a large New England hospital network. This outcome has a reduced time period for analysis because of data availability from the state reporting agency. The role of the employee (RN, Operations, Administrative Support, Research, Clinical Technician, Housestaff/Fellows/Residents/Interns, Clinical Assistants, and Other), age, years of service, sex, race (White, Black, Asian, Hispanic, Other), and zip code were collected. We collected the number of employees reporting any covid-related symptom each day, grouped by employee home zip code. Employees' home zip codes were then matched to the service areas of hospitals within the hospital network. Employees were therefore matched to the hospital nearest to which they live, rather than the hospital at which they work. The data were smoothed using a 7 day moving average. The data were then transformed by taking the natural logarithm of the data. We report means and standard deviations of continuous variables and counts and percentages of categorical variables. We use an econometric time series forecasting model described by Dumitrescu and Hurlin 6 to model each hospital as a cross-section of a panel, and test for Granger non-causality in this heterogeneous panel. This type of model has previously been used to forecast COVID-19 cases at the country-level using Google search trends. 7 In this framework, a linear autoregressive model is used, allowing coefficients to differ across hospitals in the panel, but fixed over time. Holding the number of lags constant across hospitals, we select the optimal number of lags that minimizes the Bayesian information criterion and forecast future COVID-19 hospitalizations at each network hospital. We forecast 7 days into the future and we calculate the mean absolute error and mean absolute percentage error of our forecast to measure the accuracy of our model for the network. There are no missing days of hospital census data or employee symptom reporting data. P ≦ 0.05 was considered statistically significant and all tests were 2 tailed. Stata SE version 16 (StataCorp) was used for statistical analysis. Summary statistics about the employees are reported in Table 1 . We studied a total of Using the Dumitrescu and Hurlin (2012) test 6 for Granger non-causality in heterogeneous panel models on our full sample of 10 hospitals, we reject the null hypothesis that the number of employees reporting symptoms is not useful for forecasting future COVID-19 hospitalizations at any of the in-network hospitals (P < 0.001). We find that the optimal number of lags is 7 days, which minimizes the Bayesian information criteria. We estimate a multivariable autoregressive linear regression model that includes each hospitals' daily COVID hospital census, and the number of employees reporting symptoms in each hospital's service area, with a lag of 7 days, to predict the daily COVID hospital census for each hospital over 209 days. The coefficient on Hospital 1's 7-day lag is 0.05 (95% CI: (0.02, 0.07); P < 0.001), which can be interpreted as meaning that twice as many employees reporting symptoms today is associated with a 5% increase COVID-19 hospitalizations in 7 days. At the mean for Hospital 1, this would correspond to 4. In Table 2 , we report the mean absolute error (MAE) and weighted mean absolute percentage error (WMAPE) by hospital for forecasts from November 5 to November 11. The As a secondary analysis, we also tried to predict positive cases, not just hospitalizations. We do this because the lag time between symptom onset and covid-19 test positivity is shorter than symptoms onset and hospitalization. However, this method relies more heavily on adequate and accessible testing, which is why hospitalizations are our preferred outcome measure of interest. Further, the state reporting agency only reports weekly case numbers by town, and only began reporting those numbers at the end of April. Thus, this analysis is done on a reduced sample at the week-level. From a Granger noncausality test for panel data, we cannot reject the null hypothesis that employees reporting symptoms is not useful for predicting changes in positive cases in the employees' home community (P = 0.15) With inconsistent access to broader testing that has varied over the course of the COVID-19 pandemic, many hospitals have relied on employees to accurately identify themselves as having symptoms of COVID-19 infections. In this study, we found a secondary use of these data to predict future hospitalizations in 7 days in the hospitals which service the communities in which these employees lived. There is considerable prior work in epidemiological modelling as well as forecasting. In traditional epidemiological modelling, the Susceptible-Exposed-Infected-Recovered (SEIR) is one of the most common approaches. In June, several prominent SEIR national models averaged an mean absolute percentage error of 32.6%. 8 Another common forecasting approach is to use data from earlier outbreaks, adapting another locations curve to the local area, as the University of Washington's Institute for Health Metrics and Evaluation (IHME) model had done, yielding the lowest MAPE during this time period at 20.2%. 8 However, these are national models that predict further (10 weeks) into the future and predict deaths due to COVID-19. Our model is very localized to a specific area, predicts only into the immediate future (1 week) and forecasts hospitalizations, not deaths. The dynamic, rapidly changing COVID-19 policies for social distancing are challenging for these alternative approaches as they are highly sensitive to any change in transmissibility. In both of these alternative approaches, modellers must estimate not only the effect of any of these policy changes on the transmissibility of COVID-19, but also the rigor of their enforcement and adherence by the community. On the other hand, our approach requires no knowledge or estimation of ongoing policies. We instead use an early signal of community spread, employee reported symptoms, to predict future hospitalizations. This restricts our predictive capability to only the near future (7 days) which may not be sufficient for all purposes, but is sufficient to activate initial hospital and network emergency surge protocols. For example, New York state has mandated hospitals have a surge and flex response plan to expand operational bed capacity by 50% within 7 days. 9 In future work, we plan to investigate other non-traditional data sources used previously in syndromic surveillance that may also act as surrogates for community spread. For example, when one is symptomatic with other types of infectious diseases such as influenza, one may make purchases of items to manage those symptoms such as facial tissues, orange juice, and other over-the-counter products. 10 It is plausible that other overthe-counter remedies such as tylenol, aspirin, and vitamin D that may be associated with COVID-19 could also be used. We also plan to investigate alternative methodological approaches. For example, we currently allow hospitals to have individual coefficients, independent of other hospitals. The coefficients are unlikely to be exactly the same, as hospitalization rates are likely to differ across communities, reflecting the diversity of demographics and underlying risk factors in different communities. However, these coefficients are unlikely to be entirely independent, so some soft parameter sharing across hospitals may be warranted. Our study does have several notable limitations. First, our estimate of COVID-19 daily hospital census represents the number of confirmed cases at each hospital. Thus, this estimate relies on testing data. Second, our independent variable relies on self-reported symptoms of employees reporting on-site for work. This depends on employees diligently filling out the symptom reporting and being honest about their experienced symptoms. While every effort is made to ensure all employees reporting to work are submitting an employee attestation form before they arrive, some employees may be on-site without filling out the employee attestation form. Employees may inconsistently report symptoms based on day of week, access to sick leave, and individual level of concern, all which create different patterns of missingness and bias in the attestation data itself. However, our findings suggest that despite these biases, employee reported symptoms still provide an adequate signal. In conclusion, we find that daily employee symptom reporting can be a useful tool for predicting future COVID-19 hospitalizations in hospital networks. Veterans' response to an automated text messaging protocol during the COVID-19 pandemic A Web-Based, Mobile-Responsive Application to Screen Health Care Workers for COVID-19 Symptoms: Rapid Design, Deployment, and Usage Implementation of a digital chatbot to screen health system employees during the COVID-19 pandemic Asymptomatic SARS Coronavirus 2 infection: Invisible yet invincible Epidemiology of and Risk Factors for Coronavirus Infection in Health Care Workers: A Living Rapid Review Testing for Granger non-causality in heterogeneous panels Tracking COVID-19 using online search. arXiv [csSI IHME COVID-19 Forecasting Team. Modeling COVID-19 scenarios for the United States Implementing syndromic surveillance: a practical guide informed by the early experience