key: cord-0948822-u06azmnk authors: Santostefano, Christopher M.; White, Elizabeth M.; Feifer, Richard A.; Mor, Vincent title: Accuracy of ICD‐10 codes for identifying skilled nursing facility residents with lab‐confirmed COVID‐19 date: 2021-08-16 journal: J Am Geriatr Soc DOI: 10.1111/jgs.17412 sha: 4cc9dece757c0f2cdefd5784edd478450e697ad4 doc_id: 948822 cord_uid: u06azmnk nan In the absence of widely available SARS-CoV-2 testing data, administrative claims will be important for identifying COVID-19 cases among skilled nursing facility (SNF) residents in future research. However, the validity of International Classification of Diseases, Tenth Revision (ICD-10) codes for identifying residents with lab-confirmed infection has not been established. Using electronic health record (EHR) data, we examined the frequency of COVID-19 ICD-10 documentation among SNF residents with lab-confirmed SARS-CoV-2 infection. We used testing records from the EHR of a large multistate long-term care provider to identify SNF residents with SARS-CoV-2 infection confirmed by polymerase chain reaction or rapid antigen testing between March 16, 2020 and May 26, 2021. We queried residents' EHR problem lists for the primary ICD-10 code U07.1 (COVID-19, virus identified) 1 as well as Z86.16 (personal history of COVID-19), and J12.82 (pneumonia due to COVID-19). 2 We examined variation in documentation of the U07.1 code based on whether residents were symptomatic or asymptomatic at the time of their initial positive test, as derived from change in condition notes completed by nurses. 3 Residents with new symptoms from 5 days before up to 14 days after their initial positive test were considered symptomatic. We also evaluated variation in ICD-10 documentation across months. Finally, we examined the correlation between the initial date of code U07.1 and the resident's first positive test date in order to understand whether the initial date of the ICD-10 code could be used to reasonably approximate infection onset. As of May 26, 2021, 17,194 unique SNF residents had labconfirmed SARS-CoV-2 infection across 294 SNFs in 25 states. Of these, 15,949 residents (92.8%) had ICD-10 code U07.1 in their EHR problem list (Table S1 ). Documentation of U07.1 was slightly more common among symptomatic (7064 of 7449 residents, 94.8%) versus asymptomatic residents (8885 of 9745, 91.2%). One-third of all residents (n = 5703, 33.2%) had code Z86.16 listed, and this was slightly more common among asymptomatic (35.2%) versus symptomatic (30.5%) residents. Only 156 residents (0.9%) had code J12.82 code listed. Expanding the EHR query for U07.1, Z86.16, or J12.82 identified only 76 additional residents with lab-confirmed infection compared with U07.1 alone (93.2% vs 92.8%). The U07.1 code was more often in the EHR of residents who tested positive from March to May 2020 (97.5%) or June 2020 to August 2020 (97.4%). From September 2020 to February 2021, the frequency of U07.1 among positive cases dropped to 88% (Table S2 ). There were 226 cases of COVID-19 from March 2020 to May 2021, of which only 138 (61.1%) had code U07.1 present. Among 15,949 residents with U07.1 in their EHR problem list, the initial date of the code was strongly correlated with the first positive test date identified from testing records (r = 0.99) with a mean difference of 0.16 days (Figure 1 ). Due to insufficient resolution dates in the problem list, it does not appear that ICD-10 codes are able to accurately capture the cases of reinfection. To our knowledge, this is the first attempt to validate ICD-10 codes against clinical testing data for the purpose of identifying COVID-19 cases. These findings from a large multistate sample of SNF residents suggest that the U07.1 code can be used with reasonable accuracy to identify individual cases of COVID-19 in this population. Additional codes, including Z86.16 and J12.82, do not appear to meaningfully improve the sensitivity of detecting lab-confirmed cases. Although these data come from a geographically diverse sample of SNF residents, all of the facilities are owned and operated by a single long-term care provider. Still, because patient-level testing data are not widely available to researchers, these results provide important validation to support the use of ICD-10 codes for future research using claims data. Although we queried the EHR problem list, rather than actual claims files, these are the ICD-10 codes that SNFs submit for clinical reimbursement and are expected to align well with actual claims submissions. Because code U07.1 triggers important reimbursement for SNFs, it is likely given high priority across most long-term care providers. Symptom presentation may have contributed to the lower frequency of U07.1 being coded in Fall 2020/Winter 2021 versus Spring/Summer 2020, since a greater proportion of cases of COVID-19 were asymptomatic in the later months. Another potential explanation could be a lag in the documentation of ICD-10 codes, suggested by the observation that only 61% of cases from March 2020 to May 2021 had U07.1 listed. This temporal variation should be considered when using ICD-10 codes to compare cases at different stages of the pandemic or in settings where high numbers of asymptomatic cases are suspected. Additional supporting information may be found in the online version of the article at the publisher's website. Table S1 . ICD-10 COVID-19 code presence among SARS-CoV-2-positive skilled nursing facility residents, by symptom status Table S2 . Prevalence of ICD-10 COVID-19 codes for skilled nursing facility residents with lab-confirmed SARS-CoV-2 infection, by quarter How to cite this article: Santostefano CM, White EM, Feifer RA, Mor V. Accuracy of ICD-10 codes for identifying skilled nursing facility residents with lab-confirmed COVID-19. J Am Geriatr Soc. 2021;1-3. doi:10.1111/jgs.17412 Centers for Disease Control and Prevention. New ICD-10-CM code for the 2019 novel coronavirus (COVID-19) Correlation between positive SARS-CoV-2 test date and U07.1 ICD-10 code start date. Pearson's correlation coefficient (r) Centers for Disease Control and Prevention. New ICD-10-CM code for the 2019 novel coronavirus (COVID-19) Asymptomatic and presymptomatic severe acute respiratory syndrome coronavirus 2 infection rates in a multistate sample of skilled nursing facilities We thank Jeffrey Hiris from Brown University; and Richard Castor, Cliff Boyd, and Denine Hastings of Genesis HealthCare for their extensive data management support. The authors thank National Institute on Aging, 3P01AG027296-11S1 (PI: Vincent Mor) for funding. Vincent Mor is Chair of the Scientific Advisory Board at NaviHealth, Inc., former Chair of the Independent Quality Committee at HCR ManorCare, and former Director of PointRight, Inc., where he holds less than 1% equity. The other authors have no conflicts of interest to disclose. The National Institute on Aging had no role in the design or conduct of the study; collection, management, analysis, or interpretation of the data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication. Elizabeth M. White https://orcid.org/0000-0003-4175-8662