key: cord-0694595-kb2o58hj authors: Symanski, Elaine; Ensor, Katherine B; Piedra, Pedro A; Sheth, Komal; Caton, Kelsey; Williams, Stephen L; Persse, David; Banerjee, Deborah; Hopkins, Loren title: Population-Based Estimates of SARS-CoV-2 Seroprevalence in Houston, TX as of September 2020 date: 2021-04-29 journal: J Infect Dis DOI: 10.1093/infdis/jiab203 sha: 449bc399d14f8e1590e12162daab9146b0f9b990 doc_id: 694595 cord_uid: kb2o58hj BACKGROUND: In contrast to studies that relied on volunteers or convenience sampling, there are few population-based SARS-CoV-2 seroprevalence investigations and most were conducted early in the pandemic. The health department of the fourth largest city in the U.S. recognized that sound estimates of viral impact were needed to inform decision-making. METHODS: Adapting standardized disaster research methodology in September 2020, the city was divided into high and low strata based on RT-PCR positivity rates, and census block groups within each stratum were randomly selected with probability proportional to size, followed by random selection of households within each group. Using two immunoassays, the proportion of infected individuals was estimated for the city, as well as by positivity rate and by sociodemographic and other characteristics. The degree of under ascertainment of seroprevalence was estimated based on RT-PCR positive cases. RESULTS: Seroprevalence was estimated to be 14% with a near two-fold difference in areas with high (18%) versus low (10%) RT-PCR positivity rates and was four times higher compared to case-based surveillance data. CONCLUSIONS: Seroprevalence was higher than previously reported and is greater than that estimated from RT-PCR data. Results will be used to inform public health decisions about testing, outreach, and vaccine rollout. The Coronavirus Disease 2019 , caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, has led to significant morbidity and mortality worldwide. [1] Real-time reverse-transcriptase polymerase chain reaction (RT-PCR) testing measures viral RNA during an acute infection. It is critically important for diagnosing individuals with acute SARS-CoV-2 infection but provides limited information about the extent of infection in a population because testing protocols are highly variable over time [2] and RT-PCR positivity generally declines a few weeks after the onset of symptoms as viral RNA is cleared. [3, 4] In contrast, serologic testing measures antibodies against the infection, but requires 7-14 days post-infection to generate a measurable antibody response. [5] Further, for identifying acute infection, it also suffers from testing protocols that are highly variable and, if performed too early into the infection, can yield false negative results. [5] On the other hand, serological testing for SARS-CoV-2 has advantages for estimating cumulative prevalence of infection (but not for identifying acute infection) because it captures infected individuals who were tested, as well as those who were not because they had mild symptoms or were asymptomatic. Owing to the potential for underestimating infection based on case-based surveillance, seroprevalence surveys can inform both the epidemiology of SARS-CoV-2 and future actions of public health officials. Further, they may provide information on the proportion of the population that remains susceptible to the virus should antibodies provide immunity. [6] A c c e p t e d M a n u s c r i p t Estimates of SARS-CoV-2 infection in the U.S. using antibody testing have varied widely, due to differences in study design (e.g., employing random or convenience sampling) and locale, and were largely conducted early in the pandemic. A key advantage of random sampling is that meaningful inferences can be made about the target population, whereas studies that rely on convenience sampling are often less expensive and easier to implement but suffer potentially from selection bias. In a statewide study that used convenience sampling of New Yorkers visiting grocery stores in April 19-28, 2020 Jefferson Parishes, Louisiana (6.9%, 95% CI: 6.8-6.9%) [13] . Population-based studies with random sampling of participants were also conducted in Europe around the same time as the U.S. investigations. For example, in Geneva, Switzerland during April 6 to May 9, 2020, seroprevalence ranged by week of sampling from 4.8% (95% CI 2.4-8.0%) in the first week to 10.8% (8.2-13.9%) in the fifth week. [14] A c c e p t e d M a n u s c r i p t In Spain, seroprevalence during April 27 to May 11, 2020 was 4.6% (95% CI: 4.3%-5.0%). [6] Based on estimates from PCR testing results in July 2020, Houston, the most populous city in Texas and the fourth largest city in the U.S., had reached a test positivity rate of 28% and, by September 1, 2020, had 75,207 RT-PCR-confirmed cases (3,241 cases per 100,000 population). Given the absence of population-level estimates of SARS-CoV-2 infection in the city, the Houston Health Department (HHD) in collaboration with academic partners at Baylor College of Medicine (BCM) and Rice University designed a population-based seroprevalence investigation and, herein, we report on the results of seroprevalence data that were collected on a representative sample of Houstonians in September 2020. This investigation was determined by HHD to be a public health surveillance activity and was exempted from human subjects review. The Community Assessment for Public Health Emergency Response (CASPER) methodology, [15] which applies a two-stage clustered sampling design, was adapted to estimate seroprevalence in the city. In total, 1,109 census block groups that fell within HHD's jurisdiction comprised the sampling frame. Data on occupied households within each block group were obtained from the 2010 U.S. Census; [16] where necessary, contiguous block groups were combined so that each group contained a minimum of 15 occupied households. Nasal swab RT-PCR results as of July 20, 2020, available at the address level, were used to compute the cumulative A c c e p t e d M a n u s c r i p t positivity rate by block group and then two strata were created based on the median value (20.4%). For each stratum separately, 30 block groups were randomly selected with a probability proportional to the number of occupied households and with replacement (see Figure 1 ). The targeted enrollment was 7 households per census block group (420 households across the 60 block groups). Initially, a listing of 21 households was randomly generated for each census block group to be visited in order. If the initial list of 21 households was exhausted because individuals were not at home or were not interested in participating, another listing of households was randomly generated until the target number of 7 households in each census block group was met. In each household, all residents, ages 5 years and older, were invited to participate. At the HHD laboratory, samples were centrifuged, and serum was poured into 12x75mm polypropylene tubes, refrigerated, and tested within 72 hours of collection. Samples were analyzed for the presence of COVID-19 antibodies using the Platelia SARS-CoV-2 Total Ab assay (BioRad© Laboratories, Inc., Hercules, California), [17] which is a qualitative Enzyme-linked Immunoassay (ELISA) that detects the presence of IgM, IgA, and IgG antibodies to SARS-CoV-2 (Immunoassay 1). The assay uses a recombinant SARS nucleocapsid protein and employs the principle of one-step antigen capture. The assay was performed on the EVOLIS© automated analyzer according to the manufacturer's package insert. Remnant samples were separated into two aliquots and frozen at -20 0 C. The sensitivity and specificity of the test, respectively, are 100% and 98.9%. [17] The sample aliquots analyzed by the BCM laboratory were tested using an in-house optimized IgG anti-S ELISA (Immunoassay 2). In brief, SARS-CoV-2 full S glycoprotein (Novavax, Gaithersburg, MD) was coated onto Immulon 2HB 96-well plates, and the plates were kept moist at 4°C overnight. The following day, the plates were washed and blocked with 5% milk in 1X KPL for one hour in the incubator at 36°C. A standard curve was generated with rabbit SARS-CoV-2-S1 IgG monoclonal antibody in each test plate. Three test samples were added to each plate in A c c e p t e d M a n u s c r i p t duplicates and, diluted two-fold across the plate and incubated for one-hour. The plates were washed with KPL wash solution and the appropriate horseradish peroxidase (HRP) -conjugated anti-rabbit IgG streptavidin and HRP conjugated antihuman IgG were added at a 1/2000 dilution in 1X KPL to the standard control wells and test sample wells, respectively. After a 1 one-hour incubation period, the plates were washed in KPL wash, followed by addition of substrate (TMB: 3,3′,5,5′tetramethylbenzidine) for 20 minutes, and the reaction was stopped with 0.16M sulfuric acid. The plates were read at wavelength 450 nm. A four-parameter logistic regression model was used to calculate the relative binding antibody concentrations (μg/mL). The lower limit of detection was 1 μg/mL. The sensitivity and specificity of the assay are 94% and 99%, respectively. The age, sex, and racial/ethnic distributions of participants were compared with those of the target population using chi-squared tests. The overall estimate was a weighted average of the within stratum estimates, with weights determined by the number of census block groups within each one. Sample strata weights [18] [19] [20] were further adjusted to match population demographic characteristics, [21] by gender, race/ethnicity, and age group, and for multiple observations per household. Two immunoassays were used to capture unique (one test was positive) and overlapping (both tests were positive) immune responses following SARS-CoV-2 infection. Immunoassay 1 detected antibody responses to the nucleocapsid protein, the major structural protein of SARS-CoV-2, while Immunoassay 2 measured antibody response to the spike protein, the major surface glycoprotein that induces protective neutralizing antibody. The use of both assays provides a more complete assessment of the population-based seroprevalence. [6] The prevalence of SARS-CoV-2 infection A c c e p t e d M a n u s c r i p t in the population was estimated using the results from Immunoassay 1 and 2, separately, as well as combined (i.e., if at least one test was positive and then if both tests were positive). Seroprevalence estimates were also computed by sex, age group, and race/ethnicity, as well as by selected variables from the survey (i.e., history of RT-PCR testing, illness episodes, presence of comorbidities, current occupation, and frequency of visits to outside venues). Applying previous methodology, [8] The statistical software R (version 3.6.1, R Foundation for Statistical Computing) and SAS Enterprise Guide (version 7.1, Cary, NC) were used to perform statistical analyses. Two-sided P values less than 0.05 were considered statistically significant. In total, 678 residents living in 428 households were enrolled after visiting on average, 53 households per block group. For both strata combined and for residents living in census block groups in the low positivity stratum, based on chi-squared goodness of fit tests, the sample and target populations had similar age, race/ethnicity, and sex profiles (see Table 1 ) (p > 0.05). In the high positivity stratum, A c c e p t e d M a n u s c r i p t there was a greater proportion of individuals ages 5 to17 in the target population as compared to the sample population (p=0.04). Of the 678 residents on whom venipuncture was performed, blood samples for 6 residents could not be tested using either immunoassay; another 6 individuals had a result from just one of the two antibody tests. In total, 89 individuals tested positive based on Immunoassay 1 (27 in the low positivity stratum and 62 in the high positivity stratum), and 84 individuals tested positive based on Immunoassay 2 (26 in the low positivity stratum and 58 in the high positivity stratum). There was good agreement (96%) between Immunoassay 1 and Immunoassay 2 (see Supplementary Table 1) . Population-based estimates of seroprevalence for SARS-CoV-2 in Houston for individuals ages 5 years and older were 13.5% (95% CI: 9.0%-18.1%) and 12.9% (95% CI: 8.0%-17.7%) using Immunoassays 1 and 2, respectively ( Table 2) . Differences were detected between the low and high positivity strata: 9.6% (95% CI: 3.6%-15.7%) and 17.7% (95% CI: 10.9%-24.6%), respectively based on Immunoassay 1 and 9.1% (95% CI: 2.5%-15.6%) and 17.0% (95% CI: 9.8%-24.1%) for Immunoassay 2. Higher seroprevalence was observed among females, Hispanic Whites, and non-Hispanic Blacks as compared to other racial ethnic groups and among individuals under the age of 40. Supplementary Table 2 Table 3 for a breakdown of these results. Table 4 ). This investigation was undertaken on behest of local public health officials to estimate the proportion of residents living in the fourth largest city of the U.S. who had been infected with SARS-CoV-2. Using a two-stage complex sampling design, a random sample of individuals living in households in Houston, stratified into areas with low-and high positivity rates, completed questionnaires and provided blood samples between September 8 and September 19, 2020. Blood samples were assayed using two different antibody tests, one targeting the nucleocapsid protein and the other the full S glycoprotein, both of which have excellent performance A c c e p t e d M a n u s c r i p t characteristics. During this period, seroprevalence was estimated around 14% with similar results from the two serological assays that were performed. Our community-based seroprevalence findings calculated an under ascertainment in SARS-CoV-2 cases based on RT-PCR testing results. For each RT-PCR positive SARS-CoV-2 case there were approximately 3 additional cases that were under reported based on the number of infections identified using the two immunoassays. Although RT-PCR testing is the gold standard for identifying an acute SARS-CoV-2 infection based on the detection of viral RNA, an immunoassay can provide a more comprehensive assessment of past and recent infections. This is because it relies on measuring the host antibody response that remains detectable for a prolonged period of time rather than viral RNA that is cleared rapidly during the acute infection. The under-ascertainment case ratio can provide a valuable tool for estimating the overall prevalence of SARS-CoV-2 infection based on the cumulative number of RT-PCR positive cases in the community. Our findings confirm a two-fold difference in infection in areas with high (17.7%) versus low (9.6%) RT-PCR positivity rates. This is consistent with the persistent spatial variation in positivity rates that the HHD observed across the city since the start of the pandemic. Findings also indicate that case-based surveillance that relies on RT-PCR tests underestimates the level of SARS-CoV-2 infectivity in the city, as has been reported in a previous investigation that relied on residual sera collected in 10 sites across the U.S. [8] The sampling design employed, based on disaster research principles [15] , has advantages in providing methodology for a rapid public health response in the face of an emergency and results that allow for inferences to the target population. [22] In A c c e p t e d M a n u s c r i p t Houston, considerably higher seroprevalence estimates were observed as compared to investigations conducted earlier in the pandemic both in and outside of the U.S. that employed a similar design, e.g., in Atlanta [12] or Spain. [6] The only investigation to report a greater proportion (22.7%) of individuals with SARS-CoV-2 infection was in a convenience sample in New York in April 2020, [7] which may not have been representative of the general population. While other studies have reported geographic variation within a state [7] or across regions of the U.S, [8] this investigation reported different levels of seroprevalence within a single urban area, stratified by positivity rate. This difference is similar to a previous study conducted in May 2020 that found varying levels of current and active infection by zip code in two parishes in Louisiana. [13] Similar to investigations elsewhere in the U.S., [7, [11] [12] [13] there were significant disparities in SARS-CoV-2 infection with approximately 3.4 and 2.9 fold-increases in estimated SARS-CoV-2 seroprevalence among Hispanics and non-Hispanic blacks as compared to non-Hispanic whites. In contrast, seroprevalence was considerably higher among females and among younger age groups than reported in an investigation in Atlanta, Georgia in spring 2020. [12] These findings suggest important groups for outreach to mitigate infection and its spread. As expected, we also found higher seroprevalence among individuals who received RT-PCR testing and among persons who reported at least one illness during the period, January 15, 2020 to the time of the blood draw. A c c e p t e d M a n u s c r i p t While we employed a stratified two-stage clustered sampling strategy to collect a random sample, the sampling frame did not extend to hospitalized patients or residents of nursing homes, group homes, or prisoners where outbreaks have occurred [23] [24] and, hence, inferences are restricted to non-hospitalized individuals living in non-congregate settings. Owing to the cross-sectional nature of the investigation, the degree to which antibodies persist could not be evaluated. However a follow-up study is planned with participants who tested positive for antibodies in the first wave, which should inform questions about the persistence of antibodies following SARS-CoV-2 infection. In September 2020, approximately 14% of individuals living in the fourth largest city in the U.S. have been infected with SARS-CoV-2. The estimated seroprevalence is greater than that estimated based on RT-PCR data. Results highlight the importance of public health measures like physical distancing, wearing of face coverings, and hand hygiene to mitigate future transmission of SARS-CoV-2. Taken together, HHD's programs for both RT-PCR and antibody testing, will provide them with tools to continue to monitor and mitigate risks of the COVID-19 pandemic in Houston and inform plans for vaccine distribution. M a n u s c r i p t M a n u s c r i p t Authors have no conflict of interest. This work was funded by a grant from the Centers for Disease Control. M a n u s c r i p t A c c e p t e d M a n u s c r i p t A c c e p t e d M a n u s c r i p t Figure 3 Transmission, Diagnosis, and Treatment of Coronavirus Disease 2019 (COVID-19): A Review Interpreting Diagnostic Tests for SARS-CoV-2 What COVID-19 Seroprevalence Surveys Can Tell Us SARS-CoV-2 detection, viral load and infectivity over the course of an infection Antibody Responses to SARS-CoV-2 in Patients with Novel Coronavirus Disease Prevalence of SARS-CoV-2 in Spain (ENE-COVID): a nationwide, population-based seroepidemiological study Cumulative incidence and diagnosis of SARS-CoV-2 infection in New York Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States Estimated SARS-CoV-2 Seroprevalence in the US as of Seroprevalence of SARS-CoV-2-Specific Antibodies Among Adults Population Point Prevalence of SARS-CoV-2 Infection Based on a Statewide Random Sample -Indiana Estimated Community Seroprevalence of SARS-CoV-2 Antibodies -Two Georgia Counties Seroprevalence of SARS-CoV-2 and Infection Fatality Ratio, Orleans and Jefferson Parishes Seroprevalence of anti-SARS-CoV-2 IgG antibodies in Geneva, Switzerland (SEROCoV-POP): a population-based study Centers for Disease Conrol and Prevention (CDC). CASPER Sampling Methodology Platelia SARS-CoV-2 Total Ab Calibrating survey data using iterative proportional fitting (raking) Post-stratification or non-response adjustment? Accounting for Complex Sampling in Survey Estimation: A Review of Current Software Tools Accessed 11 Community Assessment for Public Health Emergency Response (CASPER): An Innovative Emergency Management Tool in the United States Characteristics of U.S. Nursing Homes with COVID-19 Cases COVID-19 in Prisons and Jails in the United States A c c e p t e d M a n u s c r i p t A c c e p t e d M a n u s c r i p t A c c e p t e d M a n u s c r i p t A c c e p t e d M a n u s c r i p t