key: cord-0322807-ocydd9fl authors: Lee, S.; Almendares, O.; Prince-Guerra, J. L.; Heilig, C. M.; Tate, J. E.; Kirking, H. L. title: Performance of Existing and Novel Symptom- and Antigen Testing-Based COVID-19 Case Definitions in a Community Setting date: 2022-05-10 journal: nan DOI: 10.1101/2022.05.10.22274914 sha: 41c84a37eadb50bf6988f69e9fac693fcf45ab6d doc_id: 322807 cord_uid: ocydd9fl Point-of-care antigen tests are an important tool for SARS-CoV-2 detection, but they are less clinically sensitive than real-time reverse-transcription PCR (RT-PCR), impacting their efficacy as screening procedures. Our goal in this study was to see whether we could improve this sensitivity by considering antigen test results in combination with other relevant information, namely exposure status and reported symptoms. In November of 2020, we collected 3,419 paired upper respiratory specimens tested by RT-PCR and the Abbott BinaxNOW antigen test at two community testing sites in Pima County, Arizona. We used symptom, exposure, and antigen testing data to evaluate the sensitivity and specificity of various symptom definitions in predicting RT-PCR positivity. Our analysis yielded 6 novel multi-symptom case definitions with and without antigen test results, the best of which overall achieved a Youden's J index of 0.66, as compared with 0.52 for antigen testing alone. Using a random forest as a guide, we show that this definition, along with our others, does not lose the ability to generalize well to new data despite achieving optimal performance in our sample. Our methodology is broadly applicable, and we have made our code publicly available to aid public health practitioners in developing or fine- tuning their own screening rules. Since the emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in December 2019, the virus has spread globally, causing widespread illness with varying degrees of severity. The emergence of several SARS-CoV-2 variants over time has led to periodic spikes in cases, with the most recent surge in cases related to the B.1.1.529 variant (Omicron) . Testing is an important part of the overall strategy to reduce and/or control SARS-CoV-2 spread, and antigen testing in particular has become very common, with tests widely available for both clinical and at-home use. Relative to nucleic acid amplification tests (NAATs) for SARS-CoV2, antigen tests have several benefits: operationally, they are fast, cheap, and relatively easy to administer; and diagnostically, they are very specific, with false positive rates often under 1%. However, the clinical sensitivity of these tests varies substantially, depending, among other things, on the patient's exposure status, severity of illness, duration of illness, and presence of symptoms (2-8). For the Abbott BinaxNOW COVID-19 Ag Card (BinaxNOW) antigen test, which has been widely adopted in the United States for both community screening and at-home testing, investigators have found sensitivities ranging from as low as approximately 50% (3, 5, (9) (10) (11) (12) to as high as 92% (11) , with many more in between (e.g. from 74%-77% (6, 7, 13, 14) ). These values are much lower than the test's reported sensitivity of 97%, and their variability makes it difficult for public health practitioners to estimate accurately the performance of their screening programs without relying on routine confirmatory testing with NAATs. When potential COVID-19 cases are asymptomatic, antigen testing is more feasible and perhaps the better screening option, particularly when epidemiologic risk factors for infection (such as travel history or close contact with a confirmed case) have not been identified. When potential cases are symptomatic, however, the screening scenario changes in two important ways: the sensitivity of the antigen testing tends to improve (Prince-Guerra 2021), and there is more clinical information available for predicting the underlying infection. The current Council of State and Territorial Epidemiologists (CSTE) COVID-19 surveillance case definition (15) follows this line of reasoning for identifying probable cases, requiring them to have either a positive antigen test or to have had close contact with a confirmed case and be symptomatic. To the best of our knowledge, this case definition was not directly derived from case-level clinical and laboratory data (e.g., using the kind of optimization procedure we adopt below), meaning it may not provide an optimal balance of sensitivity and specificity. What balance it does provide is fixed, since it is a single binary rule intended for surveillance, rather than a collection of rules with different operational characteristics intended for a variety of clinical use cases. The question remains, then, of how best to combine symptom information with antigen test results for screening. In this analysis, our goal was to show how this question might be answered by rigorously exploring the sensitivity-specificity tradeoff in these sorts of hybrid, testing/symptom-based screening case definitions. Building on previous work in improving screening and diagnosis procedures for tuberculosis disease (16) and COVID-19 (17), we show that a brute-force combinatorial search, along with some guidance from a robust machine learning model, can find definitions that improve the sensitivity of antigen testing relative to RT-PCR without sacrificing too much of its specificity. Although the exact definitions we propose may not be appropriate for all COVID-19 screening or diagnosis applications, the discovery procedure itself is adaptable to any dataset with symptom data and a reference standard (e.g., RT-PCR) and will thus be useful for both researchers and practitioners alike in developing definitions tailored to their own scientific and programmatic uses. for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. We collected paired upper respiratory swabs, demographics, symptom information and exposure history from participants aged 10 years and older at two community testing sites in Pima County, Arizona during November 3-17, 2020, as previously described (3, 10) . Here we use symptom data which captured presence of 15 symptoms in the 14 days prior to or on the day of testing. The list of symptoms includes all symptoms per CSTE COVID-19 2020 Interim Case Definition (15) . The protocol for this evaluation was reviewed by the Centers for Disease Control and Prevention (CDC) and determined to be non-research and was conducted consistent with applicable federal law and CDC policy as defined in 45 CRF 46.102(I)(2) (18). All paired swab specimens were tested by RT-PCR and the Abbott BinaxNOW COVID-19 Ag Card (BinaxNOW) antigen test. A healthcare professional collected a bilateral anterior nasal swab first for BinaxNOW testing, immediately followed by a bilateral nasopharyngeal swab for RT-PCR testing. Nasopharyngeal swabs were tested using either the CDC 2019-nCoV RT-PCR Diagnostic Panel (CDC RT-PCR assay) for detection of SARS-CoV-2 (n=2,582) (19) or the Fosun COVID-19 RT-PCR Detection Kit (Fosun RT-PCR assay) (n=837) (20). As described previously (3), validation by the commercial laboratory reported the limit of detection as equivalent at 500 copies/mL for both assays. In the analysis below, both tests were treated equally as reference standards for classification. Our main goal was to see whether we could improve the performance of antigen testing in predicting RT-PCR positivity by using a hybrid case definition that also considers reported symptoms. To that end, we explored two methods for arriving at such a case definition: heuristics based on combinations of reported symptoms and antigen test results, including an exhaustive combinatorial search of the various predictors; and heuristics based on a tree-based machine learning model trained on the data. We were also interested in determining which of the hybrid case definitions would be most likely to generalize out-of-sample data. Here, we used two heuristics to avoid this kind of overfitting: a preference for parsimony, such that case definitions considering fewer variables are preferable to ones considering more; and a preference for case definitions that do not grossly exceed the generalization performance of the same machine learning model as above. To estimate the maximum generalizable classification performance, we might expect from these hybrid case definitions, we turned to the random forest (21), a tree-based machine learning model theoretically guaranteed not to overfit its training data. We trained random forests on two feature sets: one using only the individual symptoms as predictors, and one using both the individual symptoms and antigen test results as predictors. For each set of predictors, we fit five large forests (each with 10,000 trees) to the entire dataset, beginning with trees of depth one for the first forest, and increasing the depth of the trees by one for each successive forest. This process yielded 10 trained forests, one for each unique combination of feature set (n=2) and depth (n=5). From each of these 10 forests, we retrieved the out-of-bag decision scores for all observations, yielding a forestderived distribution of predicted probabilities for RT-PCR positivity. We then compared the predicted probabilities with the actual RT-PCR test results to construct a standard receiver operating characteristic (ROC) curve, plotting sensitivity as a function of 1 -specificity. Because this curve contains the random forest's bestpossible scores for Youden's J index, we treated it as an approximate upper bound on the generalization performance we might expect from naive (i.e., non-learned) case definitions constructed from the same set of variables. In addition to estimating the upper bound on generalizable classification performance, we also used the random forest as a heuristic for selecting symptoms to include in a combined case definition with the antigen results. for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. After training, random forests rank their features based on the average amount of information they contributed to the constituent trees' predictions. Here, we used the feature importances from the symptoms-and-antigen forests to determine which symptoms worked best in combination with test results. We note here that the decision trees in a random forest may split on features "both ways", such that an important feature may be a lack of that feature and not a presence. We used a brute-force combinatorial approach to enumerate the full range of classification scores achievable with non-machine-learned case definitions. Here, we considered three sets of predictors: the symptoms alone, the symptoms in combination with antigen test results, and the symptoms plus known exposure to a confirmed case in combination with the antigen test results. For the symptoms alone, we enumerated all possible m-of-n combinations of symptoms, where n is the total number of symptoms in the combination, and m is the minimum number of symptoms required to be present for the patient to be considered a case. We used 5 as the maximum value for both m and n, mainly to limit the computational complexity of the enumeration, but also to improve the generalizability of the results. For the symptoms and antigen testing together, we combined this enumeration with the binary antigen test results, such that anyone with either a positive antigen test or m of the n symptoms in a given combination would be labeled a case. To serve as a point of comparison for the combinations, we evaluated the predictive performance of 3 previously established case definitions: the CSTE COVID-19 case definition (15) , and two compound symptom combinations found to work well in a published development study (17). We redefine these last two as follows: Reses 1: loss of taste or smell OR at least 1 of (shortness of breath, fever and/or chills, or myalgia) Reses 2: loss of taste or smell OR at least 3 of (shortness of breath, difficulty breathing, fever and/or chills, or myalgia) Both these definitions worked well in the development study; Reses 2 achieved the highest overall performance as measured by F1 score (80%). To our knowledge, this is the first time the definitions have been validated on external data, as well as having their performance examined in combination with antigen test results. For the purposes of screening and diagnosis in the absence of confirmatory testing, we evaluated prediction methods on several common measures of binary classification performance, including sensitivity (se), specificity (sp), Youden's J index, and F1 score. For surveillance purposes, we evaluated methods on a single metric, the difference between actual prevalence, defined as the number of RT-PCR positive patients in our dataset, and predicted prevalence, defined as the number of positive calls from the classification method. For inference, we constructed bias-corrected and accelerated bootstrap confidence intervals (22) for all metrics; these were not adjusted for multiplicity. We also used the Wilcoxon signed-rank test to differences in paired classification metrics, where appropriate. for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The random forest, combinatorial search, and statistical analysis were implemented in Python 3.8 with the scikit-learn (23) and statsmodels (24) packages. We ran all analyses on a scientific workstation with a 12-core CPU and 128 GB of RAM, and we used parallelization whenever possible to improve runtime. Our code is publicly available on GitHub at https://github.com/scotthlee/az-covid. We analyzed 3,419 paired specimens from 3,302 total participants aged 10 years and older with complete testing and survey data for which demographic and symptom data has been previously described in detail (3). Table 1 shows the performance of the single symptoms and antigen test results in predicting RT-PCR positivity. In terms of individual-level performance, congestion was the most sensitive (62%), followed by headache (59%), antigen testing (52%), cough (38%), and chills (37%). Antigen testing was the most specific (99%) and was closely followed by loss of taste or smell (97%), difficulty breathing (97%), rigors (shivers) (96%), and abdominal pain (96%). Overall, antigen testing was the best single predictor, with the highest Youden's J index (52%) and F1score (68%), even though its sensitivity was relatively low (53%). In terms of population-level performance, chills had the most accurate prevalence estimate (+4%), followed by diarrhea (-12%) and fever (-20%). Figure 1 shows the out-of-bag Receiver Operating Characteristic (ROC) curves for our five random forests, organized in ascending order by depth (n) and plotted against the full set of ROC points for the m-of-n combinations of equal size n. Depth did not substantially impact performance, with all forests achieving sensitivities of 69% and 74% at specificities of 95% and 90%, respectively. We can also see that the ROC curves generally contain the points for the combinations, suggesting that the performance of most combinations should generalize to new samples from the same population. Regardless of depth, the strongest predictors by feature importance were antigen test result, loss of taste/smell, chills, myalgia, and fever, many of which also appeared in the top-performing combinations. Figure 1 also shows the enumeration of symptom combinations in ROC space. Higher values of m make the combinations less sensitive but more specific, and a lower ratio of m to n makes them more sensitive and less specific; increasing n has no substantial impact on performance alone. We see this trend in values of Youden's J index, as well, which decreases as a function of m (average J was 0.50, 0.46, 0.38, 0.32, 0.28 for combinations with m of 1, 2, 3, 4, and 5, respectively) but not n (average J was 0.39 for all). By these measures, then, 1-of-n rules perform the best. Furthermore, because these rules are comparable to random forests of depth n, we expect those on or near the corresponding forest's ROC curve both to generalize to new data and to be close to optimal. The maximum achievable sensitivity for any combination of symptoms without antigen testing was 83% (specificity 61%); with antigen testing, this rose to 89% (specificity 61%). Because these are maxima at low values of m (1) and high values of n (15) , and because they are well beyond the corresponding RF ROC curves, we do not expect them to generalize well to other datasets; nonetheless, they serve as useful points of comparison in selecting candidate rules we do expect to generalize. For combinations near the ROC boundary, we propose three rules that seem to work particularly well in combination with antigen testing. In order of increasing specificity, these are as follows: one of fever, chills, fatigue, myalgia, or loss of taste or smell (82% sensitivity at 80% specificity); one of chills, difficulty breathing, or loss of taste or smell (75% at 91%); and two of rigors (shivers), headache, or loss of taste or smell (68% at 96%). for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. For ease of reference, we name these spec80, spec90, and spec95, respectively. Table 2 shows their classification performance both with and without antigen testing in comparison to loss of taste or smell alone and the existing case definitions (with the exception of the CSTE definition, none of these rules consider known exposure status). Of note, the CSTE case definition is the only one that does not perform optimally; all others, including Reses 1 and Reses 2, achieve near-maximal levels of sensitivity at their given levels of specificity. This result is easier to see in Figure 2 , which visualizes points for the case definitions against the full enumeration of m-of-n combinations in ROC space. While no definition is more specific than antigen testing alone, all are more sensitive, and some achieve a better balance of the two metrics, scoring up to 0.14 higher on J (0.52 for antigen alone versus 0.66 for spec90). For surveillance purposes, loss of taste or smell, when combined with antigen results, seems to be a good candidate, since it produces a nearly perfect prevalence estimate (-5%) in addition to achieving a relatively high score on J (0.64). This balance between group-and individual-level performance was rare in these data and suggests the rule could be used effectively for both screening and surveillance, especially considering its parsimony (only 2 features evaluated). Adding exposure status to the combinations generally did not improve their performance. When antigen test results were included alongside the symptoms, adding exposure status to a given m-of-n combination increased its J index only 43% of the time, lowering it the other 57% of the time (0.2% of the pairs had ties). Although the median J index was higher for rules considering exposure status (0.58) than for those not (0.57), the distribution of pairwise differences suggested the opposite (though very subtle) effect, with the median difference hovering around 0.01 in favor of the latter (Wilcoxon signed-rank test p-value < 0.001). Supplemental Figure 2 shows the distribution of these differences, and Supplemental Figure 3 shows how the distribution changes with respect to the specificity of the original combination (i.e., the combination without exposure status). Adding exposure status always made the original combination more specific, but when the original specificity was already high (more than approximately 0.80), the increase did not outweigh the resulting drop in sensitivity (Supplemental Figure 3 ). In our case, adding reported symptom data to antigen test results boosted the latter's sensitivity as a screening procedure without incurring a large drop in specificity. Given the basic characteristics of the data in our study, this is perhaps unsurprising: of the 142 study participants who were antigen negative but RT-PCR positive, 110 (78%) were symptomatic, and so the potential gain in sensitivity from considering symptoms was large from the start. Many of these missed cases are recouped by our proposed hybrid definitions (especially spec80), but the added sensitivity does come at the cost of reduced specificity, which under some practical circumstances may be non-negligible. This result highlights an important point for public health practice: case definitions should be tailored to their specific use cases, or, perhaps more precisely, to the real-world costs associated with falsepositive and false-negative classifications. From a methodological standpoint, our results suggest that using parsimony as a heuristic for selecting definitions may increase the chances of their performance generalizing to unseen data. Reses1 and Reses2 were both developed with a brute-force combinatorial search on different data (as rules with an m1-of-n1 OR m2-of-n2 structure, they are also more complex than the simple m-of-n definitions considered here) (17), and yet they performed well here, skirting the upper boundaries in ROC space for both the random forest and the full for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 10, 2022. ; https://doi.org/10.1101/2022.05.10.22274914 doi: medRxiv preprint enumeration of symptom combinations. Although this kind of combinatorial search is the most straightforward way of determining which of the candidate combinations work best, it is computationally intensive and may be impractical to run on consumer hardware for very large feature spaces. In these scenarios, a hybrid approach may be sensible, such that a random forest or other machine learning method with a built-in variableselection/ranking procedure (e.g., LASSO or L1-regularized regression) could be used to whittle down the feature space before performing the combinatorial search. Relating to SARS-CoV-2 specifically, our results suggest that antigen testing alone may not always be the best strategy for screening or diagnosis in the absence of confirmatory testing (or even in the presence of confirmatory testing, which may vary depending on likelihood of SARS-CoV-2 infection and symptom status), which is in line with current CDC guidance on the use of antigen tests in the community (25). However, given the increase in use of antigen tests and the high demand for rapid test results (26), our analysis here shows the potential benefit for use of antigen tests in the context of specific symptoms to improve sensitivity, albeit at some loss of specificity. Screening scenarios where frequent, repeated antigen testing would be impractical-at border entry points where travelers or migrants may only be tested once before moving on, for example, or in private homes where the cost of repeat testing may be prohibitive-may be good settings for the use of hybrid screening rules that improve on the sensitivity of a single test. Conversely, when confirmatory tests are relatively scarce but required for care, or when the local healthcare system is nearing capacity, a more specific rule, like our spec90 candidate rule or antigen testing alone, may be more appropriate. Finally, case definitions considering exposure status may be less optimal than those that do not, at least in scenarios similar to those in which our data were collected. In our results, these rules were often less sensitive than alternatives achieving the same specificity and less specific than those achieving the same sensitivity. This effect may have been due to the presence of other high-performing predictors in the data, like the antigen test results and loss of taste and/or smell, that made the added specificity from considering exposure less useful than it would have been otherwise. Our analysis has several limitations. First, the CSTE COVID-19 case definition (27) has been updated since the time of this investigation to add additional symptoms for consideration which were not included in our investigation and cannot be evaluated, although many symptoms added to the most recent definition are more severe in nature (confusion or change in mental status; persistent chest pain/pressure; blue-colored skin, lips, or nail beds, depending on skin tone; and inability to wake or stay awake), and it is unlikely they would have been captured in an outpatient community setting. Second, although SARS-CoV-2 samples were not sequenced as part of this investigation, data collection occurred in November 2020, well before the two variants of concern, Delta (B.1.617.2) and Omicron (B.1.1.529), were documented in the United States. The candidate definitions we propose above, which rely heavily on loss of taste or smell to boost the sensitivity of antigen testing, may not generalize well to populations heavily affected by these variants if the symptoms they cause differ substantially from those cause by the original strain. Additionally, it is unclear whether these results may apply to reinfections or vaccine-breakthrough infections as this study was conducted prior to vaccine availability and widespread concern for re-infection. Because our analysis could easily be rerun with the availability of new data, we hope these limitations will be addressed by future research. Third, the data presented here are crosssectional and pre-symptomatic individuals could not be identified. Fourth, as mitigation measures have relaxed, such as limits on large gatherings, mask use, etc., the circulation of other non-SARS-CoV-2 respiratory viruses will likely increase and the impact on symptom analysis for SARS-CoV-2 is not fully known. Finally, the FDA recently published a statement (28) suggesting that antigen tests may have reduced sensitivity against the Omicron variant, although there is also some recent data showing antigen test sensitivity remains similar to pre-Omicron levels in a community setting (29). for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 10, 2022. ; https://doi.org/10.1101/2022.05.10.22274914 doi: medRxiv preprint While confirmatory testing for symptomatic persons with a negative antigen test is recommended (25), this often presents challenges including logistical and cost constraints and longer turnaround times for RT-PCR, thus delaying care and/or isolation of individuals (26). In certain situations, confirmatory testing may not be available at all -especially in low resource settings. Therefore, using the methodology described in this study to better understand combinations of symptoms that are more predictive of SARS-CoV-2 infection in the absence of a positive antigen test may aid clinicians and public health professionals. In situations where confirmatory testing is not available, and immediate public health action is needed, considering the presenting symptoms in antigen negative individuals may provide important information on how to prioritize public health action, especially in congregate living or school settings. for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 1 Sensitivity, the probability of testing positive when disease is present. 2 Specificity, the probability of testing negative when disease is absent. 3 Positive predictive value, the probability of a patient having disease when test is positive. 4 Negative predictive value, the probability of a patient not having disease when test is negative. 5 Youden's J index is defined as sensitivity plus specificity minus 1 (perfect score = 1) 6 F1 score is the harmonic mean of sensitivity and PPV (perfect score = 1) 7 Difference in prevalence is the difference in the number of positive symptom screens (i.e. true positive plus false positive) for each combination from the actual number of RT-PCR positive specimens. 8 Includes measured or subjective fever 9 Included nasal congestion or runny nose. 10 Cough, shortness of breath includes only if symptom was new or worsening in the 14 days prior to testing. for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The colors correspond to the three kinds of predictors that could be included in a combination: symptoms alone (blue), symptoms plus antigen test results (orange), and symptoms plus antigen test results, but with the added requirement that symptoms are only considered with known exposure to a confirmed case (green). for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted May 10, 2022. ; https://doi.org/10.1101/2022.05.10.22274914 doi: medRxiv preprint Field performance and public health response using the BinaxNOW TM Rapid SARS-CoV-2 antigen detection assay during community-based testing Performance Characteristics of the Abbott BinaxNOW SARS-CoV-2 Antigen Test in Comparison to Real-Time Reverse Transcriptase PCR and Viral Culture in Community Testing Sites during Performance Evaluation of Serial SARS-CoV-2 Rapid Antigen Testing During a Nursing Home Outbreak Performance of the BinaxNOW coronavirus disease Antigen Card test relative to the severe acute respiratory coronavirus virus 2 (SARS-CoV-2) real-time reverse transcriptase polymerase chain reaction (rRT-PCR) assay among symptomatic and asymptomatic healthcare employees Performance and Implementation Evaluation of the Abbott BinaxNOW Rapid Antigen Test in a High-Throughput Drive-Through Community Testing Site in Massachusetts Performance of Repeat BinaxNOW SARS-CoV-2 Antigen Testing in a Community Setting Evaluation of the Abbott BinaxNOW rapid antigen test for SARS-CoV-2 infection in children: Implications for screening in a school setting A Real-World Comparison of SARS-CoV-2 Rapid Antigen Testing versus PCR Testing in Florida Evaluation of Abbott BinaxNOW Rapid Antigen Test for SARS-CoV-2 Infection at Two Community-Based Testing Sites -Pima County Performance Characteristics of BinaxNOW COVID-19 Antigen Card for Screening Asymptomatic Individuals in a University Setting Effectiveness of Abbott BinaxNOW Rapid Antigen Test for Detection of SARS-CoV-2 Infections in Outbreak among Horse Racetrack Workers Epidemiologic characteristics associated with SARS-CoV-2 antigen-based test results, rRT-PCR cycle threshold values, subgenomic RNA, and viral culture results from university testing Multidisciplinary assessment of the Abbott BinaxNOW SARS-CoV-2 point-of-care antigen test in the context of emerging viral variants and self-administration Council for State and Territorial Epidemiologists (CSTE) 102(l)(2), 21 C.F.R. part 56; 42 U.S.C. Sect. 241(d); 5 U.S.C. Sect. 552a; 44 U.S.C. Sect. 3501 et seq Better Bootstrap Confidence Intervals Scikit-learn: Machine learning in Python Scaling up COVID-19 rapid antigen tests: promises and challenges Council for State and Territorial Epidemiologists (CSTE) Direct Comparison of SARS-CoV-2 Nasal RT-PCR and Rapid Antigen Test (BinaxNOWâ„¢) at a Community Testing Site During an Omicron Surge Difference in the number of positive symptom screens (i.e. true positive plus false positive) for each combination from the actual number of RT-PCR positive specimens Spec80 includes one of fever, chills, fatigue, myalgia, or loss of taste or smell Reses 1: Loss of taste or smell or at least one of shortness of breath, fever and/or chills, or myalgia Reses 2: Loss of taste or smell OR at least three of shortness of breath, difficulty breathing, fever and/or chills, or myalgia. 15 CSTE: U.S. Council of State and Territorial Epidemiologists (CSTE) interim and revised clinical criteria for COVID-19 reporting Includes antigen test results for each combined multi-symptom case definition