key: cord-0702845-trdcg6zw authors: Graber, T. E.; Mercier, E.; D'Aoust, P. M.; Hoang, H.-D.; Tian, X.; Tasneem, S.; Bhatnagar, K.; Delatolla, R. title: An allele-specific primer extension assay to quantify the proportion of B.1.1.7-specific SARS-CoV-2 RNA in wastewater date: 2021-02-23 journal: nan DOI: 10.1101/2021.02.22.21252041 sha: 4bd7add648e3850d56e82b96561804b72bc688d1 doc_id: 702845 cord_uid: trdcg6zw The coronavirus disease 2019 (COVID-19) pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has claimed millions of lives globally to date. Rapid accumulation of co-occurring mutations has led to the emergence of viral variants which appear to be more transmissible, virulent, or both. Variants of concern (VOC) now include those belonging to the B.1.1.7, B.1.351, and P.1 lineages. Early detection of VOC and the ability to retrospectively follow their respective, longitudinal prevalence in communities is wanting. Wastewater-based epidemiology (WBE) allows tracking of disease prevalence in the general population using RT-qPCR to detect viral fragments, but ongoing longitudinal studies have yet to differentiate between these variants. Here, we describe and validate a primer extension strategy to amplify and distinguish B.1.1.7-specific, from non-B.1.1.7 alleles by combining new forward, and existing CDC 2019-nCoV_N1 qPCR probes and reverse primers. This assay can be quickly implemented within a current SARS- CoV-2 WBE framework with minimal cost with the goal of providing early detection of increasing B.1.1.7 transmission in a community prior to identification through clinical testing and confirmation via secondary screening strategies. As such, this assay can provide public health units with an additional and much needed metric to be able to rapidly triangulate B.1.1.7 prevalence. Since the beginning of the COVID-19 pandemic, the possibility of viral mutations within the SARS-CoV-2 genome was outlined as a potential risk 1 . SARS-CoV-2, being a single-stranded RNA (ssRNA) virus, is susceptible to frequent mutations [2] [3] [4] [5] . Generally, the majority of viral mutations have a low impact on disease in populations 6 , but in late 2020, several concurrent mutations emerged which increase infectivity 7, 8 and potentially decrease sensitivity to neutralizing antibodies 9, 10 . As of February 2021, these variants of concern (VOC) include: 1) the B.1.1.7/501Y.V1 variant (also known as the UK variant); 2) the B.1.351/501Y.V2 variant (also known as the South African variant) and 3) the P.1/501Y.V3 variant (also known as the Brazilian variant). With communities around the world already reeling from the effects of the initial waves of the pandemic, the onset and widespread resurgence of newer variants, which may be more infectious and might exhibit a higher case fatality rate than ancestral variants, are exhausting public health resources and effecting the mental health of communities. SARS-CoV-2 WBE is being adopted as a means of surveillance in many jurisdictions around the world (reviewed in 11 ). The common method currently used for SARS-CoV-2 WBE is RT-qPCR based on probe and primer sets used for clinical testing 12 . These commonly used probe and primer sets to date do not distinguish VOCs, including B.1.1.7. The high specificity and sensitivity of the CDC "2019-nCoV N1" (N1) and "2019-nCoV N2" (N2) probe/primer sets seen in clinical samples 12 , appears to be mirrored in wastewater matrices 11 although relative sensitivities haven't been formally investigated. N1 and N2 assays are thus used by many WBE research groups including those participating in the wastewater-based surveillance initiative in Ontario, Canada. Here, we describe and validate an RT-qPCR assay that targets a B.1.1.7specific allele that lies adjacent to the N1 region. Diagnosing a B.1.1.7 VOC case requires an assay with a very high positive predictive value. This is difficult to achieve in a singleplex qPCR assay, as targeting multiple, co-occurring mutations at different loci may be needed to unambiguously define a VOC. Thus, multiplexed assays or secondary screens incorporating Sanger sequencing and/or whole-genome sequencing are required to confirm the genetic lineage of a virus isolated from clinical specimens. Unlike individual testing, wastewater contains mixtures of genomic fragments sourced from multiple infections that can be probed across the course of disease in a given population of individuals. In essence, wastewater surveillance of SARS-CoV-2 allows unbiased sampling of population-level allele frequencies in a single test tube. Thus, additional granularity that is represented in wastewater samples includes genomes from multiple individuals collected over continuous periods of time. This property provides , and D3L or A28271del/D3L which target B.1.1.7-specific alleles. TRS-B=Transcriptional regulatory site-body. Radial plots were generated on nextstrain.org 13 with sequencing data from GISAID 14 . Allele frequency table generated at covidcg.org using GISAID sequencing data. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted February 23, 2021. ; https://doi.org/10.1101/2021.02.22.21252041 doi: medRxiv preprint substantial power to probe and follow the dynamics of allele frequencies in a population as infections wax and wane. Additionally, following a single, signature mutation in this context gives sufficient power to distinguish VOC lineages, as other variants that are less transmissible, and which have a very low prevalence in communities, might carry a mutation at the same locus but whose frequency and prevalence does not change substantially over time ( Figure 1A) . We found that one such signature mutation on the N gene at the amino acid level (D3L) is present in the B.1.1.7 lineage ( Figure 1B) . Moreover, a single nucleotide variant (SNV) in the wobble position of N:D3, T28282A appears to be unique to the B.1.1.7 lineage ( Figure 1C ). Serendipitously, D3L lies only 5 bp upstream from the start of the CDC 2019-nCoV N1 amplicon region, which we currently target to detect SARS-CoV-2 in wastewater 15, 16 . Taking advantage of this, we designed forward primers with mutations at the 3' end complementary to the B.1.1.7 signature allele at this locus. This allele-specific primer extension strategy works by virtue of the inability of DNA polymerase to extend mismatched nucleotides at the 3' end of the primer 17 . This method affords high allele specificity, while at the same time obviating the complexities associated with the development of competition probe assays. We then combined this forward primer with the existing N1 probe and reverse primer. We call these two new assays "D3" and "D3L" and it distinguishes non-B.1.1.7 from B.1.1.7 lineages when applied in the wastewater context. Note that this assay may not have a high positive predictive value in clinical samples for viruses of the B.1.1.7 lineage as a rare mutation may occur at this location in individual cases. Part way into designing and testing this new assay, we noted the emergence of a SNV in B.1.1.7 sequences deposited at GISAID that co-occurs with the D3L mutation and is located in the middle of the D3L forward primer. This deletion mutant (A28271del) now appears to be the dominant SNV in the B.1.1.7 lineage, cooccurring with D3L at similar frequencies in the United Kingdom ( Figure 1D) . Therefore, we designed an additional forward primer (A28271del/D3L) incorporating this SNV. The primer locations are shown in Figure 1E . We in vitro transcribed (IVT) RNA template to determine the allelic specificity of the D3 and D3L assays. We synthesized the dsDNA with sequence complementary to either the D3 or D3L amplicons. Note that this sequence also contains the nested N1 amplicon. An additional ~250nt of downstream sequence was incorporated to include amplicons corresponding to other PCR standards used in the laboratory. A T7 promoter sequence was incorporated at the 5' end of the sequence allowing in vitro transcription of a ~350 bp RNA (Figure 2A) . We performed D3, D3L, and A28271del/D3L assays on either the D3 or D3L RNA templates, and, in parallel performed the N1 assay whose expression allowed normalization of showing expression on commercially available RNA templates (standards). Note that the EDX(COVID19) standard, unlike the other two shown, does not contain the intergenic sequence that is complementary to D3 forward primers and thus no amplification is observed. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted February 23, 2021. ; https://doi.org/10.1101/2021.02.22.21252041 doi: medRxiv preprint copies of each RNA standard. Critically, the D3 assay showed positivity on the D3 RNA template, albeit at a lower sensitivity than N1; while both D3L and 28271del/D3L assays amplified at least 10 Ct's higher (Figure 2A) . Similarly, both the D3L and A28271del/D3L assays showed positivity on the D3L template, while the D3 assay amplified more than 15 cycles later on the same template. This demonstrates high template specificity allowing quantification of the relative proportion of alleles in the sample with high confidence. Additionally, we found that the D3L assay failed to amplify on 3 commercially available synthetic RNA templates (Figure 2B) , while the "wildtype" D3 allele was detected at similar levels as the N1 target. Notably, D3 failed to detect on the EDX standard, which lacks the Orf8-Orf9 intergenic sequence that is complementary to the D3 forward primer. These data indicate high allele specificity for distinguishing D3 from D3L. We are working to also determine the specificity of D3L vs. A28271del/D3L assays, although we expect this will be less specific, with less Ct differential as the deletion occurs in the middle of the forward primer. We wished to confirm the specificity and sensitivity of D3 and D3L assays using fecal RNA donated by a COVID-19 patient. The patient in question was admitted to the ICU in 2020, prior to the emergence of B.1.1.7 viruses in Canada. Confirming the specificity of the assays, we did not see amplification with the D3L assay in this sample, while it was seen with the D3 assay. We performed a comparison between N1 and D3 assays at different annealing temperatures to optimize sensitivity (Figure 3) . At 55ºC, the D3 assay appeared to be 2-3X less sensitive than the N1 assay. Surprisingly, increasing annealing temperature to 60ºC enhanced sensitivity of the D3 assay, bringing it closer to that of N1. We performed N1 as well as the allele-specific assays on RNA extracted from 24h composites of primary sludge obtained from a wastewater resource recovery facility (WRRF) serving a large Canadian urban centre (Ottawa, Ontario; pop. ~1M) between Fall 2020 and January 16, 2021 (n=4). We performed that assay at 2 distinct annealing temperatures, however both D3L and the A28271del/D3L assays failed to detect B.1.1.7-specific RNA fragments ( Figure 4A) . Fewer than 5 clinically verified and travel-related B.1.1.7 cases were active in this community at the time of the last sampling of January 16, 2021. More surprisingly, we found a consistent decrease in sensitivity with increasing annealing temperature -the opposite of what was observed when RNA isolated from the faeces of one patient (Figure 3) . Although the explanation is unclear, a possible reason may be reduced template specificity in a wastewater matrix at lower annealing temperatures. We next obtained 24h composite influent and grab primary sludge samples from a WRRF serving the mid-size (pop. ~150K) community of Barrie, Ontario. We readily detected N1 amplicons in both samples, indicating that SARS-CoV-2 was prevalent in the community. Two weeks prior to sampling, an outbreak of B.1.1.7 infection was confirmed at a long-term care facility (LTCF) within the WRRF catchment. More than 100 cases were later confirmed by whole . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted February 23, 2021. ; https://doi.org/10.1101/2021.02.22.21252041 doi: medRxiv preprint genome sequencing of clinical samples at the provincial laboratories. Importantly, while the D3L and A28271del/D3L assays yielded non-detects in samples from Ottawa, with either no or very few known cases of B.1.1.7 (Figure 4A, B) ; Barrie, with a known active B.1.1.7 outbreak, yielded a readily detectable signal with all assays and in both sample types ( Figure 4B) . Surprisingly, D3, targeting non-B.1.1.7 was at very low levels compared to both D3L and A28271D3L alleles, suggesting that B.1.1.7 was the dominant viral lineage in the community at the time of sampling. Furthermore, we saw >2-fold greater signal with A28271D3L than D3L suggesting that this is the major allele in the faecally-shed viral RNA population. It's possible that unequal primer efficiencies may lead to erroneous conclusions regarding the relative proportion of each allele. We are thus currently defining the primer efficiencies for these assays. Together, these data support the use of allelespecific qPCR strategies to follow the emergence and proportional changes of VOCs in populations served by WRRFs. Allele-specific RT-qPCR assays provide sufficiently high sensitivity and specificity in the wastewater context and can be rapidly designed and implemented within existing WBE surveillance architectures. 24-hour composite samples of primary clarified sludge (PCS) were collected from the Ottawa, Ontario WRRF with no known community transmission of B.1.1.7. 24-hour composite samples of post-grit influent and grab samples of PCS were collected from the Barrie, Ontario WRRF during an isolated B.1.1.7 outbreak. Both sampled WRRFs are designed and operated as conventional activated sludge treatment systems. Influent samples were first decanted at 4°C for one hour and for each sample, 40 mL of the settled solids were pipetted to a 50 mL centrifuge tube. Similarly, for the PCS samples, 40 mL of well-mixed PCS was also pipetted to a 50 mL centrifuge tube. Samples were then centrifuged at 10,000 x g for 45 minutes at 4°C. The supernatant was decanted, taking care not to disturb the resulting pellet. Samples were then centrifuged a second time at 10,000 x g for another 5 minutes and the remaining supernatant decanted. The resulting influent solids and PGS pellets were transferred to a new, pre-weighed . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) RNase-free centrifuge tube before being weighed and immediately extracted. Viral RNA was then extracted from the resulting pellets of influent and PCS solids using the RNeasy PowerMicrobiome Kit (Qiagen, Germantown, MD). The following changes to the manufacturer's protocol were performed: 1) 250 ± 15 mg of sample pellet was added to the initial extraction step in place of 200 µL of liquid sample, and 2) the optional phenol-chloroform step was instead performed with Trizol LS reagent (ThermoFisher, Ottawa, Canada) to maximize RNA protection and lysis prior to vortexing and centrifugation. The resulting aqueous phase was retained and processed as per the original manufacturer's protocol, including the DNase treatment step. RNA was eluted in 100 µl of RNase-free water. One-step RT-qPCR was performed with TaqMan ® Fast Virus 1-Step Master Mix (ThermoFisher, Ottawa, Canada). N1 assay measurements, detecting SARS-CoV-2 RNA including B.1.1.7 lineages, were performed using the premixed 2019-nCoV N1 Assay-RUO probe/primers set (500/125nM, respectively) (IDT, Kanata, Canada). Allele-specific RT-qPCR was performed in parallel using either D3 (detecting non-B.1.1.7 alleles), D3L, or A28272del/D3L (both detecting B.1.1.7 alleles only) combined with the N1 probe and reverse primers at 500, 125 and 500nM, respectively. Sequences of the probes/primers are list in Table 1 . RT-qPCR reactions were run in triplicate using 1.5 µl of RNA input in a final reaction volume of 10 μl. Cycling was performed on a CFX Connect qPCR thermocycler (Bio-Rad, Hercules, CA) as follows: RT at 50ºC, 5 minutes, followed by RT inactivation, polymerase activation and template denaturation at 95ºC for 20 seconds, and 45 cycles of denaturation (95ºC/3s), then annealing/extension (55ºC/45s). No template controls showed amplification. Reactions were considered positive when Ct<40. The authors declare that no known competing financial interests or personal relationships could appear to influence the work reported in this manuscript. Emerging SARS-CoV-2 mutation hot spots include a novel RNAdependent-RNA polymerase variant Why are RNA virus mutation rates so damn high? Viral evolution and the emergence of SARS coronavirus Biased Mutation and Selection in RNA Viruses Viral Mutation Rates We shouldn't worry when a virus mutates during disease outbreaks Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus Evolution of antibody immunity to SARS-CoV-2 mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants Wastewater surveillance for SARS-CoV-2: Lessons learnt from recent studies to define future applications Analytical sensitivity and efficiency comparisons of SARS-CoV-2 RT-qPCR primer-probe sets NextStrain: Real-time tracking of pathogen evolution Data, disease and diplomacy: GISAID's innovative contribution to global health Quantitative analysis of SARS-CoV-2 RNA from wastewater solids in communities with low COVID-19 incidence and prevalence Catching a resurgence: Increase in SARS-CoV-2 viral RNA identified in wastewater 48 h before COVID-19 clinical tests and 96 h before hospitalizations Single-nucleotide polymorphisms and other mismatches reduce performance of quantitative PCR assays The authors wish to acknowledge the leadership of Dr. Alex MacKenzie (CHEO-RI) in this continuing endeavour as well as the participation of the City of Ottawa, Ottawa Public Health, the City of Barrie, the Simcoe Muskoka District Public Health Unit, Public Health Ontario and the Ministry of the Environment, Conservation and Parks, and their respective employees involved in the project during this study. Their time, facilities, resources, and assistance provided throughout the study greatly contributed to this work. We gratefully acknowledge the originating and submitting laboratories responsible for sharing genetic sequence data via the GISAID Initiative, which was used in this project.