key: cord-0769957-vp16ns3d authors: Lim, Ho Jae; Park, Min Young; Jung, Hye Soo; Kwon, Youngjin; Kim, Inhee; Kim, Dong Kwan; Yu, Nae; Sung, Nackmoon; Lee, Sun-Hwa; Park, Jung Eun; Yang, Yong-Jin title: Development of an efficient Sanger sequencing-based assay for detecting SARS-CoV-2 spike mutations date: 2021-12-14 journal: PLoS One DOI: 10.1371/journal.pone.0260850 sha: dcb90b9882ec58c0ffae33d9a0cf101d565a4a49 doc_id: 769957 cord_uid: vp16ns3d Novel strains of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) harboring nucleotide changes (mutations) in the spike gene have emerged and are spreading rapidly. These mutations are associated with SARS-CoV-2 transmissibility, virulence, or resistance to some neutralizing antibodies. Thus, the accurate detection of spike mutants is crucial for controlling SARS-CoV-2 transmission and identifying neutralizing antibody-resistance caused by amino acid changes in the receptor-binding domain. Here, we developed five SARS-CoV-2 spike gene primer pairs (5-SSG primer assay; 69S, 144S, 417S, 484S, and 570S) and verified their ability to detect nine key spike mutations (ΔH69/V70, T95I, G142D, ΔY144, K417T/N, L452R, E484K/Q, N501Y, and H655Y) using a Sanger sequencing-based assay. The 5-SSG primer assay showed 100% specificity and a conservative limit of detection with a median tissue culture infective dose (TCID(50)) values of 1.4 × 10(2) TCID(50)/mL. The accuracy of the 5-SSG primer assay was confirmed by next generation sequencing. The results of these two approaches showed 100% consistency. Taken together, the ability of the 5-SSG primer assay to accurately detect key SARS-CoV-2 spike mutants is reliable. Thus, it is a useful tool for detecting SARS-CoV-2 spike gene mutants in a clinical setting, thereby helping to improve the management of patients with COVID-19. The World Health Organization (WHO) declared the coronavirus disease (COVID- 19) , which is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), as a global pandemic on March 11, 2020 [1] . SARS-CoV-2 is a highly transmissible virus and has a long incubation time before the manifestation of symptoms, such as fever, cough, shortness of breath, and diarrhea [2] . SARS-CoV-2 has a single-stranded, positive-sense RNA genome of a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 approximately 29.9 kb, which encodes several proteins, including the structural proteins, and spike (S) [3, 4] . The S gene encodes the S1 and S2 subunits, and the S1 subunit contains an N-terminal domain and a receptor-binding domain, the latter of which is associated with human infections [5] . Across its genome, the virus accumulates mutations that are associated with its transmissibility, virulence, or resistance to some neutralizing antibodies [6, 7] . SARS-CoV-2 variants have been recently identified, raising concerns abouta subsequent wave of the pandemic. Since the emergence of multiple variants, the WHO and the Centers for Disease Control and Prevention (CDC) have set up a classification scheme for monitoring the potential impact of emerging variants. The variants are classified into variants of interest (VOIs), variants of concern (VOCs), and variants of high consequence (VOHCs) [8, 9] [10] , have been associated with changes in receptor binding, reduced neutralization by antibodies generated against previous infection or vaccination, reduced efficacy of treatments, potential diagnostic effects, or a predicted increase in transmissibility or disease severity. VOCs, including alpha (B.1.1.7), beta (B.1.351), delta (B.1.617.2), and gamma (P.1) [10] , have been associated with an increase in transmissibility and disease severity, a significant reduction in neutralization by antibodies generated during previous infection or vaccination, reduced effectiveness of treatments or vaccines, or diagnostic detection failures. The D614G substitution is the most prevalent mutation observed [11, 12] . However, VOHC-related lineages have not yet been classified [10] . These lineages have been analyzed using next generation sequencing (NGS) methods and classified using Phylogenetic Assignment of Named Global Outbreak (PANGO) lineages [13] . The use of NGS has been very useful for obtaining accurate information on genetic variability and transmission [14] . However, as outbreaks occur sporadically and cannot be predicted, it is not always possible to have all resources required to perform the tests necessary to detect SARS-CoV-2 variants, especially in resource-limited settings [15] . To overcome this limitation, both PCR and Sanger sequencing have been applied [16, 17] . Here, we aimed to accurately and rapidly detect nine key S mutations (ΔH69/V70, T95I, G142D, ΔY144, K417T/N, L452R, E484K/Q, N501Y, and H655Y) in strains classified as VOCs and/or VOIs using our laboratory-developed five SARS-CoV-2 S gene (5-SSG) primers via PCR assay in conjunction with Sanger sequencing. In addition, we compared the results of our assay with those of a commercially available NGS assay to evaluate its accuracy and reliability in detecting and identifying variants. The 5-SSG primer assay was designed based on the Wuhan-CoV reference sequence (Wuhan-Hu-1, NCBI accession number NC_045512.2) [18, 19] . Primers were modified from the Global Initiative on Sharing Avian Influenza Data (GISAID) database, with a frequency cut-off > 1%, applied with degenerative or inosine to optimize the melting temperature (Tm), avoid repetitive sequences, and include GC content > 65%, using Gene Runner (ver. 6.0) [20, 21] . NCBI-Basic Local Alignment Search Tool (NCBI-BLAST) was used to optimize the specificity for SARS-CoV-2 [22] . Sequences were also screened based on alignments using the GISAID database for species selectivity. After these assessments, five targets were selected for validation using the S mutant assay, and M13 universal sequence primers were tagged for Sanger sequencing. The sequences of the 5-SSG primers (69S forward primer, TGTAAAACGACGGCC AGTATTACCCTGACAAAGTTTTCAGATC; 69S reverse primer, CAGGAAACAGCTATGACGCG TTATTAACAATAAGTAGGGAC; 144S forward primer, TGTAAAACGACGGCCAGTCCACTGAG AAGTYTAACATAAT AAGAG; 144S reverse primer, CAGGAAACAGCTATGACTCACCAGGAG TCAAATA ACTTCTAT; 417S forward primer, TGTAAAACGACGGCCAGTGCTTTAGAACCA TT GGTAGATTTG; 417S reverse primer, CAGGAAACAGCTATGACGTTTGAGATTAG ACTT CCTAAACAATC; 484S forward primer, TGTAAAACGACGGCCAGTTCTAAYA AICTTGATT CTAAGGTTG; 484S reverse primer, CAGGAAACAGCTATGACCKCCT GTGCCTGTTAAACC ATT; 570S forward primer, TGTAAAACGACGGCCAGTGAAC TTCTACATGCACCAGCAAC; 570S reverse primer, CAGGAAACAGCTATGACCTG CATTCAGTTGAATCACCAC) are presented in Table 1 . The 5-SSG primer-specific target mutants and lineages are summarized in S1 Table. To determine analytical specificity, 67 strains of viruses, bacteria, and fungi were used with or without respiratory pathogens, including 42 strains of virus (24 strains of SARS-CoV-2, Coronavirus OC43 and 229E, and 18 other viruses), 19 strains of bacteria, and six strains of fungi (Tables 2 and S2 Table 2 . As part of the routine procedure using the Allplex™ SARS-CoV-2 assay for SARS-CoV-2 testing (Seegene Inc., Seoul, Republic of Korea), anonymized residual of 17 SARS-CoV-2 positive nasopharyngeal swab specimens of patients diagnosed with SARS-CoV-2 positive between February and June 2021 were obtained and used for this study. All samples were processed using an automated nucleic acid extraction system, namely MagNA Pure 96 (Roche, Basel, Switzerland), according to the manufacturer's protocol, and stored at −80˚C until use [23] . The comparative limit of detection (LOD) of the 5-SSG primer assay was determined using heat-inactivated cultural fluids of SARS-CoV-2 (Zeptometrix-0810589CFHI) as a positive control, following the manufacturer's instructions. Each control was 10-fold serially diluted to approximately 1.4 × 10 3 , 1.4 × 10 2 , 1.4 × 10 1 , 1.4 × 10 0 , 1.4 × 10 −1 , and 1.4 × 10 −2 TCID 50 (median tissue culture infectious dose)/mL. For the performance analysis of 5-SSG primers, 25 replicates were performed. The comparative LOD was determined as the minimum detectable concentration. Probit regression was used to estimate positive values with 95% confidence intervals [24] . The template (2.5 ng) from viral, bacterial, and fungal strains was added for one-step RT-PCR (Nanohelix Co., Daejeon, Republic of Korea) analysis, which was performed using a SeeAmp (Seegene) instrument. PCR assays with the 5-SSG primers were performed using the following thermal cycling conditions: 45˚C for 15 min (reverse transcription), followed by 94˚C for 15 min (initial denaturation), and 45 cycles of 94˚C for 10 s (denaturation), 64˚C for 30 s (annealing), and 72˚C for 30 s (extension). A final extension step was conducted at 72˚C for 5 min. Next, the PCR products were analyzed using 2% agarose gel electrophoresis with 0.5× TBE buffer, and the gels were stained with ethidium bromide (Biosesang, Seongnam, Republic of Korea). PCR amplicons from the 67 samples were analyzed using agarose gel electrophoresis in a horizontal unit (CBS Scientific, San Diego, CA, USA) operating at 280 V for 28 min, and the band sizes on ethidium bromide-stained gels were quantified using a Gel-Doc XR+ system (Bio-Rad Laboratories, Hercules, CA, USA). All PCR-positive products were purified with MEGAquick-spin™ plus (iNtRON Biotechnology, Seongnam, Republic of Korea), according to the manufacturer's instructions [25] . The sequence analysis of PCR products (partial S gene amplified to~800 bp) was performed using the 5-SSG primers (5 0 tagged M13 primer) and the BigDye Terminator v3.1 cycle sequencing kit reagent (Applied Biosystems, Foster City, CA, USA). The sequence analysis conditions were as follows: 96˚C for 1 min (incubation), followed by 25 cycles of 96˚C for 10 s (denaturation), 50˚C for 5 s (annealing), and 60˚C for 4 min (extension). Dye-labeled products were analyzed using an ABI 3730 sequencer (Applied Biosystems). Sequencing chromatograms were analyzed manually using Variant Reporter™ v3.0 software (Applied Biosystems). Samples were classified as mutants if the sequencing results from the specific regions matched those of lineage information [26] . NGS was performed using the SARS-CoV-2 FLEX Panels (Paragon Genomics, Hayward, CA, USA) and an Illumina MiSeq platform (Illumina, San Diego, CA, USA) in accordance with the manufacturer's instructions [27] . Reverse transcription was performed using 55 ng of nucleic acid, and multiplex PCR was performed using 343 pairs of primers. A second PCR was conducted using CleanPlex Dual-Indexed PCR Primers for Illumina1 Set A (Paragon Genomics). The final library was sequenced on an Illumina MiSeq platform (Illumina) with 2 × 150bp flow cells using a MiSeq Micro Reagent Kit v2 (300 cycles). Next, NGS assays were analyzed using the Flomics pipeline (Flomics, Barcelona, Spain). The processing pipeline comprised FastQC v0.11.9 (quality control), followed by fastp v0.20.1. (adapter trimming), Bowtie2 (reference alignment), and iVar v1.2.2. (variant calling). The viral lineage was accessed using the GISAID database, and PANGO Lineages [28] . In NGS analysis, depths of less than 10× were identified by read-depth segmentation in an integrated genomics viewer [27] . Ethical aspects of this study were reviewed and approved by the Seegene Medical Foundation Institutional Review Board (approval number, SMF-IRB-2021-006), provided that after conducting the laboratory diagnoses of SARS-CoV-2 testing, the remaining samples be destroyed. All data were fully anonymized administrative data without patient identifiers, and patient consent was waived by the institutional review board. The 5-SSG primers consisted of five primer pairs, including 69S, 144S, 417S, 484S, and 570S. The 69S primer pair for 4 mutants (A67V, ΔH69/V70, D80A, and T95I), 144S primer pair for 9 mutants (D138Y, G142D, ΔY144, W152C, E154K, ΔE156/F157, R158G, R190S, and D215G), 417S primer pair for 4 mutants (D253G, K417T, K417N, and L452R), 484S primer pair for 4 mutants (T478K, E484K, E484Q, and N501Y), and 570S primer pair for 8 mutants (A570D, D614G, H655Y, Q677H, P681H, P681R, A701V, and T716I) were designed to detect target mutants ( Table 1 ). The lineage and CDC classification information of each primer are shown in S1 Table. All target mutants were efficiently included in S gene coverage (Table 1, Fig 1, and S1 Table) . The analytical performance of the 5-SSG primers was confirmed using a total of 67 strains, including viruses, bacteria, and fungi. The PCR results were determined to be positive or negative based on the expected PCR product sizes (Tables 1, 2 and S2). As shown in Tables 2 and S2, the 5-SSG primer pairs achieved consistent results for twenty-two strains of SARS-CoV-2, whereas a negative result was obtained for the remaining 45 stains (other viruses, bacteria, and fungi). Analytical sensitivity, positivity rate, and LOD were estimated using 25 replicates of positive strains (Zeptometrix-0810590CFHI) at six different concentrations, from approximately 1.4 × 10 −2 to 1.4 × 10 3 TCID 50 /mL (Table 3) . Results (probit analysis) showed that the 95% LOD was approximately 3.7 × 10 1 TCID 50 /mL for 69S, 9.8 × 10 1 TCID 50 /mL for 144S, 6.6 × 10 1 TCID 50 /mL for 417S, 3.9 × 10 1 TCID 50 /mL for 484S, and 7.2 × 10 1 TCID 50 /mL for 570S (Table 3) . Assay results showed 100% reproducibility for all 5-SSG primer pairs, even for concentrations of as low as approximately 1.4 × 10 2 TCID 50 /mL. The LOD was approximately 1.4 × 10 1 TCID 50 /mL, except for in the 69S and 484S assays, which were 10 times more sensitive than the 144S, 417S, and 570S assays. The 5-SSG primer-PCR reactions performed using ten-fold diluted positive samples. The LOD 95% data were estimated using the probit regression analysis. Abbreviations: Conc., concentration; TCID 50 , median tissue culture infective dose; LOD, limit of detection. https://doi.org/10.1371/journal.pone.0260850.t003 As shown in Table 2 , the nucleotide sequences of the positive PCR products obtained from 22 strains were compared with the existing S mutations through the Sanger Sequencing method using the M13 primer. The key deletion mutations ΔH69/V70 and ΔY144 were found in four strains (Twistbio-601443, Twistbio-7105258, NCCP-43381, and NCCP-43386). In addition, other substitution mutations were found to be 100% consistent with those in each strain's corresponding lineage, except for two cases in which substitutions at T95I for the NCCP-43390 strain and W152C for the NCCP-43384 strain were mismatched in the CDC classification (Figs 2 and S1, and S1 Table) . Overall, it was confirmed through Sanger sequencing that the 5-SSG primers can detect predominant S gene mutations of SARS-CoV-2 observed in the major mutant strain categories, VOIs and VOCs, with high sensitivity and efficiency. SARS-CoV-2 mutants have been genetically characterized using NGS-based lineages [13, 29] . To confirm the detection accuracy of the 5-SSG primer assay developed in this study for the SARS-CoV-2 mutants, NGS analysis results were used for a comparison. The NGS assay identified three strains (Twistbio-710528 The remaining ten strains (NCCP-43330, NCCP-43331, NCCP-43342, NCCP-43343, NCCP-43344, NCCP-43345, NCCP-43383, Zeptometrix-0810587CFHI, Zeptometrix-0810589CFHI, and Zeptometrix-0810589CFHI) were genetically classified into another lineage ( Table 4) . Results of NGS and Sanger sequencing using the 5-SSG primers showed 100% consistency for all strains, including T95I for the NCCP-43390 strain and W152C for the NCCP-43384 strain (Table 4) . Taken together, the 5-SSG primer assay is very efficient in detecting SARS-CoV-2 major S gene mutant strains. To confirm the detection accuracy of the 5-SSG primer assay using clinical samples, Sanger sequencing results were compared with those of NGS analysis ( Table 5 ). The results of VOCs (Table 5) . NGS assays and Sanger sequencing using the 5-SSG primers showed 100% consistent results for all strains. We concluded that the 5-SSG primer assay also had a very efficient performance with clinical samples. In the ongoing COVID-19 pandemic, it has been demonstrated that the rapid detection of the pathogen is critical to prevent the rampant spread of the disease [30] . The emergence of SARS-CoV-2 variants, which are associated with increased transmission, disease severity, and resistance to vaccines, is a grave concern [31] . The alpha (B.1.1.7) and beta (B.1.351) lineages of SARS-CoV-2, which account for 98.7% of total variant cases, contain the mutations ΔH69/ V70, E484K, and N501Y [32] . S protein-based vaccines might provide less protection against these mutants (ΔH69/V70, E484K, and N501Y) of SARS-CoV-2 [33] . Therefore, a simple and rapid screening assay to monitor the emergence and spread of these variants is essential to implement public health strategies [31] . In this study, we developed primers for the rapid and accurate detection of the key mutants of the S gene of SARS-CoV-2 and evaluated the reliability and reproducibility of these primers (Tables 2 and 3 ). The 5-SSG primers (69S, 144S, 417S, 484S, and 570S) had high analytical specificity for SARS-CoV-2 strains and no cross-reactivity with other strains (Tables 2 and S2) . Results of Sanger sequencing using 5-SSG primers and commercial NGS were in 100% agreement; however, the three approaches differed in their ability to detect the E484K and D215G variants of the beta (B.1.351) lineage, E484K of the gamma (P.1) lineage, and G142D of the delta (B.1.617.2) lineage (Tables 4 and 5 ). These results indicate that in NGS analysis, lowdepth levels of mutants (G142D, D215G, and E484K) are detected, because the target amplification is affected by a mutation in the reverse primer binding site (ΔE156/F157, R158G, ΔLAL242-244, and N501Y). In addition, NGS is limited to the environment in which the equipment is built, and it also takes a longer as it is more complex than typical Sanger sequencing [34] . Therefore, the Sanger sequencing-based 5-SSG primer assay system can rapidly and accurately detect key mutants of the S gene without resource constraint, and is a useful tool that can overcome the limitation of relatively low read-depth caused by mutations in primerbinding site during NGS analysis. One limitation of this study is that the performance of the 5-SSG primers was tested using small numbers of clinical samples through Sanger sequencing and NGS analysis, and thus, further studies using a larger number of clinical samples should be performed. In addition, the current 5-SSG primer system can identify lambda (C.37) variants with the 417S primer set, but the ΔRSYLTPGD246-253N mutation affects the 144S reverse primer. Therefore, improvements in primer performance for detection of additional variants (e.g. B. 1.617.3 and B. 1.621) and the development of new primers should be pursued in future studies. Collectively, the 5-SSG primer assay system has high PCR sensitivity specifically for SARS-CoV-2 and is a useful tool that can detect various S gene mutants very quickly and accurately, thereby contributing to the faster control of pathogen transmission in the population. Cutaneous manifestations of SARS-CoV-2 infection: a clinical update Epidemiology, causes, clinical manifestation and diagnosis, prevention and control of coronavirus disease (COVID-19) during the early outbreak period: a scoping review Computational analysis of microRNA-mediated interactions in SARS-CoV-2 infection Genomic characterization of a novel SARS-CoV-2 Structural and functional properties of SARS-CoV-2 spike protein: potential antivirus drug development for COVID-19 The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity Trends of mutation accumulation across global SARS-CoV-2 genomes: Implications for the evolution of the novel coronavirus SARS-CoV-2 Variants of Interest and Concern naming scheme conducive for global discourse Emerging Variants of SARS-CoV-2 And Novel Therapeutics Against Coronavirus (COVID-19) Centers for Disease Control and Prevention: SARS-CoV-2 Variant Classifications and Definitions-03 Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant A pneumonia outbreak associated with a new coronavirus of probable bat origin Next generation sequencing of SARS-CoV-2 genomes: challenges, applications and opportunities Whole Genome Sequencing of SARS-CoV-2: Adapting Illumina Protocols for Quick and Accurate Outbreak Investigation during a Pandemic PCR-Based Detection Methods for Single-Nucleotide Polymorphism or Mutation: Real-Time PCR and Its Substantial Contribution Toward Technological Refinement DiSNPindel: improved intra-individual SNP and InDel detection in direct amplicon sequencing of a diploid Phylo-geo-network and haplogroup analysis of 611 novel coronavirus (SARS-CoV-2) genomes from India Severe Acute Respiratory Syndrome Coronavirus 2 Genomes from the Houston Metropolitan Area Identifies the Emergence and Widespread Distribution of Multiple Isolates of All Major Variants of Concern Improving the PCR protocol to amplify a repetitive DNA sequence PrimerStation: a highly specific multiplex genomic PCR primer design server for the human genome NCBI BLAST: a better web interface Prospective evaluation of a new automated nucleic acid extraction system using routine clinical respiratory specimens PROBIT: a computer program analysis Synergistic Activity of Niclosamide in Combination With Colistin Against Colistin-Susceptible and Colistin-Resistant Acinetobacter baumannii and Klebsiella pneumoniae Centers for Disease Control and Prevention: SARS-CoV-2 Variant Classifications and Definitions-23 Evaluation of NGS-based approaches for SARS-CoV-2 whole genome characterisation Detection of R.1 lineage severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) with spike protein W152L/E484K/G769V mutations in Japan Genomic surveillance of SARS-CoV-2 in the Republic of Congo Ultrasensitive and visual detection of SARS-CoV-2 using all-in-one dual CRISPR-Cas12a assay A Simple RT-PCR Melting temperature Assay to Rapidly Screen for Widely Circulating SARS-CoV-2 Variants Variants: distribution of cases data Neutralization of SARS-CoV-2 spike 69/70 deletion, E484K and N501Y variants by BNT162b2 vaccine-elicited sera Turnaround Time of Plasma Next-Generation Sequencing in Thoracic Oncology Patients: A Quality Improvement Analysis We would like to thank Editage for English language editing. We also thank the National Culture Collection for Pathogens for kindly providing fifteen strains of SARS-CoV-2 as resources