key: cord-0901280-8hssbnq5 authors: Ahouidi, Ambroise D.; Rodgers, Mary A.; Padane, Abdou; Leye, Nafissatou; Olivo, Ana; Mbow, Moustapha; Mboup, Aminata; Diaw, Papa Alassane; Dia, Aminata; Harris, Barbara; Padane, Yacine Amet Dia; Lo, Gora; Meyer, Todd V.; Diedhiou, Cyrille K.; Diagne, Diabou; Kane, Ndeye Coumba Toure; Cloherty, Gavin; Mboup, Souleymane title: Emergence of novel combinations of SARS-CoV-2 spike receptor binding domain variants in Senegal date: 2021-12-08 journal: Sci Rep DOI: 10.1038/s41598-021-02874-z sha: 9a8abbb30eaeb56023140ae0acc3dfb25c01db03 doc_id: 901280 cord_uid: 8hssbnq5 The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) lineages that carry mutations in the spike gene are of concern for potential impact to treatment and prevention efforts. To monitor for new SARS-CoV-2 mutations, a panel of specimens were sequenced from both wave one (N = 96), and wave two (N = 117) of the pandemic in Senegal by whole genome next generation sequencing. Amongst these genomes, new combinations of SARS-CoV-2 spike mutations were identified, with E484K + N501T, L452R + N501Y, and L452M + S477N exclusively found in second wave specimens. These sequences are evidence of local diversification over the course of the pandemic and parallel evolution of escape mutations in different lineages. Ongoing viral evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) threatens the efficacy of our strongest defenses against coronavirus disease : vaccines, therapeutics, and diagnostics. To keep pace with continual viral diversification, molecular surveillance serves as a critical alert system for identifying new strains to evaluate for potential immune or diagnostic escape. Most recently, the identification of SARS-CoV-2 lineages of concern, B.1.1.7 (alpha), B.1.351 (beta), P.1 (gamma), and B.1.617.2 (delta), immediately preceded their rise in prevalence and global spread [1] [2] [3] [4] . Subsequent reports have demonstrated that increased transmissibility and immune escape are linked to these lineages, which are defined by spike receptor binding domain (RBD) mutations, including N501Y, K417N/T, L452R, and E484K. Notably, the E484K and L452R mutations in RBD had previously been demonstrated to confer immune escape in cell culture selection experiments 5 , which is consistent with their increasing prevalence 6,7 , possibly due to increased viral fitness 8, 9 . Therefore, vigilant monitoring of circulating strains for these mutations is of critical importance for potentially preventing their spread. The SARS-CoV-2 pandemic in Senegal has surged in several waves occurring in March-November of 2020 (wave 1), December 2020-March 2021 (wave 2) and July-September 2021 (Wave 3). The first variant of concern that was reported in Senegal was B.1.1.7, which was first identified in a patient who was diagnosed on December 30th, 2020 during the second wave 10 . To compare the SARS-CoV-2 strains circulating during the first two waves of the pandemic in Senegal, a panel of 150 first wave and 150 s wave leftover nasopharyngeal specimens in viral transport media (VTM) were collected in a study approved by the Ethical Committee of the Ministry of Health of Senegal (000129/MSAS/CNERS). VTM specimens were sequenced by next generation sequencing (NGS) using a metagenomic approach with probe enrichment (xGen) and analysis on an Illumina HiSeq 11 . Genomes were assembled using BLAST and sequence NC_045512 as a reference, followed by clade assignment and mutation analysis with the NextClade tool (clades.nextstrain.org) and lineage assignments with the Pangolin tool 12 . Genome coverage of > 60% was achieved for N = 213 specimens (N = 96 first wave, N = 117 s wave), with an average coverage depth of 43,006x (GISAID accession numbers EPI_ISL_1630259-1630270). The first wave genomes fell into 3 clades: 19B (N = 3), 20A (N = 78), and 20B (N = 15), similar to the composition of strains in other countries around the same time period 13 . In Pangolin nomenclature 14 , nine lineages were present in the first wave, which was predominated by B.1.416 (57/96, 59.4%, Fig. 1A ). Viral diversity increased greatly in wave two with genomes from 9 clades present: 19A (N = 1), 19B (N = 11), 20A (N = 108), 20B (N = 81), 20C (N = 3), 20D (N = 1), 20E (N = 1), 20G (N = 1), and 20I (N = 1). Increased diversity of Pangolin lineages was also observed in the second wave, with 20 lineages identified, the majority of which were not present in the first wave (Fig. 1A ). Fig. 1B) , confirming a widespread distribution in western Senegal. The earliest B.1.1.7 infection in this study was diagnosed on December 21st in Thies, which predates the first case previously identified Senegal 10 . The December 21st patient was a patient who was tested due to contact with an infected person, suggesting that B.1.1.7 was already circulating in Senegal in early December. The remaining 5 B.1.1.7 cases were all diagnosed in early January during the exponential phase of the second wave spike in cases. Escape mutations in the spike protein were absent from wave one but were present in 4% (5/117) of all wave two infections (Fig. 1B, Supplemental Table 1 ). Additional details for all genomes with mutations of concern at position 501, 484, and/or 452 in the spike RBD are shown in Fig. 1B . When classified by clade, all of the L452R mutations were exclusively found in 19B clade genomes whereas the L452M mutation appears to have emerged in wave two in the 20A clade (Supplemental Table 1 ). In addition to strains carrying L452R individually, variant strains carrying a combination of L452R + N501Y (3/117, 2.6%) were also identified. The N501Y mutation confers higher affinity for the ACE2 receptor and is present in several variants of concern (alpha, beta, gamma) while L452R is a signature escape mutation found in the delta and epsilon lineages that also increases infectivity 4, 8, 9, 15, 16 . The combination of both of these mutations in one strain is of concern for potential rapid spread of an immune escape variant. All three of the genomes carrying the L452R/N501Y combination belonged to the A.27 lineage (clade 19B) and did not encode the D614G mutation that predominates most global infections today. Likewise, the other lineage defining mutations for variants of concern were absent in the A.27 genomes, with the exception of L18F and H655Y, which are both present in the gamma lineage (Fig. 1C) . While 13 common single nucleotide polymorphisms (SNPs) were identified for this lineage, each individual genome had unique SNPs as well, suggesting they were not transmission linked cases. The three patients who had A.27 infections were diagnosed in the Almadie district of Dakar in December 2020 and ranged in age from 36 to 55 (Fig. 1B) . In addition to the L452R + N501Y double mutant, a single genome was identified that carried a unique combination of E484K + N501T spike RBD mutations in a B.1 lineage genome (clade 20C) with D614G also present. This lineage has been provisionally named B.1.501T.V1 (Fig. 1C) . The patient who was infected with this variant strain was a patient who was diagnosed in December 2020 in Diamniadio (Fig. 1B) . While E484K confers escape from neutralizing antibodies 17, 18 , the N501T mutation enhances the spike receptor binding domain (RBD) affinity for ACE2 in vitro and is predicted to enhance transmissibility, similar to N501Y 19, 20 . Strains harboring N501T first emerged in August of 2020 in Northern Italy 6 and the N501T mutation has been found recently in an emerging Brazilian lineage that differs from B.1.501T.V1 21 . Alarmingly, N = 2122 N501T strains were posted to GISAID from specimens collected in the months that followed the identification of this specimen in Senegal (January-April 2021) from countries in Africa, Europe, Asia, North America, and South America (GISAID, date of accession April 18th, 2021) 6 . Altogether, these trends suggest that convergent evolution around the world is leading to mutations at spike positions E484 and N501 in many lineages, suggesting a possible increased fitness for viruses carrying these mutations. Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: Insights from linking epidemiological and genetic data Detection of a SARS-CoV-2 variant of concern in South Africa Genomics and epidemiology of a novel SARS-CoV-2 lineage in Manaus SARS-CoV-2 spike mutations, L452R, T478K, E484Q and P681R, in the second wave of COVID-19 in Maharashtra Complete mapping of mutations to the SARS-CoV-2 spike receptor-binding domain that escape antibody recognition Data, disease and diplomacy: GISAID's innovative contribution to global health CoVariants: SARS-CoV-2 mutations and variants of interest SARS-CoV-2 spike L452R variant evades cellular immunity and increases infectivity Molecular definition of severe acute respiratory syndrome coronavirus 2 receptor-binding domain mutations: Receptor affinity versus neutralization of receptor interaction First detection of the British variant of SARS-CoV-2 in Senegal SNP and phylogenetic characterization of low viral load SARS-CoV-2 specimens by target enrichment Lineage assignment in an emerging pandemic as an epidemiological tool Nextstrain: Real-time tracking of pathogen evolution A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology New variant of SARS-CoV-2 in UK causes surge of COVID-19 Transmission, infectivity, and neutralization of a spike L452R SARS-CoV-2 variant Escape of SARS-CoV-2 501Y.V2 from neutralization by convalescent plasma Antibody evasion by the P.1 strain of SARS-CoV-2 Receptor recognition by the novel coronavirus from Wuhan: An analysis based on decade-long structural studies of SARS coronavirus Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding We gratefully acknowledge the authors, originating and submitting laboratories of the genetic sequence and metadata made available through GISAID. This Project was funded by Abbott. The authors declare no competing interests. Supplementary Information The online version contains supplementary material available at https:// doi. org/ 10. 1038/ s41598-021-02874-z.Correspondence and requests for materials should be addressed to A.D.A. or M.A.R. Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.