key: cord-0842313-vw2ijtj6 authors: Yavarian, Jila; Nejati, Ahmad; Salimi, Vahid; Shafiei Jandaghi, Nazanin Zahra; Sadeghi, Kaveh; Abedi, Adel; Sharifi Zarchi, Ali; Gouya, Mohammad Mehdi; Mokhtari-Azad, Talat title: Whole genome sequencing of SARS-CoV2 strains circulating in Iran during five waves of pandemic date: 2022-05-02 journal: PLoS One DOI: 10.1371/journal.pone.0267847 sha: 5d66db5df2afb23287caa02bd731d7b767cc5cb2 doc_id: 842313 cord_uid: vw2ijtj6 PURPOSE: Whole genome sequencing of SARS-CoV2 is important to find useful information about the viral lineages, variants of interests and variants of concern. As there are not enough data about the circulating SARS-CoV2 variants in Iran, we sequenced 54 SARS-CoV2 genomes during the 5 waves of pandemic in Iran. METHODS: After viral RNA extraction from clinical samples collected during the COVID-19 pandemic, next generation sequencing was performed using the Nextseq platform. The sequencing data were analyzed and compared with reference sequences. RESULTS: During the 1st wave, V and L clades were detected. The second wave was recognized by G, GH and GR clades. Circulating clades during the 3rd wave were GH and GR. In the fourth wave GRY (alpha variant), GK (delta variant) and one GH clade (beta variant) were detected. All viruses in the fifth wave were in clade GK (delta variant). There were different mutations in all parts of the genomes but Spike-D614G, NSP12-P323L, N-R203K and N-G204R were the most frequent mutants in these studied viruses. CONCLUSIONS: These findings display the significance of SARS-CoV2 monitoring to help on time detection of possible variants for pandemic control and vaccination plans. In December 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV2) was detected in Wuhan, China which has known as the cause of coronavirus disease 2019 (COVID-19) pandemic [https://covid19.who.int/]. To date, Iran has experienced five waves of the pandemic since the first detection of SARS-CoV2 on 19 February 2020 in Iran [1] . As of 16 November 2021, 253,640,693 cases of SARS-CoV2 with 5,104,899 deaths were reported worldwide and in Iran 6,045,212 laboratory confirmed cases with 128,272 deaths reported [https://covid19.who.int/]. SARS-CoV2 is a virus in coronaviridae family with a single stranded RNA. The genome is about 30 kb and consists of genes encoding multiple non-structural, structural and accessory proteins. The non-structural proteins include NSP1-16 which are necessary for virus transcription and replication. These proteins are encoded by ORF1ab with about 21300 nucleotides length [2] . NSP1 helps virus to evade innate host antiviral response and promote viral growth. NSP2 can bind to two host proteins prohibitin 1 and prohibitin 2 that are recognized to be important in cell migration, cell cycle progression, apoptosis, cellular differentiation, and mitochondrial biogenesis [3] . NSP3 is a papain-like protease and also it can suppress host protein synthesis. NSP3 and NSP4 with other cofactors are important for virus replication by prompting membrane rearrangement. NSP5 is a cysteine-like protease which is the virus main protease and cleaves NSP4-NSP16. NSP6 is involved in autophagy. NSP7 and NSP8 form complex with NSP12 for viral replicase machinery. NSP7 has primer-independent RNA polymerase activity. NSP8 has primase activity-Primase is an enzyme that synthesizes primers during replication. NSP9, an RNA-binding protein, in complex with NSP8, is involved in RNA replication and virulence. NSP10 is a cofactor for the 2 0 O-methyltransferase activity of NSP16 that increases evasion of the innate immune system, and the N7-guaninemethyltransferase/exoribonuclease activities of NSP14. NSP11 and NSP15 are involved in endoribonuclease activity and essential for replication. NSP12 has RNA polymerase activity. NSP13 has helicase and NTPase activity [2] . The structural proteins include spike (S), envelope (E), membrane (M) and nucleoprotein (N). Spike with a length of 1273 amino acid (aa), mediates attachment and entry. Envelope (75 aa) is a small membrane protein and important in virus infectivity. Membrane (222 aa) is important in virion morphogenesis and nucleoprotein (419 aa) is a viral genome packaging protein [2] . There are nine accessory proteins encoded by ORF3a, 3b, 6, 7a, 7b, 8, 9a, 9b and 10, which seem to be important in virus pathogenesis [2] . During virus replication, mutations can occur which might lead to alteration in protein functions and virus transmission and pathogenesis. Nowadays next-generation sequencing (NGS) is an effective method to identify different mutations and new variants for epidemiological and surveillance studies. SARS-CoV2 has different variants known as variants of concern (VOC), variants of interests (VOI) and variants under monitoring (VUM). It has also classified to different clades that currently 9 clades have been recognized based on markers mutations of the genome: S, L, V, G, and later of G into GH, GR and GV, and more recently GR into GRY and GRA [4] . Different variants of the SARS-CoV2 have been identified during the pandemic. Some spread worldwide, while others quickly faded away. Identification of the circulation of variants in a society is important, therefore, we set up NGS for SARS-CoV2 detected during five waves of pandemic for better understanding of circulation of different variants, genetic diversity and mutations in all non-structural, structural and accessory genes. Throat swab specimens from COVID-19 suspected patients were sent for sequencing to National Influenza Center (NIC) located at Virology Department, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran. After primary detection by Real time PCR the samples from different parts of the country with ct value lower than 25 were selected for NGS. Iran experienced the 1st wave of SARS-CoV2 pandemic in February until May 2020. The 2nd wave began in late June until September 2020. The 3rd wave occurred during October to December 2020. The 4th wave started on early April until June 2021 and the 5th wave was from August until October 2021. This study was approved by ethics committee of National Institute for Medical Research Development, Tehran, Iran (IR.NIMAD.REC.1399.119). After viral RNA extraction using High Pure Viral Nucleic Acid kit (Roche,Germany) according to the manufacturer's instruction, cDNA synthesis was performed. Library construction was done by using the Nextera DNA Flex kit (Illumina,USA), then used for hybridization using Respiratory Virus Oligo Panel kit (Illumina,USA). This was followed by bead-based capture of hybridized probes, amplification and clean-up. After quality control assessment for library concentration by Qubit (Thermo Fisher,USA), sequencing was performed using Next-Seq 550 machine (Illumina,USA). For data analysis, all collection of reads were mapped against the reference genome assembly of SARS-CoV2. We had high quality of assembled full viral genome coverage without undetermined nucleotides. The assembled genomes were analyzed using CoVsurver mutations App in GISAID [4] and aligned by BioEdit sequence alignment software. After sequence alignment, a phylogenetic tree was drawn by neighbor joining method (bootstrap 1000) using MEGA v7 software. Then all 54 Iranian full genome SARS-CoV2 of different waves were submitted to GISAID with accession numbers: EPI-ISL-1014676-87, EPI-ISL-959275-84, EPI-ISL-862075-81, EPI-ISL-1993547-557, EPI-ISL-2360250-57, EPI-ISL-4803556-58, EPI-ISL-4803554, EPI-ISL-4803538 and EPI-ISL-4803528. Fifty four COVID-19 confirmed cases were subjected to NGS selecting 10, 10, 9, 20 and 5 samples from 1st, 2nd, 3rd, 4 th and 5th waves respectively. Here we analyzed the mutations of each genes separately by comparing with hCoV-19/Wuhan/WIV04/2019 in GISAID. The mutations of NSP genes are illustrated in the Spike. As shown in Table 1 , amino acid substitutions in S glycoprotein were increased with pandemic progression. During the 1st wave there was no special amino acid substitutions on the S glycoprotein but viruses detected during the 4th and 5 th waves had the most frequent amino acid substitutions. It should be noted that D614G was detected in all viruses after the 1st wave. Envelope. No mutation was found in viruses studied during the 1st, 2nd, 3rd and the 5 th waves but in two samples in the 4 th wave there were mutations encoding the amino acid substitution P71L in one sample belonged to beta variant and S68F in one sample belonged to alpha variant. Membrane. During the 1st and the 2nd waves, there were no mutations in M, but a mutation causing the amino acid substitution I73M was detected in three samples during the 3rd wave. Four samples during the 4 th wave and 4 samples of the 5 th wave had I82T which all these 8 samples belonged to delta variant. Nucleoprotein. During the 1st wave, one sample had N192K in N and two samples had A35V. During the 2nd wave, two samples had D3Y and M234I, three samples had S194L substitutions. One sample had P13T and S194L. Another sample had A220V and P326L. hCoV-19/Iran/Gilan-NICS1-58/2020 along with four viruses during the 3rd wave, had R203K and G204R. hCoV-19/Iran/Tehran-05V/2020 had P162S beside R203K and G204R. One sample during the 3rd wave, had S201I. Three samples had S194L along with another substitution including G34W, P383L and P13T. During the 4 th wave, 13 samples of alpha variant had D3L, R203K, G204R and S235F. One sample of alpha variant beside these four substitutions had also L219F. Seven samples belonged to delta variant had D63G, D377Y, G215C and R203M. One sample of delta variant during the 5 th wave had D63G, R203M and D377Y. One sample had just R203M and D377Y and one sample had D63G, R203M and G215C. The only beta variant had T205I and T362I. First wave. One sample had T151I in NS3, another sample had A105V in NS7a and two samples had T40I in NS7b. Second wave. In NS3, five samples had Q57H and one sample had G18V along with Q57H. One sample had G254V and one sample had T223A and V112F, therefore during the 2nd wave eight samples had just mutation in NS3 among all accessory proteins. Third wave. In NS3, two samples had Q57H the same as five samples in the 2nd wave. Two more samples had one more substitution along with Q57H in NS3, including W131C and T223I. One sample had just N58G in NS3. Three samples had a deletion at G66 (G66del) and S67 (S67del) and K68E in NS8. One sample had T223I and L4F in NS3. Fourth wave. During the fourth wave, amino acid substitutions detected in NS3 were W131C, S171L, G100C, D238Y, S26L, Q57H and S171L in different samples. There was just W27L in NS6 in one sample. T39I, T120I, V82A and S83L were detected in NS7a. NS7b had T40I and a stop codon at E39 (E39stop). NS8 had the most frequent amino acid substitutions in viruses circulating during the 4 th wave including stop codons at K68 (K68stop) and Q27 (Q27stop) and R52I, V62L, Y73C, C90F. These stop codons at NS7b and NS8 may lead to producing the truncated proteins. During the 5 th wave, amino acid substitutions were detected in NS3a, NS7a and NS7b which K16T, S26L and L41F in NS3a, L49P, A50D, D51H, T120I, V82A in NS7a and T40I in NS7b were found. Analyses showed that during the 1st wave clades V and L were circulating which V was dominant. During the 2nd wave, GH was dominant, but G and GR were also detected. Surprisingly one V clade was also detected in the 2 nd wave. GH and GR were the clades during the 3rd wave. During the 4 th wave, 14 viruses of alpha variant were belonged to GRY, five delta variants were in GK clade and one beta variant was in GH clade and finally five delta variants of the 5 th wave were in GK clade. The Neighbor joining method was used with 1,000 bootstrap replicates. The tree contains 54 SARS-CoV2 sequences of these study compared to the reference sequence from GISAID and some other sequences from each clade. In this tree the reference sequence is marked by white circle and sequences of this study were marked as follow: The 1st wave black circle, the 2nd wave inverted black triangle, the 3rd wave with black square, the 4th wave with black triangle and the 5th wave with black diamond. https://doi.org/10.1371/journal.pone.0267847.g002 In this study, we reported the circulation of distinct clades of SARS-CoV2 during the five waves in Iran. The V clade was found during the 1st wave. Clade V is characterized by NSP6-L37F plus NS3-G251V. At the end of the first wave, one sample detected in clade L (reference clade). The 2nd wave was recognized by G, GH and GR clades. G clade shows D614G in spike which is one of the most important substitutions in spike. Clade GH has S-D614G plus NS3-Q57H substitutions and clade GR characterized by S-D614G and N-G204R. One sample belonged to the V clade was also detected in the 2 nd wave. Circulating clades during the 3rd wave were GH and GR. During the fourth wave, GH (1 beta), GRY (alpha) and GK (delta) clades were detected. GRY clade (alpha) has S-H69del, S-V70del, S-Y144del, S-N501Y plus S-D614G and N-G204R. All viruses in the 5th wave were in clade GK (S-D614G plus S-T478K). In different clades NGS of 54 samples selected during the five waves in Iran showed important mutations in the different parts of the genome which all were analyzed and compared with circulating variants worldwide. The effects of important mutations were discussed here. ORF1ab is more than two thirds of the SARS-CoV2 genome with 21,290 nucleotides at the 5' end which encodes 16 non-structural proteins (NSP1-NSP16). Among these NSPs, NSP3 had the highest number of mutations. NSP3 is important for virus replication and it can suppress host protein synthesis, then amino acid substitutions in NS3 deserve greater study in vitro. Suppression of type I interferon (IFN) response is a consequence of infection of SARS--CoV2. NSP5, main protease, is an IFN antagonist. NSP5 variant K90R seen in SARS-CoV2 retained the IFN-antagonizing activity. The suppressive outcome of NSP5 on IFN-β gene transcription induced by IKK�, TBK1, RIG-I and MAVS suggested that NSP5 probably involved at a stage downstream of IRF3 phosphorylation in the cytoplasm [5] . In this study K90R was detected just in one sample during the 4 th wave while it has been more often found in Icelandic and Chinese strains [6] . NSP6 is important for viral assembly, viral protein folding and replication. NSP6-L37F leads to asymptomatic transmission and reduced virulence which we can see in the most of the viruses detected during the 1st wave and one in the 2nd wave [7] . In hCoV-19/Iran/Gilan-NIC230/2020, E50G in NSP7 was detected in the 2nd wave which is important in immune response as shown in a study that amino acid residues 36-50 (HNDILLAKDTTEAFE) of NSP7 and also NSP13 and N are SARS-CoV2 specific T cell epitopes recognized by CD8 T cells [8] . NSP12 is necessary for the replication/transcription of the SARS-CoV2 genome, and this protein is considered as a target for the treatment of COVID-19. The P323L substitution of NSP12, could induce structural changes and adverse effect on proofreading during the replication of the virus. Meanwhile, the P323L is located in a pocket that might be the site of drug function [9] . In our research NSP12-P323L located in the NSP8 binding cleft [10] was detected after the first wave. NSP12-P323L is the most common detected substitution with increasing in occurrence over time [11] . A study by Mohammad., et al. showed that remdesivir has higher binding affinity to NSP12 with P323L than the wild-type RdRp, therefore they suggested using remdesivir in patients infected with SARS-CoV2 carrying P323L in NSP12 [12] . In this study there was no virus seen to contain F480, V557 and S861 in NSP12, which have shown to be important in decreasing susceptibility to remdesivir [11] . As all viruses studied in this research, did not have any mutations in the genes for NSP10 and NSP11, therefore these two nonstructural proteins could be considered as good targets for diagnosis and targets for therapeutic agents. Furthermore, there were some mutations in other NSPs with that of unknown importance. The effect of these substitutions need to be studied in vitro. After the 1st wave, all viruses studied in this research had S-D614G. Studies showed that the S-protein amino acid substitution, D614G, evolved concurrently with P323L in NSP12. NSP12-P323L might decrease the replication of viral RNA and therefore increase the probability of asymptomatic infections and/or change in incubation period. Meanwhile the infectivity-enhancing D614G substitution in S protein could compensate this reduction in virus replication and help virus transmission even by asymptomatic individuals [13] . Flores-Alanis., et al. showed that D614G substitution in S protein, P323L in NSP12, and R203K and G204R in N protein had a substantial association with the disease severity [9] . Another study showed that D614G in the S protein increased the infectivity of SARS-CoV2 and associated with lower RT-PCR cycle thresholds. It showed high viral load in the upper respiratory tract, but not enhanced disease severity. Also they showed that G614-bearing virions are not intrinsically more resistant to neutralization by convalescent sera [14] . G614 of S protein is responsible for making the virus 2.4 times more contagious with higher viral load [9, 15] . hCoV-19/Iran/Tehran-055M/2020 in the 2nd wave had D614G and L5F. One study showed that variants with D614G and L5F had increased infectivity [16] . Evaluation of the mutations which lead to amino acid substitutions, showed that Spike-D614G, NSP12-P323L, N-R203K and N-G204R were the most frequent amino acid substitutions in these studied viruses and also worldwide which we could consider that they may increase SARS-CoV2 transmissibility as the pandemic progressed. The S protein is crucial factor for the entry of SARS-CoV2 to the host cell which interacts with the angiotensin-converting enzyme 2 (ACE-2) receptor through its receptor binding domain (RBD) [17] . The S477N is the part of an epitope recognized by human neutralizing antibodies and located in the RBD. A study showed that S477N increases the affinity for the ACE-2 receptor [9, 18] . Singh., et al. showed that S477N strengthen the binding of SARS--COV2 spike to the ACE-2 receptor [19] . In this study, two samples had S477N which both belonged to GR clade. Among the variations in the alpha variant, S-N501Y, is in the receptor binding site which was shown to increase the binding of SARS-CoV2 to the ACE-2 host receptor, leading to increased viral fitness and transmission [20] [21] [22] . One study showed that N439K, N501Y and S477N significantly reduced the neutralization activity of some monoclonal antibodies [23] . Ostrov showed that B.1.1.7 with N501Y had increased affinity for ACE-2 and A570D, D614G and S982A substitutions might increase virus fusion by decreasing the intermolecular stability of S1 and S2 [24] . The delta variant (GK clade) has ten amino acid substitutions, T19R, G142D, 156del, 157del, R158G, L452R, T478K, D614G, P681R and D950N in the spike protein [25] . T19R removes an N-glycosylation site at position 17 that might also affect antigenic properties. A study by Li., et al. showed that variants with L452R were resistant to some neutralizing antibodies [16] . Tchesnokova., et al. showed that L452R might result in stronger binding to the ACE-2 and escape from neutralizing antibodies [26] . Overall L452R reduces sensitivity to neutralizing antibodies, increases viral infectivity, transmissibility, spike stability, ACE-2 binding affinity and viral fusogenicity, therefore it supports viral replication [27] [28] [29] [30] . Delta variant circulation was started at the end of the 4 th wave and all viruses detected during the 5 th wave in Iran were the delta variant. The beta variant (GH clade) includes L18F, D80A, D215G, R246I, K417N, E484K, N501Y, D614G, and A701V in the spike protein, which K417N, E484K, and N501Y are in the RBD and increase the binding affinity for the ACE-2 receptor and eventually the infectivity [31] . K417N and E484K have also essential role in viral escape from neutralizing antibodies [32] . Overall K417N, E484K and L452R are vaccine escape mutants [33] . The only beta variant of this study did not have L18F and R246I but it had a deletion of three amino acids 242-244 (L242del, A243del and L244del) in spike. L242del, A243del and L244del showed reduced sensitivity to some neutralizing antibodies [22] . The beta variant is reported to have an increased risk of transmission and reduced neutralization by monoclonal antibody therapy, convalescent sera, and post-vaccination sera [34] . Some studies showed that most serum samples from vaccinated people or patients recovering from COVID-19 have shown full or slightly decreased capacity to inactivate SARS-CoV2 variants, except for variants with N501Y, K417N and E484K substitutions [22, [35] [36] [37] . We had just one beta variant during the 4 th wave which had E484K in the S protein. As seen in different studies, E484K plays a crucial role in increasing virus transmission and decreasing antibody neutralizing titers [37] [38] [39] , thus continuous screening for emerging variants with substitutions such as E484K is necessary for public health. hCoV-19/Iran/Tehran-03/2020 in the 3rd wave had Q677H. Grabowski., et al. showed that E484K, F490S, S494P (in the RBD of spike) and Q677H and Q675H (in the vicinity of the polybasic cleavage site at the S1/S2 border) may limit efficiency of vaccines [40] . Liu., et al. showed that substitutions at residues T345, R346, K444, G446, N450, L452, S477, T478, E484, F486, and P499 were each related to the resistance to more than one monoclonal antibody, of which substitutions at S477 and E484 residues showed wide resistance [41] . Viruses with H69del and V70del had 2-fold higher infectivity compared to wildtype and also showed reduced neutralization sensitivity to mAb, targeting an as yet undefined epitope outside the RBD [42] . Sixteen samples of this study during the 4 th wave had H69del, V70del and also Y144del. In one study it was shown that viruses with Y144del in spike had decreased sensitivity to convalescent sera [16] . Li., et al. showed that, among spike mutations, the most characteristic ones are substitutions such as D614G, N501Y, Y453F, N439K/R, P681H, K417N/T, and E484K, and deletions of ΔH69/V70 and Δ242-244, which enhanced viral infectivity, transmissibility, and resistance to neutralization [43] . Some of these mutations were found after the 2nd wave of pandemic in Iran. P681R in the furin cleavage site may help in increased rate of membrane fusion, internalization and so better transmissibility which was found in delta variants in this study and worldwide [44] . Voss., et al. found that P681H of spike and S235F of nucleoprotein in the alpha variant changed the specificity of the corresponding epitopes [45] . During the 4 th wave, 14 samples had P681H and S235F. Mutations in the E gene were somewhat uncommon as in this study just two mutations in 2 different samples in the 4 th wave were detected which needs more investigation to find out their effects on virus life cycle. All viruses of delta variant in this study had I82T in M gene. Shen., et al. showed that this M gene substitution was more naturally fit, probably connected to glucose uptake during virus replication, and it is better to be included in genomic surveillance [25] . Nucleoprotein was mainly expressed in the initial stages of infection, and is important in viral RNA transcription and replication. Nucleoprotein has been shown to affect some basic cellular processes, inflammatory responses to upregulate the expression of the proinflammatory factor COX2, and it inhibits the innate immune responses in the host cell [46] . Therefore, amino acid substitutions in the nucleoprotein might have significant effect in immune response. R203K and G204R were the most common substitutions in the nucleoprotein in viruses studied in this research and worldwide. These mutations are important in disease severity [47] . There are some studies about the accessory gene's mutations and their impact on the virus cell cycle. The results of research by Wu et al., showed that Q57H and G251V in NS3a, S194L and R203K/G204R in N made changes in the structure of proteins and also had effect on the binding affinity of intraviral protein-protein interactions during assembly and release of coronavirus. So these changes might be associated with virus evolution and beneficial for the virus and its virulence [48] . Some viruses during the waves 2,3 and 4 had NS3a-Q57H. In primary human respiratory cells, viruses with NS3a-Q57H evade stimulation of chemokine, cytokine, and interferon-stimulated gene expression [49] . Some studies have described that NS3a can competently induce apoptosis in the cell and affect virus uptake and release [50, 51] . Besides ORF3, some mutations were detected in different accessory genes of viruses in this study which more research is needed to explore their importance in virus pathogenesis. For instance, NS8 supposed to suppress immune responses [52] , then substitutions of this protein can be imperative. Of special importance, stop codon mutations which lead to absence of NS8 can extend the duration of symptoms, then increase the virus transmissibility. Spike mutations such as D614G, HV69-70 del and L5F which affect the receptor binding affinity of the S protein and increase the virus transmission when associated with NS8 stop codons which extend the period of signs and symptoms might enhance the chance of transmission [53] . The combination of these variations were found especially in some viruses detected during the 4 th wave which should raise concern. In this study we did not have data on relationship between different SARS-CoV2 lineages and patient's signs and symptoms and disease severity which was an important limitation of this research. In conclusion, we detected different lineages of SARS-CoV2 contributing to all five waves and showed that all viruses circulating during the 5 th wave belonged to delta variant. We compared the mutations identified in our complete genomes study with those reported in GISAID. The findings of this study showed that with progression of the pandemic, the number of mutations were considerably increased which showed the adaptive evolution of SARS-CoV2 in human to increase transmissibility. Therefore genomic surveillance is an important tool to screen the progression of the COVID-19 pandemic. It should be noted that for variant detection, we partially sequenced S glycoprotein of more than 1000 samples with Sanger sequencer of which the results were compatible with NGS results during all 5 waves (unpublished data). First cases of SARS-CoV-2 in Iran, 2020: case series report Role of Structural and Non-Structural Proteins and Therapeutic Targets of SARS-CoV-2 for COVID-19 Severe acute respiratory syndrome coronavirus nonstructural protein 2 interacts with a host protein complex involved in mitochondrial biogenesis and intracellular signaling Global initiative on sharing all influenza data-from vision to reality SARS-CoV-2 main protease suppresses type I interferon production by preventing nuclear translocation of phosphorylated IRF3 Global SNP analysis of 11,183 SARS-CoV-2 strains reveals high genetic diversity. Transboundary and emerging diseases Decoding asymptomatic COVID-19 infection and transmission. The journal of physical chemistry letters SARS-CoV-2-specific T cell immunity in cases of COVID-19 and SARS, and uninfected controls Molecular epidemiology surveillance of SARS-CoV-2: mutations and genetic diversity one year after emerging Targeting SARS-CoV-2 Nsp12/Nsp8 interaction interface with approved and investigational drugs: an in silico structure-based approach Genetic conservation of SARS-CoV-2 RNA replication complex in globally circulating isolates and recently emerged variants from humans and minks suggests minimal pre-existing resistance to remdesivir Remdesivir MD Simulations Suggest a More Favourable Binding to SARS-CoV-2 RNA Dependent RNA Polymerase Mutant P323L Than Wild-Type. Biomolecules Concurrent mutations in RNA-dependent RNA polymerase and spike protein emerged as the epidemiologically most successful SARS-CoV-2 variant Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus One year of SARS-CoV-2: How much has the virus changed? The impact of mutations in SARS-CoV-2 spike on viral infectivity and antigenicity Usage of peptidases by SARS-CoV-2 and several human coronaviruses as receptors: A mysterious story Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding Serine 477 plays a crucial role in the interaction of the SARS-CoV-2 spike protein with the human receptor ACE2 Whole-genome sequencing of SARS-CoV-2 in the Republic of Ireland during waves 1 and 2 of the pandemic. medRxiv SARS-CoV-2 complete genome sequencing from the Italian Campania region using a highly automated next generation sequencing system SARS-CoV-2 501Y. V2 escapes neutralization by South African COVID-19 donor plasma The Antigenicity of Epidemic SARS-CoV-2 Variants in the United Kingdom Structural Consequences of Variation in SARS-CoV-2 B. 1.1. 7 Emerging variants of concern in SARS-CoV-2 membrane protein: a highly conserved target with potential pathological and therapeutic implications. Emerging microbes & infections Acquisition of the L452R mutation in the ACE2-binding interface of Spike protein triggers recent massive expansion of SARS-Cov-2 variants Epitope classification and RBD binding properties of neutralizing antibodies against SARS-CoV-2 variants of concern SARS-CoV-2 spike L452R variant evades cellular immunity and increases infectivity SARS-CoV-2 reinfection with a virus harboring mutation in the Spike and the Nucleocapsid proteins in Panama Transmission, infectivity, and neutralization of a spike L452R SARS-CoV-2 variant The impact of spike mutated variants of SARS-CoV2 [Alpha, Beta, Gamma, Delta, and Lambda] on the efficacy of subunit recombinant vaccines SARS-CoV-2 501Y. V2 (B. 1.351) elicits cross-reactive neutralizing antibodies Vaccine-escape and fast-growing mutations in the United Kingdom, the United States SARS-CoV-2 E484K Mutation Narrative Review: Epidemiology, Immune Escape, Clinical Implications, and Future Considerations. Infection and Drug Resistance Novavax vaccine delivers 89% efficacy against COVID-19 in UK-but is less potent in South Africa mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies Antibody resistance of SARS-CoV-2 variants B. 1.351 and B. 1.1. 7 Detection and characterization of the SARS-CoV-2 lineage B. 1.526 in New York. bioRxiv SARS-CoV-2 variant of concern 202012/01 has about twofold replicative advantage and acquires concerning mutations Identification of SARS-CoV-2 spike mutations that attenuate monoclonal and serum antibody neutralization SARS-CoV-2 evolution during treatment of chronic infection Recent progress on the mutations of SARS-CoV-2 spike protein and suggestions for prevention and controlling of the pandemic SARS-CoV-2 Spike Mutations, L452R, T478K, E484Q and P681R, in the Second Wave of COVID-19 in Maharashtra, India. Microorganisms Epitope-specific antibody responses differentiate COVID-19 outcomes and variants of concern The nucleocapsid protein of the SARS coronavirus is capable of self-association through a C-terminal 209 amino acid interaction domain Nucleocapsid mutations R203K/G204R increase the infectivity, fitness, and virulence of SARS-CoV-2 Effects of SARS-CoV-2 mutations on protein structures and intraviral protein-protein interactions Introduction of ORF3a-Q57H SARS-CoV-2 Variant Causing Fourth Epidemic Wave of COVID-19 The ORF3a protein of SARS-CoV-2 induces apoptosis in cells SARS-CoV-2 and ORF3a: Nonsynonymous Mutations, Functional Domains, and Viral Pathogenesis. mSystems Structure of SARS-CoV-2 ORF8, a rapidly evolving immune evasion protein SARS-CoV-2 variants combining spike mutations and the absence of ORF8 may be more transmissible and require close monitoring Authors would like to express their great thanks to all data contributors, i.e. the authors and their originating laboratories responsible for obtaining the specimens, and their submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which this research is based. We thank GISAID for all their support.