key: cord-0979038-b4v5odgx authors: Phutthasophit, Kanyarat; Buddhari, Darunee; Chinnawirotpisan, Piyawan; Joonlasak, Khajohn; Manasatienkij, Wudtichai; Huang, Angkana; Kaewkao, Thitiman; Mahayos, Narong; Khontong, Rotjana; Iamsirithaworn, Sopon; Jones, Anthony R.; Farmer, Aaron R.; Fernandez, Stefan; Klungthong, Chonticha title: Coding-Complete Genome Sequences of Alpha and Delta SARS-CoV-2 Variants from Kamphaeng Phet Province, Thailand, from May to July 2021 date: 2021-12-02 journal: Microbiol Resour Announc DOI: 10.1128/mra.00877-21 sha: 3dd00fe10c2abcbaefa6378d8f296d5f669f284b doc_id: 979038 cord_uid: b4v5odgx We report coding-complete genome sequences of 44 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strains of the alpha and delta variants identified from patients in Kamphaeng Phet, Thailand. Two nonsense mutations in open reading frame 3a (ORF3a) (G254*) and ORF8 (K68*) were found in the alpha variant sequences. Two lineages of the delta variant, B.1.617.2 and AY.30, were found. , GISAID clade nomenclature (10) , and phylogenetic analysis (11) (12) (13) were used to determine SARS-CoV-2 lineages. Nextclade v1.6.0 (14) was used to identify variants. All tools were run with default parameters. b Nucleotide and amino acid substitutions and annotation were analyzed using an in-house bioinformatics pipeline (19) . All alpha variant sequences were aligned with the first alpha variant sequence collected in Thailand (GISAID accession number EPI_ISL_1346626), which was collected on 21 December 2020. All delta variant sequences were aligned with the first delta variant sequence collected in Thailand (GISAID accession number EPI_ISL_2104743), which was collected on 2 May 2021. Individual genome characteristics are summarized in Table 1 . The reads obtained were 35 to 251 nucleotides in length, and the average length was 217 nucleotides. Consensus sequences of coding regions were 29,387 to 29,402 bp in length, with the mean DOC ranging from 3,636Â to 8,642Â. Of 44 sequences, 20 and 24 were identified as alpha and delta variants, respectively. The alpha variants were found from 3 May 2021 to 18 July 2021, whereas the delta variants were found from 24 June 2021 to 18 July 2021. The phylogenetic tree is shown in Fig. 1 . Amino acid substitutions found in the alpha and delta variants from KPP when aligned with the sequences of the first corresponding variants collected in Thailand are shown in Table 1 . Two nonsense mutations, i.e., G254* and K68* in open reading frame 3a (ORF3a) and ORF8 genes, respectively, were not found in the first alpha variant virus in Thailand but were found in all alpha variant viruses in this study. K68* was reported previously (15) . G254* in ORF3a resulted in the predicted absence of 18 amino acid residues (positions 254 to 271) at the C terminus of the protein, located in a region thought to carry several B cell epitopes (16) . Mutations in ORF3a were previously described as potentially having an impact on viral infectivity and pathogenesis (16) (17) (18) . Among the 24 delta variant viruses from KPP, 8 sequences were identified as B.1.617.2 lineage and 16 sequences were identified as AY.30 lineage. In conclusion, the two variants of concern, alpha and delta, were identified from May to July 2021 in KPP. Nonsense mutations in ORF3a and ORF8 were found in the alpha variant sequences. Two lineages of the delta variant were found. Data availability. The sequences from this study were deposited in GenBank (accession numbers MZ888515 to MZ888557 and MZ895505). Individual accession numbers are indicated in Table 1 . The raw reads were deposited in the NCBI Sequence Read Archive (SRA) (accession numbers SRR15571382 to SRR15571425). The BioProject accession number is PRJNA757144. The BioSample accession numbers are SAMN20934606 to SAMN20934649. FIG 1 Maximum likelihood phylogenetic tree of 47 SARS-CoV-2 coding sequences, including 44 sequences from this study (black), the first collected alpha variant sequence from Thailand (GISAID accession number EPI_ISL_1346626), which was collected on 21 December 2020 (green), the first collected delta variant sequence from Thailand (GISAID accession number EPI_ISL_2104743), which was collected on 2 May 2021 (green), and the reference Wuhan-Hu-1 genome sequence (GenBank accession number NC_045512.2) (blue). Multiple sequence alignments were performed using MAFFT v7.475 with default settings (11) . The tree was constructed by using IQ-TREE v2.1.2 (12) with substitution model TIM21F1I and 1,000 ultrafast bootstrap replicates and was visualized by using FigTree v1.4.2 (13) . COVID-19 daily situation report COVID-19 ARTIC v3 Illumina library construction and sequencing protocol V Coding-complete genome sequences of 11 SARS-CoV-2 B.1.1.7 and B.1.351 variants from metro Manila Coding-complete genome sequences of 23 SARS-CoV-2 samples from the Philippines Fast and accurate short read alignment with Burrows-Wheeler transform An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar Genome Project Data Processing Subgroup. 2009. The Sequence Alignment/Map format and SAMtools Full-length transcriptome assembly from RNA-Seq data without a reference genome A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology Region Sequencing Laboratories and GISAID EpiCoV Group. 2020. Geographical and temporal distribution of SARS-CoV-2 clades in the WHO European Region MAFFT multiple sequence alignment software version 7: improvements in performance and usability IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies FigTree, a graphical viewer of phylogenetic trees Nextstrain: real-time tracking of pathogen evolution A novel SARS-CoV-2 viral sequence bioinformatic pipeline has found genetic evidence that the viral 39 untranslated region (UTR) is evolving and generating increased viral diversity Variations in Orf3a protein of SARS-CoV-2 alter its structure and function SARS-CoV-2 and ORF3a: nonsynonymous mutations, functional domains, and viral pathogenesis. mSystems SARS-Cov-2 ORF3a: mutability and function 2021. vbtafrims/SARS-CoV-2-substitution-calling We acknowledge the support from the Kamphaeng Phet Provincial Hospital (KPPH) infectious control team and acknowledge the KPP provincial health office surveillance team, especially Parinya Nakpoonnabutra, Hatairat Suntornsuk, and Wiratch Kateudomsup, for field surveillance.This study was funded by the Armed Forces Health Surveillance Division, Global Emerging Infections Surveillance (GEIS) Branch (ProMIS identifier P0084_21_AF).The material has been reviewed by the WRAIR; there is no objection to its presentation and/or publication. The opinions or assertions contained herein are the private views of the authors and are not to be construed as official or as reflecting the views of the Department of the Army or the Department of Defense.We declare no competing interests.