key: cord-0718772-0b2blj2c authors: Frey, Kenneth G.; Redden, Cassie L.; Bishop-Lilly, Kimberly A.; Johnson, Reed; Hensley, Lisa E.; Raviprakash, Kanakatte; Luke, Thomas; Kochel, Tad; Mokashi, Vishwesh P.; Defang, Gabriel N. title: Full-Genome Sequence of Human Betacoronavirus 2c Jordan-N3/2012 after Serial Passage in Mammalian Cells date: 2014-05-29 journal: Genome Announc DOI: 10.1128/genomea.00324-14 sha: 20905ce137d4d5735c25f22352a653f3ebcbc4fc doc_id: 718772 cord_uid: 0b2blj2c Middle East respiratory syndrome coronavirus (MERS-CoV) is the etiologic agent of a highly lethal pneumonia. Here, we report the full-genome sequence of the Jordan-N3/2012 strain after serial passage in two distinct mammalian cell lines. The genome exhibits noteworthy stability, which may inform the development of vaccines and therapeutics used to treat infection with this virus. a Saudi man with acute pneumonia (1) , and the genome sequence of this virus was announced shortly thereafter (2) . Reports followed of a similar virus isolated from patients in Qatar, England, and Jordan (3, 4) . Subsequently, the virus was designated Middle East respiratory syndrome coronavirus (MERS-CoV) (5) . A recent WHO report identified 180 laboratory-confirmed cases of MERS-CoV, including 77 deaths (6) . Numerous investigations are being conducted, including efforts to develop vaccines and therapeutics. It has been reported that MERS-CoV is capable of replicating in a wide variety of mammalian cell types (7) . However, it was unclear if serial growth impacts the viral genome. Here, we report the whole-genome sequence of MERS-CoV Jordan-N3/2012 after sequential passage in Vero CCL81 cells and a human embryonic fibroblast line (MRC5). The genome of MERS-CoV is a singlestranded RNA (ssRNA) encoding 10 proteins: a replicase polyprotein, (ORF1ab), three structural proteins (E, N, and M), a surface glycoprotein (S), and five nonstructural proteins (open reading frame [ORF] 3, 4a, 4b, 5, and 8b) (2). Total RNA was extracted from the supernatants of infected cell cultures, and sequencing libraries were created using the TruSeq RNA sample prep version 2 kit (Illumina, Inc.), beginning the protocol at the fragmentation step. Paired-end 151-base sequencing was performed on the MiSeq (average Ͼ730 Mb/sample). The reads were aligned against the NCBI reference (accession no. KC776147.1) using the CLC bio Genomics Workbench (version 6.5). In addition, the sequence reads were de novo assembled using CLC bio. In total, five samples were sequenced: passages 2, 6, 7, and 8 through CCL81 cells and passage 2 through MRC5 cells. All samples shared two single-nucleotide variants (SNVs) compared to the reference strain Jordan-N3/2012. Further analysis of putative SNVs indicated that all samples share an SNV at reference position 24045. This T¡C transversion falls in the S gene and results in a nonsynonymous mutation, I¡T. This variant is present in 55.8% of the reads in the MRC5 passage and 73.4% of the reads in the CCL81 passage. The proportion of reads with this SNV increased in the Vero-passaged samples, from 34.55% in passage 6 to 93.89% in passage 8, likely representing a cell culture adaptation. This residue is not predicted to reside in the binding domain of the S protein (8) . Although the reference genome contains an ambiguous nucleotide (W) at position 11262, all samples show a distinct preference for T (resulting in a leucine versus a histidine residue). It is uncertain as to whether this variation is due to a sequencing error in the reference genome or if population-level differences were lost during cell passage. Although this SNV is present in the coding sequence (CDS) of the ORF1ab gene, the functional consequences are unclear. The residue corresponding to this codon lies between the endopeptidase and the replicase domains and may not be present in mature virions. The apparent stability of this virus in vitro may facilitate the development of countermeasures by reducing the potential for rapid evolution and the resulting changes in immunodominant epitopes. Nucleotide sequence accession number. The genome sequence was deposited in GenBank under the accession no. KJ614529. The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of the Department of the Navy, Department of Defense, or the US. Government. T.K., V.P.M., and G.N.D. are employees of the U.S. Government, and this work was prepared as part of their official duties. Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans Detection of a novel human coronavirus by real-time reverse-transcription polymerase chain reaction Severe respiratory illness caused by a novel coronavirus Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus Study Group State of knowledge and data gaps of Middle East respiratory syndrome coronavirus (MERS-CoV) in humans Human coronavirus EMC does not require the SARScoronavirus receptor and maintains broad replicative capability in mammalian cell lines Structure of MERS-CoV spike receptor-binding domain complexed with human receptor DPP4