key: cord-0035354-f8a6fjaj authors: Wang, Leyi; Stuber, Tod; Camp, Patrick; Robbe-Austerman, Suelee; Zhang, Yan title: Whole-Genome Sequencing of Porcine Epidemic Diarrhea Virus by Illumina MiSeq Platform date: 2015-09-10 journal: Animal Coronaviruses DOI: 10.1007/978-1-4939-3414-0_18 sha: 1b4e6bb5cc4854fe08050c87c79a06e3a569e2d0 doc_id: 35354 cord_uid: f8a6fjaj Porcine epidemic diarrhea virus (PEDV) belongs to the genus Alphacoronavirus of the family Coronaviridae. PEDV was identified as an emerging pathogen in US pig populations in 2013. Since then, this virus has been detected in at least 31 states in the USA and has caused significant economic loss to the swine industry. Active surveillance and characterization of PEDV are essential for monitoring the virus. Obtaining comprehensive information about the PEDV genome can improve our understanding of the evolution of PEDV viruses, and the emergence of new strains, and can enhance vaccine designs. In this chapter, both a targeted amplification method and a random-priming method are described to amplify the complete genome of PEDV for sequencing using the MiSeq platform. Overall, this protocol provides a useful two-pronged approach to complete whole-genome sequences of PEDV depending on the amount of virus in the clinical samples. Complete genome sequencing and genetic analysis signifi cantly improved our understanding of the evolution and relationship of porcine epidemic diarrhea virus (PEDV) strains worldwide. The fi rst PEDV whole-genome sequence was completed for the prototype strain CV777 in 2001 [ 1 ] . Since then, several PEDV strains have been sequenced and now over 170 whole-genome sequences have been deposited in GenBank. Based on the phylogenetic analysis of the whole-genome sequence, PEDV has been classifi ed into two Genogroups-1 and 2-which the variant and classical strains of US PEDV belong to, respectively [ 2 ] . Since the fi rst 454 FLX pyrosequencing platform was introduced to the market in 2005, next-generation sequencing ( NGS ) has signifi cantly advanced research in diverse fi elds. NGS has the advantages of high-throughput and cost-effectiveness. Currently, there are several platforms available, including the Genome Analyser developed by Illumina/Solexa, and the Personal Genome Machine (PGM) by Ion Torrent. One of Illumina NGS platforms-MiSeq -is commonly used in diagnostic laboratories. In general, sequencing viruses directly from fecal samples for PEDV are technically challenging without prior amplifi cation with specifi c primers. In this chapter, we describe a useful two-pronged approach where the random-priming method can be used to sequence the complete PEDV genome from samples with Ct values of less than 15, whereas the targeted amplifi cation method is recommended to be used to sequence clinical fecal samples with higher Ct values (low viral loads). Step RT-PCR 3. MiSeq (Illumina). 3. After the best reference has been found, alignment metrics including read counts, mean depth of coverage, and percent of genome with coverage are collected. 4. Reports summarizing the alignment metrics ( Fig. 1 ) along with Kraken identifi cation interactive Krona HTML fi le, a FASTA of assembled genome, and depth of coverage profi le graph (Fig. 2 ) are e-mailed to concerned individuals. 5. The assembled FASTA fi le can be visually verifi ed in IGV using the BAM and VCF output from the script. If necessary the FASTA can be corrected in program of choice. 6. Script details are provided on GitHub ( https://github.com/ USDA-VS/public/blob/master/secd/idvirus.sh ). Fig. 1 Report summary for sample OH851-RP-Virus. The reference set used to initiate the shell script was SECD (swine enteric coronavirus diseases). File size and read counts for each fastq fi le are shown. Provided by Kraken, 223,955 virus reads were identifi ed. "Reference used" is the closest fi nding in the NCBI nt database. The read count shows the number of raw reads shown to match the reference. "Percent cov" shows the percent of reference having coverage. A coverage of 98.36 % for PEDV indicates a true fi nd relative to the sporadic <51 % coverage seen from PRCV, although in this case, the presence of PRCV cannot be ruled out. There were no reads matching TGEV and PDCoV, which were not shown. Because of the high percent of genome coverage, the completed reference-guided assembly for PEDV was BLAST against the nt database to provide mismatches, e -value, and bit score against the most closely related publicly available genome Life Technologies) for viral RNA extraction from fecal or intestinal contents ( see Note 1 ) One-Step RT-PCR kit (Qiagen) Forward primer: 5′-CATGGGCTAGCTTTCAGGTC-3′. Reverse primer: 5′-CGGCCCATCACAGAAGTAGT-3′. Probe: 5′/56-FAM/CATTCTTGGTGGTCT TTCAAT CCTGA/ZEN 3IABkFQ/3′ One-Step RT-PCR kit (Qiagen) ( see Note 3 ) Oligonucleotide primers dissolved in nuclease-free water to a stock concentration of 100 pmol/μl and a working concentration of 20 pmol/μl. The sequences for 19 pairs of primers are listed in Table 1 Qiagen gel purifi cation kit Qubit 2.0 Fluorometer (Life Technologies) SuperScript III Reverse Transcriptase kit (Invitrogen) Klenow amplifi cation (NEB) Advantage 2 PCR kit (Clontech) 10 mM dNTP mix (NEB) RNase Inhibitor (Promega) Notes 1. The MagMAX Pathogen RNA/ DNA Kit was used for the extraction of nucleic acid for pathogen detection -including the detection of TGEV, PEDV, and PDCoV-from pig feces or intestinal contents