key: cord-0017718-jo6w0f6g
authors: Ben Chehida, Selim; Filloux, Denis; Fernandez, Emmanuel; Moubset, Oumaima; Hoareau, Murielle; Julian, Charlotte; Blondin, Laurence; Lett, Jean-Michel; Roumagnac, Philippe; Lefeuvre, Pierre
title: Nanopore Sequencing Is a Credible Alternative to Recover Complete Genomes of Geminiviruses
date: 2021-04-23
journal: Microorganisms
DOI: 10.3390/microorganisms9050903
sha: 2aef902c3f200fc02e03ac0979d360f065fee7ff
doc_id: 17718
cord_uid: jo6w0f6g

Next-generation sequencing (NGS), through the implementation of metagenomic protocols, has led to the discovery of thousands of new viruses in the last decade. Nevertheless, these protocols are still laborious and costly to implement, and the technique has not yet become routine for everyday virus characterization. Within the context of CRESS DNA virus studies, we implemented two alternative long-read NGS protocols, one that is agnostic to the sequence (without a priori knowledge of the viral genome) and the other that use specific primers to target a virus (with a priori). Agnostic and specific long read NGS-based assembled genomes of two capulavirus strains were compared to those obtained using the gold standard technique of Sanger sequencing. Both protocols allowed the detection and accurate full genome characterization of both strains. Globally, the assembled genomes were very similar (99.5–99.7% identity) to the Sanger sequences consensus, but differences in the homopolymeric tracks of these sequences indicated a specific lack of accuracy of the long reads NGS approach that has yet to be improved. Nevertheless, the use of the bench-top sequencer has proven to be a credible alternative in the context of CRESS DNA virus study and could offer a new range of applications not previously accessible.

Recent advances in metagenomics applied to viruses has fostered a greater inventory of the viral diversity [1] [2] [3] [4] . Hence, the large scale sampling of oceanic water [5, 6] , plants, animals, and humans [7] [8] [9] [10] , extreme environments [11] , or the mining of genomic and transcritptomic data [12] [13] [14] [15] have completely shifted our understanding of viral diversity and the function of viruses in host populations or even at the global ecosystem scale. However, these inventories remain largely incomplete, and the current knowledge of the virus diversity probably only represents the contour of the extant diversity [4] . The collection of hundreds of new genome sequences with, sometimes only remote resemblance to known viruses led to the acceptance of genome from metagenomic studies as genuine and legitimate taxonomic material for the description of new viruses [16] [17] [18] [19] [20] [21] , even without knowledge of the phenotype or the host associated to these viruses [17, 22] .

Yet, as access to metagenomics is usually costly and requires sophisticated technical expertise in data management and analysis [23, 24] ; for day-to-day analysis, classical Sanger sequencing remains more common [25] . Also, despite the potential of third generation sequencing technique to provide, in real time, hundreds of sequences with read length

Leaf samples of an apparently asymptomatic Medicago arborea were collected in November 2019 at Montferrier-sur-Lez (France). The sample was stored at −20 • C before use. Total DNA was extracted using the DNeasy Plant DNA extraction kit (Qiagen, Hilden, Germany), following the manufacturer's instructions. DNA extract was stored at −20 • C before use. From a previous analysis [46] , using a PCR amplification and Sanger sequencing, two strains of the capulavirus Trifolium virus 1 (TrV1-B and TrV1-C) were identified into the sample.

Pairs of abutting primers were designed to recover the full-length genome of TrV1-B and TrV1-C isolates. A two-step amplification was achieved, including a first amplification step using either the primer pair 3580F-CAPULUZARB-1F: 5 -ACT TGC CTG TCG CTC TAT CTT CTC CCT TGG AGA TGT AAT CTG CCA CGT CAG-3 , and PR2-CAPULUZARB-2R: 5 -TTT CTG TTG GTG CTG ATA TTG CGG AGT TTT TGA GGA ACG AGG AAT ACT  TAG AGC TTC A-3 for amplifying TrV1-B genomes or the primer pair 3580F-CAPUCORO-1F: 5 -ACT TGC CTG TCG CTC TAT CTT CAA CTG TCC TCC CTT TGC AAT GTA GTC  AGC C-3 and PR2-CAPUCORO-2R: 5 -TTT CTG TTG GTG CTG ATA TTG CCG AGG  AGC GAG GAC TTC TTA AGG CAA GT-3 for amplifying TrV1-C genomes. Amplification was carried out using the GoTaq ® Master Mix Kit (Promega Corporation, Madison, WI, USA) and the following conditions: an initial denaturation at 95 • C for 5 min, 8 cycles at 94 • C for 30 s, 60 • C for 30 s, 72 • C for 3 min, and a final extension step at 72 • C for 10 min. A common second amplification step was then performed using the primer pair (3580F: 5 -ACT TGC CTG TCG CTC TAT CTT C-3 and PR2: 5 -TTT CTG TTG GTG CTG ATA TTG C-3 ) and the GoTaq ® Master Mix Kit (Promega Corporation, Madison, WI, USA). The amplification conditions were as followed: an initial denaturation at 95 • C for 5 min, 25 cycles at 94 • C for 30 s, 60 • C for 30 s, 72 • C for 3 min, and a final extension step at 72 • C for 10 min. Amplification products of approximately 2.7-2.8 kb were gel purified, ligated to pGEM-T (Promega, Madison, WI, USA) and sequenced by standard Sanger sequencing using a primer walking approach.

Two alternative protocols for MinION sequencing were used. In the first protocol, called hereafter the RCA-MinION protocol, a RCA amplification was performed using Phi29 DNA polymerase (Illustra TempliPhi Amplification Kit, GE Healthcare, Chicago, IL, USA) by mixing 2 µL of total plant DNA extract with 5 µL of Sample Buffer before incubation at 95 • C during 3 min. After cooling at room temperature, 0.2 µL of enzyme mix and 5 µL of Reaction Buffer were added before incubation at 30 • C for 6 h followed by 20 min of polymerase deactivation at 65 • C. RCA products were cleaned-up using 2× of Sera-Mag Select Size Selection beads (GE Healthcare) and the 10 µL eluate were digested with 1 µL of T7 Endonuclease I (NEB), 2 µL of 5× buffer in a 10 µL reaction volume at 37 • C during 1 h. The fragments were purified with a 1× Sera-Mag Select Size Selection beads and eluate with 10 µL of purified water. Library construction for MinION sequencing was performed using the PCR Barcoding Kit (SQK-PBK004), following the manufacturer's instructions but using SeraMag Select Size Selection beads (GE Healthcare) for DNA purification. Sequencing was performed as described below for the PCR-MinION procedure. Two Flongles (flow cell dongle) FLO-FLG001 were used for sequencing. Whereas in the first Flongle, a single RCA amplicon was sequenced, in the second, three distinct RCA amplicons were multiplexed.

In the second protocol, called hereafter the PCR-MinION protocol, a two-step amplification was carried out. In the first PCR, both sets of abutting primers (3580F-CAPULUZARB-1F/PR2-CAPULUZARB-2R and 3580F-CAPUCORO-1F/PR2-CAPUCORO-2R) described above for respectively amplifying the genomes of strains TrV1-B and TrV1-C and the GoTaq ® Master Mix Kit (Promega Corporation, Madison WI, USA) were employed. The amplification conditions were as follows: an initial denaturation at 95 • C for 5 min, 15 cycles at 94 • C for 30 s, 60 • C for 30 s, 72 • C for 3 min, and a final extension step at 72 • C for 10 min. The second amplification step was performed using the cDNA Primer (cPRM) supplied in the PCR-cDNA Sequencing Kit (SQK-PCS109) and the LongAmp Taq 2X Master Mix Kit (New England Biolabs, Evry, France). The amplification conditions were as followed: an initial denaturation at 95 • C for 30 s, 20 cycles at 95 • C for 15 s, 62 • C for 15 s, 65 • C for 3 min, and a final extension step at 65 • C for 6 min. The amplicons were purified using Agencourt AMPure XP beads (Beckman Coulter, Brea, CA, USA) and MinION sequencing library was constructed using the PCR-cDNA Sequencing Kit (SQK-PCS109), following manufacturer's instructions. Sequencing was then performed on Flongle (FLO-FLG001) using MinKNOW software 19.06.8. Three Flongles were used, two for TrV1-B (Flongle 13 and Flongle 14) and another one for TrV1-C (Flongle 15).

For the reads obtained through the two MinION protocols, accurate basecalling was performed using Guppy (v4.09 or 4.015; [55] ). Demultiplexing and adapter removal was then performed using Porechop v0.2.4 [56] . Quality of the reads was investigated using NanoPlot v1.33.0 [57] .

The demultiplexed reads obtained from RCA-MinION procedure were assembled with FLYE 2.6 [58] , using the "meta" and "plasmid" parameters and when possible circularized as monomers using a homemade script. Contigs were then subjected to a BLASTn search against a CRESS DNA reference sequence database obtained from GenBank. CRESS DNA contigs were then polished using Medaka v1.2.2 [59] . PCR-MinION sequences higher than 1500 nt for one run (TrV1-C) and 2000 nt for the two other run (TrV1-B), were assembled using Canu v1.8 [60] . CRESS DNA contigs were filtered using BLAST as described above. Contigs coverage was estimated after mapping the raw reads back to the assembled CRESS DNA sequences using Minimap2 v2.17 [61] (Figure 1 ).

All the available capulavirus full genome sequences were downloaded from GenBank on 4 March 2021 and aligned with the sequences of strains TrV1-B and TrV1-C obtained after Sanger sequencing using MAFFT v7.475 [62] . A maximum likelihood (ML) phylogenetic tree was inferred using FastTree2 [63] using the "gtr" and "gamma" parameters. Branch supports were tested using SH-like local supports. Tree edition was performed using the ape R package [64] . In order to properly classify the sequences obtained, an analysis that include a subset of representative capulavirus from GenBank and the Sanger sequence obtained in this study was performed using SDT1.2 [65] .

Sequences obtained using the three distinct protocols (i.e., Sanger, PCR-MinION, and RCA-MinION procedures) were aligned together with MAFFT v7.475 before manual edit of the alignment. A home-made R script was used for sequence comparison and mutation count. Mutations were classified in three categories: substitution, insertion/deletion (INDEL) and homopolymer length variation (HLV) (Figure 1 ). ML trees were inferred from these alignments using FastTree2 as described above. oorganisms 2021, 9, x FOR PEER REVIEW 5 of 16 Figure 1 . Schematic representation of the three distinct sequencing methodologies used including the molecular biology procedure (purple boxes) the sequencing procedure (light blue boxes), the bio-informatics procedure (red boxes) and the method comparison (orange boxes). 

A total of 46 sequences were obtained after cloning and Sanger sequencing. These sequence groups in two distinct clades ( Figure S1 ) that share a mean identity of 90.3%. Within the first clade (n = 19), sequences present with identity ranging from 99.5 to 100% with each other and were most similar to the isolate BG2_capuz_47 of the B strain of TrV1 (GenBank accession number MW698819) with a minimum identity of 99.7%. Within the second clade (n = 27), sequences present identity ranging from 99.0 to 100% with each other and were most closely related to the isolate BG2_coro_02-2 of the C strain of TrV1 (GenBank accession number MW698820) with a minimum identity of 99.1%. Therefore, our two groups of sequences belong to two distinct strains (TrV1-B and TrV1-C) of the capulavirus Trifolium virus 1 species ( Figure S1 ). It is important to notice here that among the 46 isolates, five of the TrV1-B isolates and seven of the TrV1-C isolates present with defective genomes. Two of the TrV1-B defective isolates have deletions that encompass a fraction of the V3 gene, two other isolates have deletions within the gene of the replicationassociated protein and one last has a deletion that encompasses the rep gene. Four sequences of the TrV1-C isolates have deletions that encompass a fraction of the CP gene. Among these, the deletions also span a fraction of the V3 gene or the rep gene, depending on the isolate. One isolate has a deletion that encompasses a fraction of all the gene encoded in the complement strand. These twelve defective sequences were excluded from downstream analysis. The genomic organization of the remaining sequences confirms the presence of a short intergenic region (SIR) and a long intergenic region (LIR), a characteristic inverted repeat potentially capable of forming a stem-loop structure that included a conserved nonanucleotide sequence TAATATTAC present at almost all geminivirus virion-strand replication origins, the cp, a spliced complementary-strand intron-containing transcript which expresses replication-associated protein gene (rep), a large complementary-sense ORF (C3) that is completely embedded within the rep gene, and a complex arrangement of possible MP-encoding ORFs located in the 5 direction from the cp gene, which is a unique feature of Capulavirus genomes [47] . Four and three sequences of the TrV1-B and -C strains, present truncated ORFs. The full genome sequences of the isolate without ORFs truncation are available on GenBank under the accession numbers MW698819-MW698821 and MW768713-MW768736.

The RCA-MinION generated reads that confirmed the presence of both TrV1-B and TrV1-C strains in the Medicago arborea sample (Figure 2 ). The raw sequencing statistics are available in Table 1 . From Flongle 1 and Flongle 2, 188,123 and 273,088 raw reads were obtained from which 110,830 (59%) and 152,076 (56%) barcoded reads passed the quality control ( Figure S2 ), respectively. The median read length was 1154 bp with a longest read of 10,491 bp. From Flongle 1, only 27 reads (0.02%) mapped with the capulavirus references. From Flongle 2, barcodes were retrieved from 65,413 reads, 40,242 reads and 46,421 reads for each of the three barcodes, respectively. From 6 to 8.5% of those reads mapped with the capulavirus sequences. Although it was performed on the same DNA extract, RCA amplification yielded more than two order of magnitude less viral sequence for the Flongle 1 amplification than those performed for Flongle 2. It highlights known bias of the RCA amplification [66, 67] that were already evidenced in the context of CRESS virus amplification [68] . All the four distinct sequence sets (one barcode sample from Flongle 1 and three for Flongle 2) were then submitted to the assembly and circularization pipeline. For each of the four barcodes, unique contigs corresponding to the full genome sequences of the two strains were obtained. The four TrV1-B sequence lengths ranged from 2745 to 2769 bp and were at least 99.5% similar to any Sanger sequence. One of the TrV1-B sequences (RCA-Minion_10_2, Flongle 2 barcode 10, Table 1 ) present with a region that seems to be mis-assembled (100 nt in length, see grey tracks in Figure 3A ). The four TrV1-C sequences length ranged from 2763 to 2771 bp and were at least 99.1% similar to any Sanger sequence. Again, a probable mis-assembly (68 nt in length, grey tracks in Figure 3B ) was present within one sequence (RCA-Minion_01_1). As none of the raw minion reads supported the presence of this putative recombinant region, it has not been taken into consideration for further analyses. Figure S2 ). The median length of the passed reads was 643 bp with a longest read of 9816 bp. From Flongle 13, 14 and 15, 380,933 reads (98.7%), 367,381 reads (98.8%) and 337,338 reads (47.2%) mapped with the TrV1-B and -C reference sequences. All the three read sets were then submitted to the assembly. For every Flongle, contigs corresponding to the full genome sequence of the TrV1-B and TrV1-C Sanger references were obtained. The two TrV1-B sequences length ranged from 2731 to 2739 bp and were at least 99.7% similar to any Sanger sequence. The TrV1-C sequence is 2754 bp length and were at least 99.3% similar to any Sanger sequence (Figure 2 ).

In order to more precisely determine the nature of the differences between the sequences obtained through the different procedures, per capulavirus strains, all the full genome sequences obtained were compared to the consensus of the Sanger sequences. With more than 99.1% nucleotide identity for both the RCA-MinION and PCR-MinION sequences, the two methods demonstrate their ability to recover full genomes that are accurately assigned to strains TrV1-B and -C and whose sequence are representative of the viral population they originate. However, none of the assemblies obtained from MinION sequences was 100% identical to the Sanger sequence.

Beside mis-assemblies (grey tracks on Figure 3 ), the differences between sequences were classified in three categories with substitution (green ticks on Figure 3 ), INDEL (blue ticks) and HLV (red ticks). It must be noticed here that HLVs are a category of INDELs but were count separately as HLVs are recognized as the main source of errors in nanopore sequencing [69] [70] [71] [72] . Microorganisms 2021, 9, x FOR PEER REVIEW 8 of 16 First, Sanger sequences differ from each other (and to the consensus) mostly with substitutions (accounting for 79.5% of the variations) and more marginally with HLVs (19.4% of the variations) and INDELs (1.5% of the variations) ( Figure 2) . Except for two of the TrV1-C sequences, all the other Sanger sequences were unique and the number of variations between all these sequences was up to 12 and 27 for TrV1-B and TrV1-C, respectively. A total of 46 and 113 polymorphic sites were present in the Sanger sequences for TrV1-B and TrV1-C, respectively (Figure 3 ). Whereas these polymorphic sites tend be more frequently present within non-coding regions (binomial test p-value of 0.090 and 0.016 for TrV1-B and TrV1-C, respectively), these sites had globally more mutations among all the Sanger sequences (binomial test p-value of 3.7 × 10 −3 and 7.7 × 10 −6 for TrV1-B and TrV1-C, respectively). Whereas sequencing errors can explain a fraction of the mutation, with the high accuracy of Sanger sequencing in mind (per base Phred quality score of 50 [73] ), one can expect that most of the variations uncovered during the analysis are genuine and represent the biological variation associated with the diversity of the viral population infecting the plant [74] [75] [76] .

The analysis of the RCA-MinION and PCR-MinION sequences revealed a different pattern of polymorphism. Two sequences, one of TrV1-B and one of TrV1-C, both assembled from a restricted number of reads (Flongle 1, RCA-MinION_01-1 and RCA-MinION_01-2) were, as expected, less accurate. Coverage of the assembly, (e.g., representing the mean number of times every position of an assembly was read) is therefore a good indicator of the reliability of the resulting assembly ( Figure 4 ). Among the nine other MinION sequences assembled, seven present with no substitution to the consensus and one with ten substitutions (Figure 2 ). Through the TrV1-B sequences there was few common mutations: only a single HLV was common between Sanger and MinION assemblies ( Figure 3A ). PCR-MinION and RCA-MinION sequences presented four common HLVs ( Figure 3A ). For TrV1-C, Sanger sequences presented one substitution, no INDEL and three HLVs in common with MinION sequences ( Figure 3B ). PCR-MinION and RCA-MinION sequences presented one common substitution, three common HLVs and no common INDELs ( Figure 3B ). As these sequences were obtained after the assembly of reads, we were unable to catch the diversity of the distinct variants forming the viral population but rather to obtain a sequence very similar to the consensus of that population. Conversely to the reduced number of substitutions in comparison to the consensus, the assemblies present with larger numbers of INDELs (from 1 to 3) and HLVs (2 to 16). Some of these variations were also found in the Sanger sequences and it is probable that the assemblies actually represented some of the variability within the population. Nevertheless, for six INDELs and 38 HLVs, no corresponding mutations were found, and most would induce frameshift or protein truncation ( Figure 3 ). Despite the use of a dedicated sequence correction program, multiple HLVs remained in the assembled sequences.

Defective genomes are frequently detected within geminivirus populations [77] . For instance, twelve of the Sanger sequences displayed large deletions in comparison to full reference genomes. As MinION raw sequences are obtained after direct reads of either PCR or RCA amplicons, they should capture the diversity of defective genomes of the viral populations. Indeed, to obtain full genome assemblies using the PCR-MinION procedure, a selection of the reads approaching full genome size was required. The analysis of coverage of the reads (Figure 4 ) confirms the pervasive nature of defective genomes for both the TrV1-B and -C strains. The highest coverage was obtained for regions encompassing the stem-loop and reads were frequently missing most of the coding regions (see the blue lines for RCA-Minion). Importantly, it must be noticed here that the coverage inferred using the PCR-MinION procedure are not representative of the global population but rather represent the subset of virus that contains the priming site of the abutting primers used in the PCR (indicated with the verticals red dotted lines in Figure 4 ). The results indicate that every position of the TrV1-B strains present with a high coverage, most of the amplicons of TrV1-C were defective for a region encompassing the whole CP gene (from position 147 to 1559). frameshift or protein truncation (Figure 3 ). Despite the use of a dedicated sequence correction program, multiple HLVs remained in the assembled sequences. 

From an asymptomatic sample of Medicago arborea, two distinct strains of the capulavirus TrV1 have been cloned and sequenced by the Sanger methodology. Using both the a priori and the agnostic nanopore-based procedures, both TrV1-B and TrV1-C strains were detected, and full genome sequences were assembled. Despite being very similar to the consensus of Sanger sequences, mutations specific to the MinION assemblies were detected mostly within homopolymeric regions of the genomes, which is in agreement with other studies that have also pinpointed higher number of errors associated with homopolymer lengths [78, 79] . Whereas it could be argued that the HLV errors would be reduced with the increasing accuracy of sequencing, the development of a new base caller technologies and correction algorithm [80] , current MinION sequences assemblies may be avoided for some specific applications were the exact nucleotide sequence is required. Otherwise, for other applications, such as virus discovery, virus classification, or recombination analysis, MinION assemblies represent a competitive alternative to Sanger sequencing. Although, similarly to other NGS protocols, MinION based studies require the use of sophisticated bioinformatics tools for data management and analysis and despite the specific drawbacks on sequence quality, bench top sequencer such as the MinION would probably become routinely used in the laboratory. It allows a real-time detection and diagnostic of multiple viral strains or virus species in a single run. For niche applications, such as the exploration of geminivirus genomes, which do not exceed 10 kb, nanopore sequencing is poised to push the cost and performance limits of sequencing technologies. The high reactivity offered by the platform would make it more and more democratized as a mobile real-time plant disease diagnostic tool.

Supplementary Materials: The following are available online at https://www.mdpi.com/article/ 10.3390/microorganisms9050903/s1. Figure S1 : Maximum likelihood phylogenetic tree of all the Capulavirus genus full genome sequences obtained from NCBI along with the two TrV1-B and TrV1-C genotypes obtained in this study; Supplementary Figure S2 : Reads length (x-axis) against quality score (y-axis) for all passed reads (A and C) and capulavirus only passed reads (B and D) obtained using the RCA-MinION and PCR-MinION procedures.

Metagenomics reshapes the concepts of RNA virus evolution by revealing extensive horizontal virus transfer

A decade of RNA virus metagenomics is (not) enough

A tectonic shift in understanding virus evolution

Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses

Absolute quantification of infecting viral particles by chip-based digital polymerase chain reaction

Metagenomic Analyses of an Uncultured Viral Community from Human Feces

Emerging view of the human virome

The diagnosis of infectious diseases by whole genome next generation sequencing: A new era is opening

Illuminating an Ecological Blackbox: Using High Throughput Sequencing to Characterize the Plant Virome Across Scales

Coming of age: Ten years of next-generation sequencing technologies

Redefining the invertebrate RNA virosphere

Meta-transcriptomics and the evolutionary biology of RNA viruses

Using Metagenomics to Characterize an Expanding Virosphere

A field guide to eukaryotic circular single-stranded DNA viruses: Insights gained from metagenomics

Virus taxonomy in the age of metagenomics

Metagenomic arbovirus detection using MinION nanopore sequencing

Minimum Information about an Uncultivated Virus Genome (MIUViG)

Exploring the diversity of Poaceae-infecting mastreviruses on Reunion Island using a viral metagenomics-based approach

International Committee on Taxonomy of Viruses Executive Committee. The new scope of virus taxonomy: Partitioning the virosphere into 15 hierarchical ranks

Plant Virus Metagenomics: Advances in Virus Discovery

Deep sequencing: Becoming a critical tool in clinical virology

A first look at the Oxford Nanopore MinION sequencer

A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase

The Oxford Nanopore MinION: Delivery of nanopore sequencing to the genomics community

Real-time DNA barcoding in a rainforest using nanopore sequencing: Opportunities for rapid biodiversity assessments and local capacity building

Detection of Viral Pathogens with Multiplex Nanopore MinION Sequencing: Be Careful With Cross-Talk

Nanopore-based detection and characterization of yam viruses

MinION nanopore sequencing of an influenza genome

Real-time, portable genome sequencing for Ebola surveillance

Serotyping dengue virus with isothermal amplification and a portable sequencer

Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples

The Architecture of SARS-CoV-2 Transcriptome

Characterising Maize Viruses Associated with Maize Lethal Necrosis Symptoms in Sub Saharan Africa

Nanopore Sequencing as a Surveillance Tool for Plant Pathogens in Plant and Insect Tissues

A simple method for cloning the complete begomovirus genome using the bacteriophage ϕ29 DNA polymerase

Diverse circular ssDNA viruses discovered in dragonflies (Odonata: Epiprocta)

Novel circular single-stranded DNA viruses identified in marine invertebrates reveal high sequence diversity and consistent predicted intrinsic disorder patterns within putative structural proteins

Nanopore sequencing as a revolutionary diagnostic tool for porcine viral enteric disease complexes identifies porcine kobuvirus as an important enteric virus

Tree Lab: Portable genomics for Early Detection of Plant Viruses and Pests in Sub-Saharan Africa

Profiling of Human Gut Virome with Oxford Nanopore Technology

Manulis-Sasson, S. Diagnosis of plant diseases using the Nanopore sequencing platform

Nanopore sequencing of a novel bipartite New World begomovirus infecting cowpea

Nanopore-Based Complete Genome Sequence of a Sri Lankan Cassava Mosaic Virus

Genome characterization and diversity of trifolium virus 1: Identification of a novel legume-infective capulavirus

Capulavirus and Grablovirus: Two new genera in the family Geminiviridae

Geminivirus strain demarcation and nomenclature

Alfalfa Leaf Curl Virus: An Aphid-Transmitted Geminivirus

Establishment of three new genera in the family Geminiviridae: Becurtovirus, Eragrovirus and Turncurtovirus

From Spatial Metagenomics to Molecular Characterization of Plant Viruses: A Geminivirus Case Study

Identification and characterisation of a highly divergent geminivirus: Evolutionary and taxonomic implications

Diverse and variable virus communities in wild plant populations revealed by metagenomic tools

NanoPack: Visualizing and processing long-read sequencing data

Assembly of long, error-prone reads using repeat graphs

Canu: Scalable and accurate long-read assembly via adaptivek-mer weighting and repeat separation

Minimap2: Pairwise alignment for nucleotide sequences

MAFFT-DASH: Integrated protein sequence and structural alignment

FastTree 2-Approximately Maximum-Likelihood Trees for Large Alignments

ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R

A Virus Classification Tool Based on Pairwise Sequence Alignment and Identity Calculation

Multiple displacement amplification compromises quantitative analysis of metagenomes

Amplification Methods Bias Metagenomic Libraries of Uncultured Single-Stranded and Double-Stranded DNA Viruses

The Number of Target Molecules of the Amplification Step Limits Accuracy and Sensitivity in Ultradeep-Sequencing Viral Population Studies

Denoising DNA deep sequencing data-high-throughput sequencing errors and their correction

Performance of neural network basecalling tools for Oxford Nanopore sequencing

Rapid Detection of Genetic Engineering, Structural Variation, and Antimicrobial Resistance Markers in Bacterial Biothreat Pathogens by

MinION-Based DNA Barcoding of Preserved and Non-Invasively Collected Wildlife Samples

Next-generation DNA sequencing

Genetic Structure and Population Variability of Tomato Yellow Leaf Curl China Virus

Experimental evidence indicating that mastreviruses probably did not co-diverge with their hosts

Barcoding of Plant Viruses with Circular Single-Stranded DNA Based on Rolling Circle Amplification

Illumina and Nanopore methods for whole genome sequencing of hepatitis B virus (HBV)

Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis

Takeaways from Mobile DNA Barcoding with BentoLab and MinION

The authors declare no conflict of interest.