key: cord-0810138-0n15rt90 authors: TOZAKI, Teruaki; OHNUMA, Aoi; KIKUCHI, Mio; ISHIGE, Taichiro; KAKOI, Hironaga; HIROTA, Kei-ichi; KUSANO, Kanichi; NAGATA, Shun-ichi title: Design and storage stability of reference materials for microfluidic quantitative PCR-based equine gene doping tests date: 2021-12-28 journal: J Equine Sci DOI: 10.1294/jes.32.125 sha: bdf397a326d612e7982cbe3a2698a9883416afde doc_id: 810138 cord_uid: 0n15rt90 One method of gene doping in horseracing is administering of exogenous genetic materials, known as transgenes. Several polymerase chain reaction (PCR)-based methods have been developed for detecting transgenes with high sensitivity and specificity. However, novel designs for reference materials (RMs) and/or positive template controls (PTCs) are necessary for simultaneous analysis of multiple transgene targets. In this study, we designed and developed a novel RM for simultaneously detecting multiple targets via microfluidic quantitative PCR (MFQPCR). Twelve equine genes were selected as targets in this study. A sequence region including primers and probes for quantitative PCR was designed, and a 10 bp sequence was inserted to allow the RM to be distinguished from the original transgene sequences. The sequences of individual detection sites were then connected for 12 genes and cloned into a single plasmid vector. We performed fragment size analysis to distinguish between the PCR products of the original transgene sequence and those of the RM, enabling identification of RM contamination. PTCs diluted to 10,000, 1,000, 100, and 10 copies/µl with horse genomic DNA from RM were stably stored at 4°C for 1 year. As digital PCR enabled absolute quantification, the designed substances can serve as an RM. These findings indicate that the RM design and storage conditions were suitable for gene doping tests using MFQPCR. The International Federation of Horseracing Authorities (https://www.ifhaonline.org/) has updated the International Agreement on Breeding, Racing, and Wagering to prohibit gene doping [18] . Gene doping in horseracing is classified as the administration of gene doping substances to postnatal animals or the generation of genetically modified racehorses by editing of zygotes. One method in the former category involves the administration of an exogenous gene, known as a 'transgene', to postnatal horses [21] . Polymerase chain reaction (PCR)-based detection methods have been developed to detect transgenes in human and equine sports [3, 7, 16, 19] . Quantitative PCR methods using a hydrolysis probe can specifically detect administered transgenes cloned into plasmids or adeno-associated virus vectors [9, 21] . Recently, the World Anti-Doping Agency published laboratory guidelines for PCR-based detection of transgenes for gene doping control in human sports [25] . A necessary step in developing a gene doping test is method validation [5, 8, 11] , for which a reference material (RM) and/or positive template control (PTC) are required [4, 26] . RM is defined as a material that is sufficiently homogeneous and stable with respect to one or more specified properties and has been established to be suitable for its intended use in a measurement process, whereas a PTC is a substance diluted from the RM. In a gene doping test, a PTC constructed from a diluted RM should be used. When detecting a single transgene, the full gene sequence should be cloned into a plasmid vector to establish an RM, which should then be diluted for use as a PTC [21] . However, when targeting multiple transgenes simultaneously, preparing an RM or PTC individually is complicated. We recently developed a simultaneous detection method for 12 transgenes (24 assays in total) using microfluidic quantitative PCR (MFQPCR) as a gene doping test for use in horseracing [19] . However, a novel RM is required for simultaneous analysis of multiple targets. An advantage of PCR-based detection methods is their high sensitivity [16] . Analysis of the PTC is necessary for confirmatory detection of gene doping [25] . However, PCR-based detection methods have a potential risk of false positives due to PTC contamination [14, 17] , as PCR theoretically enables the amplification of a single copy. In this study, we developed an RM for MFQPCR-based gene doping tests that simultaneously detects multiple target genes. We also examined the contamination risk associated with the developed RM and its long-term storage stability. Blood samples were collected from Thoroughbreds at the Miho and Ritto Training Centers and Equine Research Institute of the Japan Racing Association after obtaining approval from the Animal Care Committee of the Laboratory of Racing Chemistry (Utsunomiya, Tochigi, Japan, approval number 20-4). Blood was collected from 264 Thoroughbreds into BD Vacutainer spray-coated K 2 EDTA tubes (BD Biosciences, Franklin Lakes, NJ, U.S.A.). Plasma was separated by centrifugation at 1,500 × g for 10 min. The separated plasma samples were stored at −30°C. The separated plasma samples from the 264 horses were used for MFQPCR detection. Among these samples, two plasma samples (1.5 ml) were spiked with either 150 or 1,500 copies of Control_A_MSTN as positive detection models (Table 1) , and two samples were spiked with either 150 or 1,500 copies of Control_C_SET1 as contamination models of PTCs (Table 1) . DNA was extracted from the 1.5 ml plasma samples using a Custom NEXTprep cfDNA Auto Kit (1.5 ml; PerkinElmer, Waltham, MA, U.S.A.) with a chemagic 360 instrument (PerkinElmer). The extract was dissolved in Milli-Q water to a final volume of 50 µl. The following 12 equine genes were selected as targets: creatine kinase, muscle (CKM), erythropoietin (EPO), fibroblast growth factor 2 (FGF2), follistatin (FST), growth hormone 1 (GH1), insulin-like growth factor 1 (IGF1), myostatin (MSTN), phosphoenolpyruvate carboxykinase 1 (PCK1), pyruvate dehydrogenase kinase 4 (PDK4), peroxisome proliferator activated receptor delta (PPARD), vascular endothelial growth factor (VEGF), and zinc finger and AT-hook domain containing (ZFAT). These genes were the same as those used in our previous study [19] . We used the pre-amplification primers and quantitative PCR primers and probes designed in our previous study, with minor modifications [19] . As shown in Fig. 1A , each gene had two primer/probe sets, SET1 and SET2. Forward and reverse primers targeting different exons and TaqMan-MGB probes targeting exon/exon junctions were designed and synthesised. Finally, 12 primer-probe sets for SET1 detections and 12 primer-probe sets for SET2 detections were prepared and used for the 12 transgenes. The SET1 assay for the MSTN transgene was labelled as MSTN_SET1; this labelling system was used for each gene. The designed probes and primers have not been listed, as this may prevent their use in actual gene-doping tests. Sequence information of the probes and primers will be provided through a confidentiality agreement with the corresponding author. RM Control_A, with an open reading frame (DNA sequence with exons aligned without gaps) and untranslated region, was prepared as described in our previous study [19] for each of the 12 transgenes. The RM for the MSTN transgene was labelled as Control_A_MSTN; this labelling system was used for each gene. Control_C_SET1 and Control_C_SET2 were novel RMs designed in this study. They contained only the detection site of each transgene between the forward and reverse primers used for pre-amplification. Additional 10-bp sequences were inserted into the region between the qPCR probe and primer sites (Fig. 1B) . Sequences designed to enable the individual detection of each gene were then connected. These sequences were synthesised by Fasmac (Atsugi, Japan) and cloned into pUCFa (r-Amp + , ColE1_ori + ). The Purified Control_C_SET1 and Control_C_SET2 were diluted to 5, 10, 30, 100, 1,000, and 10,000 copies in 2.75 µl of Milli-Q water with 10 ng/µl of equine genomic DNA. The copy concentration was measured and adjusted using digital PCR, as described below. These samples were used as PTCs for MFQPCR detection. In addition, 10, 100, 1,000, and 10,000 copies/µl of the PTCs in Milli-Q water with or without 10 ng/µl of equine genomic DNA were prepared and stored at 4°C for 1 year and used for storage stability experiments using digital PCR, as described below. MFQPCR detection was performed under previously described conditions with minor modifications [19] . Preamplification was conducted using 2.75 µl of sample solution (6 diluted PTCs and extracts from 86 unspiked plasmas and 2 plasmas spiked with Control_A in Fig. 2A ; 6 diluted PTCs and extracts from 88 unspiked plasmas in Fig. 2B ; and 6 diluted PTCs and extracts from 86 unspiked plasmas and 2 plasmas spiked with Control_C in Fig. 2C ) with preamplification primer pools for SET1 and SET2. The PCR conditions for pre-amplification were the same as those used in our previous study, except for the number of PCR cycles, which was 14 in this study. MFQPCR was performed under the conditions described in a previous study [19] , except with a different sample volume, using a 192.24 Dynamic Array IFC for Gene Expression (Fluidigm, South San Francisco, CA, U.S.A.). In this study, undiluted pre-amplified PCR products were used as samples for MFQPCR detection. All qPCR operations and data analyses were conducted using the Biomark Data Collection software and Real-Time PCR Analysis software (Fluidigm), respectively. Two NTCs (no template controls) were prepared for quality control. at 95°C for 3 min, followed by 30 cycles of denaturation at 95°C for 30 sec, annealing at 60°C for 30 sec, and extension at 72°C for 1 min. After a final extension at 72°C for 10 min, the amplicons were stored at 4°C until analysis. Fragment patterns were determined using a DNA 1K Reagent Kit assay with the DNA Extended Range LabChip of LabChip GX Touch 24 (PerkinElmer) in accordance with the manufacturer's recommended procedure. Each marker was measured 6 times, and then means and standard deviations were calculated. An NTC was prepared for quality control. Digital PCR analysis was performed according to the conditions described in our previous study [21] . A 22-µl reaction sample was prepared that contained 2.2 µl of sample solution (Control_C diluted to 10,000, 1,000, 100, and 10 copies), 11 µl of 2X digital PCR Supermix (no dUTP; Bio-Rad, Hercules, CA, U.S.A.), 0.2 µl of 100 µM forward primer, 0.2 µl of 100 µM reverse primer, and 0.6 µl of 10 µM TaqMan-MGB probe. A 20-µl droplet was created using an automated droplet generator (Bio-Rad) and then PCR amplified using a T100 thermal cycler (Bio-Rad). Enzyme activation was conducted at 95°C for 10 min, followed by 40 cycles of denaturation at 94°C for 30 sec and annealing/extension at 60°C for 1 min. The amplicons were stored at 12°C after 10 min of enzyme deactivation at 98°C. DNA concentrations were measured in the samples using a QX200 droplet reader (Bio-Rad). An NTC was prepared for quality control. The linearity of diluted Control_C (10,000, 1,000, 100, and 10 copies) was calculated using the leastsquares method. Table 1 ), which was detected by microfluidic quantitative PCR (A). Delivery errors of PTC (1,000 copies) and FGF2 assay solution are shown as green rectangles (A and B). Two plasma samples were spiked with Control_C_SET1 (green rectangles, samples 3 and 4 in Table 1 ) as models of PTC sample contamination (C). Blue circles represent non-specific amplification (A-C). MFQPCR analysis indicated that 5, 10, 30, 100, 1,000, and 10,000 copies of Control_C_SET1 and Control_C_ SET2 as PTCs were effectively amplified and detected in the 24 assays (12 genes × 2 assays [SET1 and SET2]), whereas extracts from unspiked plasma were not amplified. In addition, amplification signals in samples spiked with Control_A_MSTN (150 or 1,500 copies in 1.5 ml plasma) were detected in both the MSTN_SET1 and MSTN_SET2 assays. The red circles in Fig. 2A indicate the signals detected in the MSTN_SET1 assay, whereas the blue circles in Fig. 2 indicate the three weak signals observed in the SET1 assay. No amplification signals were observed in the same samples via the SET2 assay, suggesting that the weak amplification signals were non-specific signals. Among the large number of MFQPCR analyses performed, several errors in liquid transfer were observed in the integrated fluidic circuits (IFCs; Fig. 2A, 2B) . When an IFC liquid transfer error occurs, no chambers in the same row produce signals. As shown in Fig. 2A (green rectangle), Control_C_SET1 at 1,000 copies failed to produce an amplification signal in any of the SET1 assays. As shown in Fig. 2B (green rectangle), no PTCs produced amplification signals in any of the FGF2_SET1 assays. In contrast, as shown in Fig. 2C (green rectangle), all SET1 assays of samples spiked with 150 or 1,500 copies of Control_C_ SET1 produced positive results, indicating contamination. Using forward and reverse primers for MFQPCR analysis, the sizes of the PCR products amplified from the Control_A and Control_C templates were compared using a LabChip GX Touch 24. PCR products from Control_A and Control_C were differentiated by the insertion sequence of 10 bp for all markers (Fig. 3, Table 2 ). The assays distinguished between the amplification product lengths of Control_A as gene-doping substances and Control_C. The concentrations of Control_C_SET1 and Control_C_ SET2 were adjusted to 10,000, 1,000, 100, and 10 copies/µl and refrigerated for 1 year. Concentration was measured by digital PCR using SET1 and SET2 assays for CKM, FST, MSTN, and PPARD. The mean of four measurements was considered as the concentration value. Control_C_SET1 and Control_C_SET2 diluted with 10 ng/µl genomic DNA were stably detected for 1 year. However, the quantitative values of Control_SET1 and Control_SET2 diluted with Milli-Q water tended to decrease after 6 months in samples diluted to 1,000, 100, and 10 copies/µl (Fig. 4) . The coefficient of variation (CV) of each sample at each time point was calculated for each of the four concentrations (Fig. 5) . The CVs of 10,000, 1,000, and 100 copies/ µl reached approximately 20%, whereas that of 10 copies/ µl reached 36%, suggesting that quantification of low copy numbers was difficult (Fig. 5 ). In addition, at 10 copies/ µl, the CV of Control_C diluted in genomic DNA solu- tion ranged from 8.5% to 25%, whereas that of Control_C diluted in Milli-Q water ranged from 5.8% to 36%. Notably, the CVs at the last two measured time points were large, indicating that variation gradually increased over time for Control_C diluted in Milli-Q water (Fig. 5) . The linearity of diluted Control_C was calculated immediately after sample preparation and at 1, 3, 6, and 12 months (Fig. 6) . The linearity of all Control_C samples was generally maintained (R 2 : 0.999-1.0000), excluding Control_C_SET2 diluted in Milli-Q water (R 2 =0.9983 at 3 months, R 2 =0.9962 at 6 months, and R 2 =0.9949 at 12 months). In quantitative PCR-based gene doping tests, a PTC and NTC are required for positive determination [25] . In addition, to determine the limit of detection for method validation of a gene doping test, it is necessary to prepare low concentrations and/or serial dilutions of the PTC [14] . As we had previously developed a simultaneous detection method for 12 transgenes using MFQPCR [19] , in this study, we aimed to develop an RM for multiple targets. When a single gene is targeted in a gene doping test, an RM can be prepared as a single target [21] . However, when multiple genes are targeted, the preparation of RMs becomes complicated. In one procedure, an RM for each gene is cloned into a plasmid vector and individually quantified so that equal contents of each RM are present [19] . However, it is difficult to mix individually prepared RMs with the same number of copies, particularly those with low copy numbers (10 copies or less). This leads to variations in the PTC copy numbers, resulting in uncertain identification and quantification of gene doping substances. In fact, in a previous study, we mixed and diluted individually prepared PTCs, but the detection of mixed PTCs in the low copy number region was unstable [19] . Although not necessarily problematic when positive results are obtained by amplification and non-amplification, detection instability in the low copy number region is an issue when the limit of detection has been established. We concluded that the Control_C template designed in this study was suitable as an RM for simultaneous detection using MFQPCR because it had multiple detection sites in the single plasmid vector, rendering individually quantified and mixed PTCs unnecessary. Although RMs are traditionally constructed from RNA extracted from equine tissues, we prepared RMs via artificial synthesis. The whole-genome sequence of horses, excluding the Y chromosome, has been determined, and the reference sequence, EquCab3.0, is currently available [13, 24] . A variant database of 101 Thoroughbreds is now available [20] . Therefore, RMs can be easily designed using genomic information and artificially synthesised. For example, in our study, Control_C comprised the quantitative PCR detection sites of 12 target genes. One advantage of Control_C was that it could be quantified by multiple assays (12 assays maximum). Quantitative bias caused by assay differences, such as amplification efficiency differences, can be suppressed to a low level. In this study, we found that a more accurate PTC copy number could be quantified using the mean of four assays. These quantification procedures may be suitable for RM quantification [4, 26] . As digital PCR enables absolute quantification of target templates [10, 23] , quantification of Control_C via multiple assays is an effective method for validating gene doping tests. An advantage of MFQPCR analysis is the ability to analyze multiple markers simultaneously [12, 15] ; however, the device used for it is very sensitive, making it necessary to monitor errors originating from the analytical instrument. Particular attention should be paid to IFC liquid delivery errors, and Control_C may be useful for identifying such errors. In one assay, no amplification signals were detected, whereas PTCs were detected in the other assays. Here, errors in the delivery of primer/probe mixtures were suspected rather than errors in the delivery of control substances. PTCs are necessary for gene doping tests and PCR-based detection of viruses, such as COVID-19 [6, 22] . However, the use of PTCs comes with the potential risk of sample contamination. Because PCR theoretically enables amplification of a single copy, positive controls are needed to determine whether the detected signals originate from the sample or from contamination. An advantage of Control_C is that its PCR product size differs from that of the original gene. In addition, because it is unlikely that all tested genes will have been doped simultaneously, detected signals are more likely to have arisen from contamination than from a positive sample if Control_C is detected in all assays. The number of false-positive results can be reduced by using these procedures. In real-time PCR, controls must be prepared from the RM for each gene doping test [25] , either by diluting high-concentration RMs for each test or by preparing diluted controls for long-term storage. The former method is generally used for quantitative operations requiring strict dilutions because a high concentration is more stable than a low concentration [1] . The latter method is easier to perform, but the stability of PTCs with low copy numbers is a concern. Although it has been reported that PTCs with high copy numbers were stably stored for over 1 year [2] , in this study, we demonstrated that even PTCs with low copy numbers could be stably stored for approximately 1 year by dissolving them in genomic DNA. This enables diluted controls to be prepared for long-term storage prior to gene doping tests. The high CV observed at a concentration of 10 copies/ µl was attributed in part to the process of dispensing the sample stock into the well of the PCR plate. Theoretically, a signal can be detected from one copy; however, because it is difficult to dispense exactly one copy, approximately five copies may be the detection limit. In addition, particularly in Milli-Q water, adsorption to the storage container may lead to variation. In conclusion, we determined that even at 10 copies/µl, the newly designed PTC could be stored under refrigeration for up to 1 year when dissolved in a solution containing the genome. In addition, by absolute quantification via digital PCR, we determined that the prepared PTC can be used as an RM. The results indicate that our RM design concept is suitable for use in MFQPCR-based gene doping tests and their developmental validation. There are no competing interests, including patents, products in development, or marketed products, to declare in relation to this work. Development of singleplex and multiplex real-time (Taqman ® ) RT-PCR assays for the detection of viruses associated with fig mosaic disease Storage stability of solutions of DNA standards Synthetic certified DNA reference material for analysis of human erythropoietin transgene and transcript in gene doping and gene therapy The characterization and certification of a quantitative reference material for Legionella detection and quantification by qPCR The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments CoV2-ID, a MIQE-compliant sub-20-min 5-plex RT-PCR assay targeting SARS-CoV-2 for the diagnosis of COVID-19 A duplex qPCR assay for human erythropoietin (EPO) transgene to control gene doping in horses The digital MIQE guidelines update: minimum information for publication of quantitative digital PCR experiments for 2020 Detection of intra-articular gene therapy in horses using quantitative real time PCR in synovial fluid and plasma High-throughput droplet digital PCR system for absolute quantitation of DNA copy number The digital MIQE guidelines: minimum information for publication of quantitative digital PCR experiments Water quality monitoring and risk assessment by simultaneous multipathogen quantification Improved reference genome for the domestic horse increases assembly contiguity and composition A CRISPR/Cas9 eraser strategy for contamination-free PCR end-point detection A microfluidic platform for the simultaneous quantification of methanogen populations in anaerobic digestion processes Establishment of two quantitative nested qPCR assays targeting the human EPO transgene Reagent decontamination to eliminate false-positives in Escherichia coli qPCR Control of gene doping in human and horse sports Microfluidic quantitative PCR detection of 12 transgenes from horse plasma for gene doping control Rare and common variant discovery by whole-genome sequencing of 101 Thoroughbred racehorses Droplet digital PCR detection of the erythropoietin transgene from horse plasma and urine for gene-doping control Real time PCR to evaluate HSV-2 shedding from anal and genital samples among men who have sex with men, living with HIV Quantitative DNA analysis using droplet digital PCR Broad Institute Genome Sequencing Platform Broad Institute Whole Genome Assembly Team Guideline−Gene doping detection based on polymerase chain reaction (PCR). Version 1 Development and strategy of reference materials for the DNA-based detection of genetically modified organisms We thank Noriko Tanaka for her assistance with this study and the Japan Racing Association for approving and supporting this study through a grant-in-aid (2020-2021).