key: cord-1038181-knj2rrut authors: Albright, Catherine J.; Hall, David J. title: An exercise in molecular epidemiology: Human rhinovirus prevalence and genetics date: 2011-11-11 journal: Biochem Mol Biol Educ DOI: 10.1002/bmb.20530 sha: 2d763661e6ae0aba4cd21109c67c5b0133ba946e doc_id: 1038181 cord_uid: knj2rrut Human rhinovirus (HRV) is one of the most common human respiratory pathogens and is responsible for the majority of upper respiratory illnesses. Recently, a phylogeny was constructed from all known American Type Culture Collection (ATCC) HRV sequences. From this study, three HRV classifications (HRVA, HRVB, and HRVC) were determined and techniques for classifying new isolates of HRV were reported. The genetic change of this virus in specific populations over time is of great interest to understand the evolution and epidemiology of viruses. To facilitate the collections of HRV sequences over a number of years, a virology experiment was designed in which students test nasal lavage samples to look for HRV infection. Students will learn a variety of techniques including RNA isolation, cDNA synthesis, qPCR, and agarose gel electrophoresis as well as bioinformatic skills though examination of sequences from the HRV‐field isolates. Furthermore, students can look at symptom data from subjects to investigate correlations between symptom severity and factors such as stress and sleep patterns. Such information can be used to examine hypotheses regarding HRV mutation, symptom severity and epidemiology. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION Vol. 39, No. 6, pp. 457–458, 2011 Virology is an interdisciplinary field that incorporates aspects of biochemistry, cell biology, molecular biology, and immunology. Thus, the study of virology is intriguing to students not only because of its depth but also because of viruses that have a very visible effect on a university population when cold and flu season arrives. Viruses are parasitic nucleic acids, either DNA or RNA, and function through the ''hijacking'' of the host cell's own machinery. A productive viral infection must spread virus to other hosts to continue the viral life cycle, and the study of viral spread in a population is a principle concern of viral epidemiology. Viral epidemiology not only examines how a virus functions in the host but also opens a window into human habits and interactions. Some viruses are transmitted through aerosols, others via an oral/fecal route, and still others through the transmission of bodily fluids [1] . Thus, many viruses thrive by taking advantage of host habits. By understanding how viruses replicate and spread, students can gain insight into the functions of viruses, cells, and evolution. Many viruses quickly adapt to environmental changes through error-prone replication, resulting in mutations and the generation of viral quasi species in which many genotypes coexist simultaneously during replication in the host cells. Subsequent infection in a new host results in replication by viruses fit for a particular environment [1, 2] . Human rhinovirus (HRV), a picornavirus, is a member of the largest pathogenic virus family that includes such members as poliovirus, coxsackie virus, hepatitis A, and foot and mouth disease virus. Picornaviruses are positive (mRNA-like), single-stranded RNA viruses, encased in a 30-nanometer icosahedral protein capsid. Their genomes contain roughly 8,000 nucleotide bases that code for 12 proteins [1, 3] . HRV has a well-characterized genome with three main classifications currently identified: HRVA, HRVB, and HRVC. During replication, millions of viral particles are made; however, the nucleic acids are not exactly identical. The HRV polymerase is error prone, and so several mutations occur through each round of replication. Thus, each viral nucleic acid can be slightly different. These variants are referred to as viral quasi species and enable HRV to adapt to the host [1, 2] . To date, there are 99 HRV serotypes recognized by the American Type Tissue Culture Collection (ATCC). although there are likely many more in the population considering the viral quasi species phenomenon. HRV infections occur worldwide at all times of the year, with strong outbreaks in the fall and late spring. However, HRV is only one of many respiratory viruses-others, including adenovirus and coronavirus, produce similar symptoms [1, 3] ..HRV infections are usually limited to the upper respiratory tract, though they can also result in acute otitis (inflammation of the middle ear), sinusitis (inflammation of the paranasal sinuses), and lower respiratory tract infections [4] [5] [6] . The extensive data available on HRV genomes enables many bioinformatics opportunities for students, including alignment of genome sequences to look for mutations at the RNA level and differences among protein sequences. Students can examine differences in the HRV serotypes over several years of data regarding a university population to identify HRV mutations that have occurred and their severity in causing symptoms in the host. This investigation is not only a way for students to study virus evolution but serves as an introduction in viral epidemiology. In this inquiry-based laboratory project, students use a variety of techniques, including RNA isolation, cDNA synthesis, qPCR, and agarose gel electrophoresis to look for the presence of HRV in nasal lavage samples from human subjects. The samples that are shown to be positive for HRV can then be sequenced to determine the strain of the isolate. By analyzing surveys in which subjects indicate severity of their symptoms, stress factors, and average hours of rest per night, students can identify possible contributors to HRV infection. These factors have been seen to make subjects more susceptible to the common cold and other viruses that cause upper respiratory infections [7] [8] [9] [10] . Students who participate in this exercise should have basic biochemistry and molecular biology laboratory skills. They should be familiar with basic laboratory equipment (pipettors and centrifuges) and agarose gel electrophoresis if there is no qPCR thermocycler available to the lab. The laboratory exercise has convenient stopping points and can be completed in two laboratory periods if done efficiently. This laboratory works well with teams of two students who are allowed to divide the work. • Students collect nasal lavage samples from themselves and classmates and survey their symptomsthis can be done before lab if student are equipped with needle-less syringes and sterile tubes and instructed in collection technique. • Students isolate RNA from samples and synthesize cDNA (process can be stopped after isolation, but it is best to proceed to cDNA step if possible). • Students setup and perform both qPCR and PCRwhile these are running, students can analyze data from symptom surveys. • (Students can run an agarose gel of the PCR product and image it.) • (Students can run an agarose gel of the PCR product and image it.) • Students prepare samples for sequencing. (Session 4-when sequence data is returned) • Students analyze sequences and draw conclusions regarding differences in serotypes found and how they differ from previous years. All aspects of the study involving human subjects must be approved by the school's institutional review board. Subjects must be informed of the experiment and the use of the data and must sign an approved consent form before a sample can be obtained from them. Students in the laboratory must be informed of the importance of anonymity of the samples they are testing, which should be assigned initials of the subject or, better yet, numbers. Students should wear gloves whenever they are handling samples or cDNA. Special care should be taken when using TRIzol reagent-TRIzol should be used in a chemical fume hood to avoid inhalation of vapors. Students should wear gloves and protective eyewear, avoiding any chemical contact with the skin or clothing. Students first collected nasal lavage samples from themselves and their classmates. Subjects were informed of the experiment and signed a consent form. Students used a 1-mL syringe (Becton Dickinson, Franklin Lakes, NJ) to inject 1 mL of water into the nostril of the subject by placing the tip of the syringe just inside the nostril touching the inside of the nose and injecting the water. The water was then allowed to fall back out of the nose and collected in a sterile 15-mL centrifuge tube (Cellstar-Greiner Bio-one, Monroe NC) held under the nostril. Students also asked their subjects to fill out a short survey (attached along with consent form) regarding their health, sleep habits, and stress levels. Subjects rated symptoms associated with the common cold on a scale of 0 to 4, with 0 being nonexistent and 4 being severe (survey included, see Supporting Information 3). Filtered pipet tips and gloves were used at all times to prevent RNA degradation of the sample. From each sample, 350 lL were moved to a sterile 1.8-mL microfuge tube (Eppendorf, Hamburg, Germany) with 25-lL 53 nucleic acid extraction buffer (Final concentrations of components: pH7.5 Tris 500 mM, NaCl 1500 mM, and EDTA 5 mM) (Supporting Information 2) and 10 lL of carrier mix for RNA isolation consisting of 0.5 lL of human DNA (100 ng/lL) (Sigma-Aldrich), 4 lL glycogen (5 lg/lL) (Applied Biosystems, Carlsbad, CA), 1 lL of glycoblue (15 lg/lL; Applied Biosystems, Carlsbad, CA) and 4.5 lL of phosphate buffered saline. RNA was isolated from the sample by adding 750 lL of TRIzol LS Reagent (Invitrogen, Carlsbad, CA), and the tubes were vortexed (Scientific Industries Inc, Bohemia, NY) vigorously for 1 minute before 200 lL chloroform was added. The tubes were again vortexed for 1 minute and then centrifuged (Eppendorf Centrifuge 5415C, Hamburg, Germany) at 13,000 3 g for 10 minutes. The aqueous layer was removed to a new Eppendorf tube containing 600 lL of RNAsefree isopropyl alcohol. The tubes were then inverted several times, allowed to stand for at least 5 minutes at room temperature and then centrifuged at 13,000 3 g for 10 minutes to pellet the RNA. The supernatant was then removed and 700 lL of 75% ethanol, at room temperature, were added to the pellet. The tubes were then centrifuged for another 5 minutes at 13,000 3 g, and the supernatant was carefully removed so as not to disturb the pellet of RNA. Each RNA pellet was then resuspended in 22 lL RNase-free water and vortexed vigorously to dissolve the RNA. Reverse transcription was conducted by first placing a mixture of the following reagents in each 0.5-mL Eppendorf tube: This first reaction was followed by a second PCR: The PCR contained 45 lL of the Platinum mix, 1 lL of P3-1 primer (25 lM), 1 lL of the P1-1 (25 lM), and 3 lL of the products from the previous PCR. The products were analyzed in the following way: A 1.5% agarose gel was used to separate the products of the PCR (SeaKem LE, Erie, PA; Phusion, NEB, Ipswich, MA) Hi-Lo 2 DNA maker loading dye (Promega, Madison, WI). Gels were made with 13 Tris Borate EDTA (TBE) buffer and run at 80V (VWR Power Source 250 V, West Chester, PA). Nucleic acids were imaged using ethidium bromide (EtBr) and a Fotodyne system (Hartland, WI). HRV PCR products were sequenced to determine the genetic makeup of each field isolate. Samples were prepared by adding 4 lL of Exo-SAP-IT (USBiochemicals, Cleveland, Ohio) into 10 lL of PCR product. This mixture was incubated at 378C for 25 minutes and then 908C for 15 minutes. The resulting product was stored at 2208C until sequencing could be performed. Sequencing was performed by DeWalch Technologies, (Houston, TX) using primer P3-1 (ACGGACACCCAAAGTAG; see Supporting Information 2). Sequences from the samples were entered into BLAST [11] . The resulting BLAST information established to which HRV sequence from the ATCC the sample was most closely related. A multiple sequence alignment and cladogram (using group tree) were both generated using web-based ClustalW2 following the instructions provided [12] . Questionnaires for determining a subject's stress sleep and symptom severity were adapted from previously published studies [7, 8, 13, 14] (Supporting Information 3) . Briefly, stress was assessed through a subject's rating his or her average stress level on a 1-5 scale with 5 being the most severe. The quality and duration of sleep was assessed based on the reported number of hours (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) . The severity of symptoms was determined by subjects rating each of their cold symptoms on a scale of 0-4 (4 most severe) and adding the resultant scores together to give a final symptom score. The symptom score was then adjusted to a 1-10 scale. A two-tailed t test was then used to determine the significance (GraphPad Software, Inc http://www.graphpad.com/quickcalcs/ index.cfm). The primer set used to detect HRV in the samples was adapted from Lee et al. [5] and detects all 99 possible ATCC serotypes through the amplification of 400-bp portion of the 5 0 noncoding region of HRV. The detection of HRV was conducted with both conventional PCR and qPCR using the same primer set. The qPCR method is preferable due to increased sensitivity. Great care needs to be taken when preparing the viral RNA from samples to prevent degradation and loss of the RNA pellet during the precipitation procedure. Of the 28 samples obtained from human volunteers, the qPCR indicated 17 were positive for HRV (Fig. 1) . The 17 positive samples showed clear differences in the amount of viral nucleic acid present as indicated by the differences in cycle number. These differences can be manifest for several reasons including stage of the infection and the efficiency of HRV nucleic acid isolation. One can quantify the exact amount of virus using laboratorygrown strains, but this approach is beyond the scope of this article and the availability of equipment in many undergraduate laboratories. The standard PCR and subsequent agarose gels demonstrated this technique's weaker sensitivity compared to the qPCR (Figs. 2a and 2b) . Of the 17 positives on qPCR, six were positive with standard PCR. Samples with low amounts of viral nucleic acid or samples contaminated with RNases likely only appear as positive on the qPCR. No samples appeared as positive in conventional PCR after a negative result from qPCR. There are 99 HRV serotypes recognized by the ATCC and extensive data available on HRV genomes enables many bioinformatics opportunities for students, including alignment of genome sequences to look for changes at the RNA level. Standard PCR was used to generate enough genetic material for sequencing, and students retrieved sequences from the service provider website and submitted them as BLAST queries to identify to which ATCC strain their HRV samples were most similar [11] . The analysis revealed that two of the six samples were variants of HRV1 with 97% of the sequence identical with the ATCC reference strain. The other samples were identified as being similar to ATCC reference strains HRV2, HRV16, and HRV85. The sequences were aligned with each other and ATCC reference strains using Clus-talW2 and a group tree cladogram was generated (Fig. 3 ) [12] . From a research perspective, the sequence data is most interesting because, as samples are sequenced yearly, it will be possible to create a genetic database of HRV strains found in a college-aged population and to do further analysis on many levels. Indeed, a discussion was initiated with the students on how the nucleotide variations between their samples and the ATCC strains may influence the ability of the virus to elude the immune system, replicate or spread between hosts. Additional avenues of investigation can include the analysis of predicted RNA secondary and tertiary structure [6, 15] and the sequencing of the nucleic acid responsible for coding of the viral capsid or the entire genome. With this information and a free program called VIGOR, students can identify all 12 viral proteins and perform bioinformatics analysis in comparison to ATCC reference strains [16] . In a sense, this laboratory is similar to the program sponsored by Howard Hughes Medical Institute called ''Phage-Hunters'' [17] . FIG. 1. Quantitative PCR of nasal lavage samples. qPCR spectra indicating HRV presence in samples. Twenty-eight total samples were run-14 of subjects who reported being sick with common cold symptoms and 14 who reported no symptoms. The x-axis indicates the number of qPCR cycles, and y-axis indicates the relative fluorescence units. Dashed lines are those subject who were HRV positive but did not report being sick. The qPCR was as described in Methods and was performed by Lawrence University students during an undergraduate general biology class. FIG. 2. Standard PCR of nasal lavage samples. PCR was conducted as described and products were subjected to gel electrophoresis and stained with SYBR green. (a) Agarose gel indicating no successful HRV standard PCR in lanes 1, 5, 9, 10. Successful standard PCR amplification for sequencing of qPCR samples in lanes 2, 3, 4, and 6. Lanes 7 and 8 correspond to HRV1 and HRV 2 positive controls, respectively. (b) Agarose gel indicating no successful HRV standard PCR in lanes 1, 2, 3, 6, 7, 9, and 10. Successful standard PCR amplification for sequencing of qPCR samples in lanes 4 and 5. Lane 8 corresponds to the HRV16 positive control. Standard PCR and subsequent agarose electrophoresis was done as described in Methods and was performed by Lawrence University students during an undergraduate general biology class. Studies have shown that the severity of experimental viral infections depends not only on the viral strain but also on the sleep and stress levels of the individuals involved [7] [8] [9] [10] . In this study, 21 of the 28 students volunteering samples chose to participate in the symptoms, stress, and sleep surveys as well (Figs. 4a-4c ). Only 10 of the 17 HRV positive subjects chose to participate in the surveys while all 11 of the subjects that tested negative for HRV participated in the surveys. Of those subjects negative for HRV, five were found to have reported being sick. In this instance, other viruses are probably responsible for the observed symptoms. This laboratory can easily be extended to include PCR primers to identify other viruses including influenza, corona virus, and adenovirus among other possibilities [18] . The average symptom, sleep, and stress scores were compared between HRV positive and HRV negative subjects and between reported sick and healthy subjects (Figs. 4a-4c) . In our data, there are two significant trends as determined by an unpaired t test. First, symptom scores correlated positively with being HRV positive (p ¼ 0.15, not significant) or sick (p ¼ 0.0001, significant; Fig. 4a ). Second, the sleep amount correlated positively with being HRV negative (p ¼ 0.1, not significant) or healthy (p ¼ 0.0077, significant; Fig. 4b ). However, there was no apparent relationship between self-reported stress level and having HRV or being sick (Fig. 4c) . A larger N value can be achieved if this laboratory is taught over multiple years, and data is analyzed in aggregate. In addition to looking for correlations between symptoms, stress levels, amount of sleep and infection, students can collect information regarding subject residences and activity participation. Students can group subjects by dormitory and when the results of the sequencing return, see whether or not different residences on campus harbor different HRV strains. However, great care must be taken to ensure student privacy in the data collection and encoding process. The value in doing this laboratory is not only pedagogical; in teaching it yearly and sequencing to track the infectious cycle of HRV, it will be possible over the years to build a record of what serotypes of HRV are affecting a population, with particular symptoms, at a given location. This information can help in answering questions such as: What types of mutation occur? What types of HRV are prevalent? What percentage of the respiratory infections is HRV? Who contracts HRV? How does this correlate to sleep habits and stress levels? By creating such a dataset, insight can be gained regarding HRV infection and mutation in a college population. Principles of Virology The authors thank Sally Berben, Ronald F. Peck, Michael Schreiber, and Wai-Ming Lee for technical assistance and critical reading of the manuscript. RNA degradation is the single most troublesome area. Although positive and negative samples can easily be identified via qPCR, getting enough PCR products for the detection by gel electrophoresis can be hampered by RNA degradation. RNase-free reagents should be used at all times, and gloves are a must. Furthermore, delays in the procedure for isolating the RNA can result in degradation. Often, running a third PCR can be helpful in these situations. This extra step can be done by using 45 lL of the Platinum mix, 1 lL of P3-1 primer (25 lM), 1 lL of the P1-1 (25 lM), and 5 lL of the products from the previous PCR.