key: cord-0876018-dr1cy4uo authors: Blok, J.; McWilliam, S. M.; Butler, H. C.; Gibbs, A. J.; Weiller, G.; Herring, B. L.; Hemsley, A. C.; Aaskov, J. G.; Yoksan, S.; Bhamarapravati, N. title: Comparison of a dengue-2 virus and its candidate vaccine derivative: Sequence relationships with the flaviviruses and other viruses date: 1992-04-30 journal: Virology DOI: 10.1016/0042-6822(92)90460-7 sha: 75cb22494be910706853de31f0c5edc20f2095a5 doc_id: 876018 cord_uid: dr1cy4uo Abstract A comparison of the sequence of the dengue-2 16681 virus with that of the candidate vaccine strain (16681-PDK53) derived from it identified 53 of the 10,723 nucleotides which differed between the strains. Nucleotide changes occurred in genes coding for all virion and nonvirion proteins, and in the 5′ and 3′ untranslated regions. Twenty-seven of the nucleotide changes resulted in amino acid alterations. The greatest amino acid sequence differences in the virion proteins occurred in prM (2.20%; 2 91 amino acids) followed by the M protein (1.33%; 1 75 amino acids), the C protein (0.88%; 1 114 amino acid), and the E protein (0.61%; 3 495 amino acids). Differences in the amino acid sequence of nonvirion proteins ranged from 1.51% ( 6 398 amino acids) in NS4 to 0.33% ( 3 900 amino acids) in NS5. The encoded protein sequences of 16681-PDK53 were also compared with the published sequences of other flaviviruses to obtain a detailed classification of 17 flaviviruses using the neighbor-joining tree method. The analyses of the sequence data produced dendrograms which supported the traditional groupings based on serological evidence, and they suggested that the flaviviruses have evolved by divergent mutational change and there was no evidence of genetic recombination between members of the group. Comparisons of the sequences of the flavivirus polymerase and helicase-like proteins (NS5 and NS3, respectively) with those from other viruses yielded a classification of the flaviviruses indicating that the primary division of the flaviviruses was between those transmitted by mosquitoes and those transmitted by ticks. A comparison of the sequence of the dengue-2 16681 virus with that of the candidate vaccine strain (16681 -PDK53) derived from it identified 53 of the 10,723 nucleotides which differed between the strains. Nucleotide changes occurred in genes coding for all virion and nonvirion proteins, and in the 5' and 3' untranslated regions. Twenty-seven of the nucleotide changes resulted in amino acid alterations. The greatest amino acid sequence differences in the virion proteins occurred in prM (2.20%; 2/91 amino acids) followed by the M protein (1.33%; l/75 amino acids), the C protein (0.88%; l/l 14 amino acid), and the E protein (0.61%; 3/495 amino acids). Differences in the amino acid sequence of nonvirion proteins ranged from 1.51 O/O (6/398 amino acids) in NS4 to 0.33% (3/900 amino acids) in NS5. The encoded protein sequences of 16681 -PDK53 were also compared with the published sequences of other flaviviruses to obtain a detailed classification of 17 flaviviruses using the neighbor-joining tree method. The analyses of the sequence data produced dendrograms which supported the traditional groupings based on serological evidence, and they suggested that the flaviviruses have evolved by divergent mutational change and there was no evidence of genetic recombination between members of the group. Comparisons of the sequences of the flavivirus polymerase and helicase-like proteins (NS5 and NS3, respectively) with those from other viruses yielded a classification of the flaviviruses indicating that the primary division of the flaviviruses was between those transmitted by mosquitoes and those transmitted by ticks. Dengue viruses, of which there are four serotypes, belong to the Flaviviridae, a family of closely related viruses transmitted by mosquitoes and ticks (Westaway et al., 1985) . These viruses produce enveloped particles containing a single positive stranded RNA genome of ca. 11 kb, which codes for the three virion proteins [Capsid (C), prM/Matrix (prM/M), and Envelope (E)] and seven nonvirion proteins (NSl, NS2A, NS2B, NS3, NS4A, NS4B, and NS5); see the review in Rice et al, (1986) . All four serotypes of the dengue virus have been isolated from patients with classical dengue fever as well as from patients with the more severe form of the disease, dengue hemorrhagic feveridengue shock syndrome (Schlesinger, 1977) . Of the millions of cases of ' Sequence data from this article have been deposited with the EMBUGenBank Data Libraries under Accession Nos. M85258 and M85259 'To whom correspondence and reprint requests should be addressed. 3 Present address: CSIRO Division of Tropical Animal Production, Long Pocket Laboratories, Brisbane, Queensland, 4068, Australia. dengue each year, most are uncomplicated dengue fever which is a mild febrile illness characterized by headache, joint pain, fever, and rash. However, some patients develop a thrombocytopenia, increased vascular permeability and, in severe cases, hypovolemic shock which may lead to death if untreated. The risk of developing severe dengue infections (DHF/DSS) is greater in patients with anamnestic infections, par-ticularly where the serotype causing the anamnestic infection differs from that which caused the earlier infection (Halstead, 1981) . It has been reported that some cases of DHF/DSS could be due to virulent strains of dengue viruses (Rosen, 1977) . In endemic areas, immunity against the four dengue serotypes can be demonstrated in the majority of the adult population, and therefore protective immunity may be gained by immunization with appropriate dengue vaccines. An ideal vaccine would (i) be effective against all four dengue virus serotypes, (ii) induce >95% seroconversion, (iii) produce lifelong immunity, and (iv) be administered to children aged 6 months to 1 year. The dengue vaccine development program at the Mahidol University in Bangkok has attempted toattenuate all four dengue serotypes by passage in vitro in cell 1. Schematic diagram of the dengue-2 16681 and 16681-PDK53 RNA genomes showing the regions which were sequenced from (i) cDNA clones (cloned fragments were represented by more than one clone in most instances and were sequenced from both strands), (ii) viral RNA genome using specific oligonucleotide primers, and (iii) fragments produced by the polymerase chain reaction (344 bp for both 16681 and 16681 -PDK53, and another 550 bp fragment for 16681). The dideoxy sequencing technique (Sanger et a/., 1977) was used in all cases; see text for details. culture. A candidate live dengue-2 virus vaccine has been produced from the dengue-2 16681 virus isolated from a fatal case in Thailand (Halstead and Simasthien, 1970 ) by 53 serial passages in primary dog kidney (PDK) cells. This dengue-2 16681 -PDK53 candidate vaccine virus appears to be biologically attenuated in several characters (Yoksan et al., 1986) , and Phase I trials with human volunteers have been completed (Bhamarapravati et al,, 1987) . These trials indicated that the vaccine produced a good immune response and no recipient showed any clinical symptoms or severe side effects. It was therefore decided to sequence the genome of these dengue-2 viruses (1668 1 and 1668 1 -PDK53) in order to define the molecular differences between them. This is a necessary step toward identifying the molecular basis of virulence. We report the comparison of the nucleotide sequences of the dengue-2 16681 and 16681 -PDK53 viruses with the sequences of other flaviviruses: dengue-1, -2, -3, and -4, and yellow fever, West Nile, Kunjin, Japanese encephalitis, Murray Valley encephalitis, St. Louis encephalitis, and tick-borne encephalitis. The dengue-2 16681 virus strain was isolated from a fatal case of DSS in Thailand in 1964 (Halstead and Simasthien, 1970) and passaged twice in Toxorhynchites ambionensis mosquitoes and once in primary green monkey kidney cells to produce the "parent virus" for attenuation studies, Yoksan eta/. (1986) . The dengue-2 16681-PDK53 virus was obtained by serial passage of the "parent virus" in PDK cells. The 16681 virus was passaged weekly in PDK cells for 53 passages and at this point it appeared to be attenuated as judged by biological markers such as plaque size, temperature sensitivity, and mouse and monkey neurovirulence (Yoksan et al., 1986) . The seed stocks for both 16681 and 16681 -PDK53 viruses were grown for one passage in Aedes albopictus cells (clone C6/36) (Igarashi, 1978) before large scale propagation in C6/36 cells for cloning and sequencing experiments. Growth of viruses and purification C6/36 Aedes albopictus cells were grown at 32" in RPMI 1640 medium containing 10% fetal calf serum, FIG. 2. Nucleotide and encoded amino acid sequence of dengue-2 16681-PDK53 virus. This is a consensus sequence based on the sequencing strategies shown in Fig. 1 , The location of the N-and C-termini of the proteins is based on Kunjin virus (Coia eta/., 1988; and the potential glycosylation sites of the virion and NSl proteins are circled. The 3'terminal 14 nucleotides have not been deduced but are assumed to be the same as those found in West Nile virus (Wengler and Wengler, 198 b AC"A"oACGCA"GACA"AGG"A"~G~"GAc""*"c""Gccc"Ac"AGcAGcc""c~G"~GAcc~c"""uGcAGc"GG*c"Ac"c""GAG~G THTDDIGUGVTYLALLAAFKVRPTFAAGLLLRK 3762 122z C"GACCUCCAAGGAAU"GAuGA"GAC"AC"A"A~~""G"AC"C"C~CCCAGAGCACUAUACCAGAGACCA"UCUUGAGUUGAC"GAUGCGUUAGCC 3861 LTSKEL,4,4TTIGIVLSSQSTIPETILELTDALA 1255 UUAGCCAUGA"GG"CCUC~UGGUGAG~UAUGG~GUAUCMUUGGCAGUGACUA"CA"~CUAUCUUGUGCGUCCC~CGCAG"GA"A""A LCHMVLKXVRNXEKYQLAVTIMAILCVPNAVIL 3960 1288 CAAAACOCA"GGAAAG"GAG""GCAC~"AU"GGCAG"GG"GUCCG"""CCCCAC"G"UCU"~CAUCCUCACAGC AAAAAACGGA"UGGA"ACCA""A QNAWKVSCTILAVVSVSPLFLTSSQQKTDWIPL GCC""GACGA"CAAAGG"C"ClUUCCRACAOCUAUUUUUC"~C~CCC"C"C~G~CCAGC~G~GG GCUGGCCAUUAAAUGAGGCUAUCAUG pNS2B 4069 1321 ALTIKCLNPTAIFLTTLSRTSKKRSWPLNSAI,, 4158 1354 GCAG"CGGGA"GG"GAGCA""""AGCCAGU"C"C"CC" lUARAA"GA"A""CCCA"GACA~ACCA""AGUGGC"GGA~GC"CC"CAC"G"G"GC"AC AVGHVSlLASSLLKNDIP,,TGPLVAGGLLTVCY 4257 1387 G"GC"CAC"GGACGA"CGGCCGA"""GG~CUGGAGAGAGCAGC~AUG"C~"GGG~GACCAGGCAGAGAUA"CAGG~GCAGuCC~uCC"G"CA 4356 VLTGRSADLELERAADVKWEDQAEISGSSPILS 1420 A"AACAA"AUCAGAAGA"GG"AGCAUGUCGA"AAAAAA "GAAGAGGAAGAACAAACAC"GACCA"AC"CA""AG~GAGGA""GC"GG"GA"C"~GGA ITISEDGSHSIKNEEEEQTLTILIRRGLLVISG r, NS3 C"""""CCUG"A"CAA"ACCIIRUCACGGCAGCAGCA"GG"ACC"G"GGG~G"G~G~C~C~GCCGGAGUA"UG"GGGAUG""CC""CACCCCCA LFPVSIPITAAAWYLWEVKKQRAGVLWDVPSPP 4455 1453 CCCAUGGGAAAGGC"GAAC"GCARGAUGCAGCC"A"AG~U"~GC~GGGA""C""GGA"A""CCCAGA"CGGAGCCGGAG"""AC-G~GGA PHGKAELEDGAYRIKQKGILGYSQIGAGVYKEG 4554 14.96 4653 1519 ACA""CCA"ACAA"GUGGCA"G"CACACGUGGCGC"G""C"~"GCAU~GG~GAGGA""G~CCA"CAUGGGCGGACG"C~G~GACC"~"A 4752 TFHT"WHVTRGAVLHHKGKRIEPSWADVKK"L1 1552 4851 1585 ACGAAACC"GG"CUU"UC~CCMCGCCCGCCGG~C~UAGG"GC"G"AUCUCUGGACU"""C"CC"GG~CG"CAGGA"C"CC~U"A"CGAC~ TKPGLFKTNAGTIGAVSLDFSPGTSGSPI I D K K GGAIV\AG""G"~GUC"""A"GGU~"GG"G""G""A~~GGAG"GGAG~A"A"G"GAG~G~"A"AG~~CAGACUG AAAAAAGUAUUGAAGACAACCCA GKVVGLYGNGVVTRSGAYVSAIAQTSKSISDNP CAGACCCcAGCCA"CAGAGC"GAGCACACCGGGCGGGAGA""GUGGACC"~UG"G"CA"GCCACA"""ACCA"GA~C"GC"A"CACCAGU"AGAG"G QTPAIRAEHTGREIVDLMCHATFTHRLLSPVRV 5346 1750 5445 1783 CG""CGUGGlU""CCGGACA"G~"GGG"~CGGAU~"~GGG~GAC"G"U"~""CG""CC~G"A"~G~~~"GA"A"AGCAGC""GC RSWNSGHEWVTDFKGKTVWFVPs IKAGNDIAAC C"GAGG~"GCAAAGAAAG"GA"AC~C"CAGUAGG~GACCU""GAU"C"GAGUA"G"C~GAC"AG~CC~UGA""GGGACU"CG"GG""ACA LRKNGKKVIQLSRKTFDSEYVKTRTNDWDFVVT 5544 1816 5643 1849 5742 lSS2 AC"GACA"""CAG~"GGG"GC~"""C~GGC"GAGAGGG""A"AGACCCCAGACGC"GCA"G~CCAGUCA"AC"~CAGAU~"G~GAGCGG TDISEMGANFKAERVIDPRRCMKPVILTDGEEK G"CA""C"CCCAGGACC"A"GCCAG"GACCCAC"C"AG"GCAGCAC~G~GA~GAG~"AGG~G~UCC AAAAAA"GAGMUGACCAG"ACA"A VILAGPHPVTHSSAAQRRGRIGRNPKNs"DQY1 5841 1915 5940 1948 6039 1981 AGCAUG""CGAACCAGAGCG"G~GG"GGAUGCCAUUGA"GGCG~"ACCGC""GAGAGGAG~GC~GGACCACC"""G"AGACU"~"GAGMGA SUFEPEREKVDAIDGEYRLRGsARTTFVDL,,KR 6138 2014 GGAGACC"ACCAG"c"GG""GGCC"ACAGAGUGGCAGC"G~GGCA"C~C"ACGCAGACAG~GGUGG"G""""GA"GGAG"C~G~C~CC~UC GDLPVWLAYRVAAEGINYADRRWCFDGVKNNQI 6237 2047 CCGC"AAAAGAA"""AAGGAA"""GCAGCCGGAAGAM,G C"C"GACCC"GAACC"AAUCACAG~UGGG"A~C"CCC~CC""CA"GAC"CAG~G G'NS4A 6336 2080 ALKsFKEFAAGRKSLTLNLITENORLPTFHTpR 6435 2113 6534 2146 ACA""GC""""AC"GACAC""CUGGC"ACAG"CACGGGAGGGATCU""""A""C"UGA"GAGCGC~GG~CA"AGGG~GA"GACCC"GGG~"G"GC TLLLLTLLATVTGGIFLFL"SAKGIGK"TLG,,C 6633 2179 "GCA"AA"CACGGC"AGCA"CC"CCUAUGG"ACGCAC~"ACAGCCACAC"GGA"AGCAGC""C~"~UAC"GGAG""""""C"CA"AG""""GC"" c I ITASILLWYAQIQPHWIAASIILEFFLIVLL +NSQB A""CCAGAAcc"GAAAAAcAGAGMCACCCCACcCC~GAC~CC~C"GACC"ACG""G"CA"AGCCA"CC"CACAGUGG"GGCCGC~CCAUGGC~CGAG IPEPEKQRTPQDNQLTYVVIAILTVVAATHANE 6732 2212 A"GCG"""CC"AGRAARAR CGMGAAAGAUCUAGGAVUAGG~GCAACCCAGCAA UGFLEKTKKDLGLGSIATQQPSSNILDIDLRPA 6930 2278 "CAGCA"CGACGC"G"A"GCCG"GGCCAC~CA""UG""ACACC~"G""GAGACA"AGCA""G~U"CCUCAG"G~"G"G"CCC"~CAGC"A"A SAWTLYAVATTFVTPMLRHs IENSSVNVSLTAI GCCAACCAAGCCACAG"G""~UUGGGG"C"CGGG~GGA"GGCCA""G"C~GA"GGACA"CGGAG""CCCC""C"CGCCA""GGA"GC"AC"CAC~ ANQATVLXGLGKGWPLSKHDIGVPLLAIGCYSQ 7029 2311 7128 2344 G"CAACCCCACAAC"C"CACAGCAGC"C"U""C""A"UGG"AGCACAU"A"GCCA"CAUAGGGCCAGCAC"CC~GC~GCAUCCAGAG~GC"CAG VNPTTLTAALFLLVAHYAI ICPALQAKASREAQ RI\AAGAGcAGCGGCGGGCA"CA"G~CCC~C"G"CGA"GG~"~CAG"GA""GACCUAGA"CC~"ACCU"A"GA"CC~G"U"G~GCAG KRAAAG IMKNPTVDG ITVIDLDPIPYDPKFEKQ C ""CGGACAAG"AA"GC"CC"A~"cc"c"~c~"~Ac"cAAGc"Acc~~~ccc LCQ"HLL"LC"TQ"Ln"RTTWALCEALTLRTCP AGCAGACCCCCCCGAAACA 100 units/ml penicillin, and 100 pg/ml streptomycin. At 70% confluency, the cells were infected with the dengue-2 16681 or 16681-PDK 53 virus in RPMI 1640 containing 0.2% bovine serum albumin (BSA). Hemagglutinin (HA) was assayed as described by Clarke and Cassals (1958) and tissue culture fluids containing the virus particles were collected for 3 consecutive days, at peak HA titers. After centrifugation to remove cells and debris, the virions were precipitated by adding i volume of 40% polyethylene glycol, 15% NaCI, and incubated at 4" for 1 hr. This precipitate was then collected by centrifuging at 7500 g for 30 min at 4" and resuspending in buffer (0.1 M NaCI, 0.012 MTris-HCI, pH 7.5, 0.2% BSA) before centrifuging in a 5-500/o sucrose gradient. The visible band of virions was collected and viral RNA extracted in 0.05 M Na acetate, 0.1 mM EDTA solution containing 0.5% SDS, and an equal volume of phenol/chloroform. The RNA was recovered from the aqueous phase by ethanol precipitation. It was ethanol precipitated twice before use for cDNA synthesis. Synthesis of cDNA cDNA was synthesized as described previously (Blok et al., 1984) . Viral RNA and either a 14-mer based on the West Nile virus sequence (Wengler and Wengler, 1981) or oligo (dT),,-,, were heated to 90" for 1 min, quickly chilled in an ice/water bath, and used as a template for cDNA synthesis. For synthesis, [32P]-dATP (Bresatec) and avian myoblastosis virus reverse transcriptase (AMV RT, Life Sciences) were added and incubated at 37" for 1 hr. The cDNA-RNA hybrids produced by this reaction were then used for synthesis of double-stranded DNA (dsDNA) using RNase H and DNA polymerase (Boehringer-Mannheim) as described by Okayama and Berg (1982) . The resulting dsDNA was cleaved with the restriction enzymes Sau3A or BarnHI, inserted into the BamHl site of the plasmid vector pUC8, and cloned using standard methods (Maniatis et al., 1982) . of DNA using the polymerase chain reaction (PCR) Oligonucleotide primers (20-23-mers with an average GC content of 60%) flanking the regions to be amplified were made in a Milligen oligonucleotide synthesizer (University of Queensland), desalted in a PD-10 Sephadex G-25 M column (Pharmacia), and the first three fractions were dried and resuspended in 100 ~1 water. A reaction mixture containing 1 ~1 (ca. 100 pmol) of each of the two primers, 400 ng viral RNA in 1.5 mM MgCI,, 50 mM KCI, 10 mM Tris-HCI, pH8.0, and 0.02% BSA was heated to 95" for 1 min and cooled to 42" over 3 min. AMV RT (2 units) and Taq polymerase (2 units, Cetus) were added to the mixture and this was incubated at 42" for 5 min. The DNA was then amplified for 30 cycles (95" for 1 min, 55" for 2 min, and 72" for 3 min). The resulting fragments were purified on a polyacrylamide gel, eluted in buffer (0.1 M Tris-HCI, pH8.0, 0.5 M NaCI, 5 mM EDTA) for 16 hr at room temperature, phenol extracted, and ethanol precipitated prior to sequencing. Sequences were obtained by the dideoxy chain termination method as described (Sanger et al., 1977) . In a typical experiment, 4 pg of circular plasmid DNA or purified PCR fragment DNA was added to 40 ~1 of 0.2 M NaOH, 0.2 mM EDTA in order to separate the strands. This mixture was incubated at room temperature for 5 min, precipitated with 3 vol of ethanol in the presence of 37.5 mM Na acetate pH5.5, and then collected by centrifugation in a microfuge for 5 min. The resulting pellet was dried in a vacuum and resuspended in 20 ~1 sterile water. This treated DNA was enough for five sequence experiments and could be stored up to 2 weeks at -20". Treated DNA (4 ~1 or 0.8 yg) was mixed with 1 yl (1 pmol) of primer, heated to 95" for 1 min, quickly chilled in an ice/water bath, and used in the dideoxy sequencing procedure in the presence of [32P]dATP (Bresatec). In most cases 0.5 unit DNA polymerase (Klenow fragment, Boehringer-Mannheim) was used to obtain sequence data, but for some of the larger clones, 3 units of Sequenase (Trace) was used. The reactions were stopped by the addition of formamide containing bromophenol blue and xylene cyanol dye, heated to 95" for 3 min, and chilled prior to electrophoresis in So/o polyacrylamide gels containing 7 M urea. The gels were fixed in 10% acetic acid, wrapped in Saran Wrap, and autoradiographed at room temperature without intensifying screens. Dengue-2 genomic RNA was sequenced using the dideoxy chain termination method (Sanger et al., 1977) as described (Air, 1979) . Briefly, oligonucleotide primers, based on sequence data obtained from cDNA clones or published dengue-2 sequences, were synthesized on a Biosystems oligonucleotide synthesizer (Qld lnst Med Res), desalted in a PD-10 Sephadex G-25 M column (Pharmacia), and the first three fractions were dried and dissolved in 100 ~1 water. Primer (usually 1 ~1 of 1: 100 dilution, although the dilution was determined empirically for each primer) and 0.05 vg RNA were heated to 90" for 1 min, quickly chilled in ice/water, and sequenced with 1 .O unit of AMV RT(Life Sciences). Sequences were aligned, compared, and classified using various programs of the RSBS SEQ, the RSBS STATS, and the GCG (Version 6.2; Devereux et al., 1984) libraries. The progressive alignment program Doolittle, 1987, 1990) with the MDM78 similarity matrix (Dayhoff et a/., 1978) and the neighborjoining tree program (Saitou and Nei, 1987; Studier and Keppler, 1988) were also used. The FASTA program of the GCG library was used to search Release 16 (November 1990) of the SwissProt database for proteins with sequences related to those of flaviviruses. The sequencing strategies used to obtain the complete nucleotide sequences of the dengue-2 16681 and 16681-PDK53 viruses are shown in Fig. 1 . This figure shows that the sequences of the E and NSl genes were obtained from both cloned material and from sequencing the RNA genome directly. Other regions which were not isolated during the preparation of the cDNA libraries were also sequenced from the viral RNA using specific oligonucleotide primers based on sequences from the dengue-2 16681, 16681-PDK53 or Jamaica (Deubel et a/., 1986 (Deubel et a/., , 1988 strains. The cDNA clones were obtained using either oligo dT,,-,, or the synthetic oligonucleotide 5' AGAACCTGTT-GATT 3' as a primer for cDNA synthesis followed by Sau3A or BarnHI digestion. Both the 16681 and 16681-PDK53 cDNA libraries contained multiple copies of most of the cloned fragments, although there was a unique BarnHI clone in the 16681 -PDK53 library. The cDNA clones were sequenced from both strands and represented 79 and 74% of the 1668 1 and 1668 l-PDK53 RNA genome, respectively. The 3' terminal primer, 5' AGAACCTGTTGATT 3', was used to sequence the extreme 3' end of the viral RNA but the sequence of only 79 nucleotides could be determined. The rest of the 3' nontranslated region proved to be difficult to sequence with specific primers based on the dengue-2 Jamaica sequence (Deubel et al., 1988) , probably because of sequence mismatching (when the sequences of Jamaica and 16681/16681-PDK53 were compared, 19/l 5 of the 454 nucleotides differed). This region at the 3'end of the 16681-PDK53 genome was therefore amplified using the polymerase chain reaction (PCR) and the primers 5' CTTTCCCAG-CGTCAATATGCTG 3' (based on sequence of the viral RNA) and 5' GGCTAGAAGTCAGGTCGGATC 3' (based on sequence of a 16681 -PDK53 cDNA clone). The resulting 344 bp fragment (nucleotides 10296 to 10649, see Fig. 2 ) was produced in five different PCR experiments, and each of these was sequenced to ensure that any possible errors produced during amplification by the Tao polymerase could be detected. However, the five sequences were identical. For the 16681 virus, the sequence of this 3' noncoding region and the 3' portion of the NS5 gene were obtained by amplifying the same 344 bp fragment using PCR as well as a 550 bp fragment using the primers 5' CCTTGGACGGGG-CTCACAGGTAG 3' (based on sequence of 344 bp PCR fragment) and 5' GGGAGACGGCCTGTTT-GGGG 3' (based on sequence 16681-PDK53 cDNA clone). The resulting sequence from these two amplification reactions encoded the region from nucleotides 9815 to 10649. The entire nucleotide sequence of the dengue-2 16681-PDK53 viral RNA genome and its encoded amino acid sequence are shown in Fig. 2 and it represents the consensus RNA sequence obtained using three different strategies (Fig. 1) : (i) cDNA cloning, (ii) sequencing particular regions of the viral RNA genome directly, and (iii) amplification of a fragment at the 3'end using PCR. The 3'terminal 14 nucleotides, complementary to the synthetic primer based on the West Nile virus sequence, have not been deduced but have simply been added to the 3' terminal deduced nucleotide since this primer hybridized very well both dengue-2 16681 and 16681-PDK53 RNA. The differences between the 16681-PDK53 sequence (Fig. 2 ) and the sequence of its parental virus 16681 are shown in Table 1 . The N-terminal amino acid sequences of the prM, M, and E proteins of the 16681 virus were determined using a gas phase protein sequencer coupled to reverse phase ODS columns, These confirmed the encoded sequences as well as those reported previously for dengue-2 proteins. Comparison of the 16681 and 16681 -PDK53 nucleotide sequences There were 53/10723 (0.49%) nucleotide differences between the parental dengue-2 16681 and its candidate vaccine derivative 16681 -PDK53 virus, of which about half (26/53) were silent. Table 2 highlights the differences in the genes and proteins of these two viruses and together with Table 1 show that the differences occur throughout the entire genome in both the coding and noncoding regions. Nucleotide changes occurred more frequently in regions such as the 5'and 3' untranslated regions (1.04% and 1.32% or l/96 and 6/454 nucleotides differences, respectively), the C (1.17% or 4/342 nucleotide differences) and the prM portion of the prM/M gene (1,100/o or 3/273 nucleotide differences). A disproportionate number of nucleotide differences which led to amino acid substitutions (14/ 17 or 88%; which can also be expressed as 12% silent mutations) was detected in the prM/M, NS2, and NS4 regions. The other proteins had a higher percentage silent mutations (55%). Tables 1 and 2 also summarize the amino acid differences between the virulent and attenuated dengue-2 viruses. Of the virion proteins, the prM peptide of the prM/M protein has changed to the greatest extent (2.20% or 2/91 amino acid differences). One of the amino acid substitutions (Asp to Val, 16681 to 16681-PDK53, respectively) changed the character of the residue from hydrophilic to hydrophobic and could therefore be important. Only one amino acid substitution (Phe to Leu, a conservative change) was present in the M peptide of the prM/M protein (1.33% or l/75 amino acid differences). A 0.88% amino acid difference (11 1 14 amino acids) with a change of charge from a basic residue to an acidic one (Lys to Asp) was found in the capsid protein. The other virion protein, the envelope glycoprotein, had 3/495 amino acid substitutions (or 0.6% difference). NSl had 7/1056 nucleotide changes and 3/352 amino acid changes. One of the three amino acid substitutions changed a small residue, Gly, to an acidic residue, Asp. The non-structural proteins, NS2 and Mason et al. (1987) This work Deubel et al. (1986 Deubel et al. ( , 1988 NS4, contained a high proportion of amino acid differences (1.38 and 1.54% or 3/218 and 2/l 30 amino acids for NS2A and B; and 2.00 and 1.21% or 3/l 50 and 3/248 amino acids for NS4A and B, respectively). The NS3 and NS5 proteins were more highly conserved, showing a 0.49% (3/618 amino acids) and a 0.33% (3/900 amino acids) protein difference, respectively. The preservation of sequence was also observed at the gene level with a 0.22% (4/l 854 nucleotides) and 0.26% (7/2700 nucleotides) difference for the NS3 and NS5 genes, respectively. The relatedness of dengue-2 16681116681 -PDK53 virus pair to other flaviviruses, and the relationships within the group, were determined using the protein sequences deduced from their published nucleotide sequences to make various comparisons. As the 16681 and 16681 -PDK53 viruses were very closely related, only the 16681-PDK53 sequence (Fig. 2) was used in further analyses. The viral protein sequences compared were those of Dl, D2JAM, D2NGC, D2PDK, D2S1, D3, D4, JEV, JE, KUN, MVE, SLE, TBE, TBEV, WNV, ASIBI and YF viruses (see Table 3 for full virus names and references). At the time of our analyses, only parts of the genomes of Dl, MVE, and SLE were known to us, and so only some of their proteins were used in the comparisons. The 11 flavivirus proteins (C, prM, M, E, NSl, NS2A, NS2B, NS3, NSLFA, NS4B, and NS5) were analyzed sep-arately. In each of the 11 sets the proteins were aligned, and the Feng, Johnson and Doolittle (1985) distance measures (FJD distances) between all pairs of each set were estimated after progressive alignment Doolittle, 1987, 1990) . The smallest FJD distances were between the NS5 proteins, and therefore the FJD distances for the others were, for each virusvirus comparison, converted into the distances relative to those of the NS5 comparisons. These ratios were averaged for each protein (Fig. 3) and showed that the amount of change of the proteins, relative to NS5, varied over almost a fourfold range. The E, NSl, and NS3 proteins differed only up to 1.60 times as much as Table 3 for full names and references of viruses). The sequences were aligned by the progressive-alignment method, and the phylogenetic Feng, Johnson, and Doolittle (1985) distance measures (MDM78 distance matrix) classified by the neighbor-joining method (Saitou and Nei, 1987) . Scale units = 10%. NS5, but the C, NS2A, NS2B, NS4A, and NS4B proteins differed by up to 3.66 + 0.1 1 times as much. This result is similar to that obtained by Miller and Purcell (1990) who calculated a histogram (their Fig. 3 ) of the invariant residues in the aligned proteins of eight flaviviruses. Next, for each of the 11 proteins, a dendrogram representing the relationships of the individual proteins was calculated by the neighbor-joining tree method (Saitou and Nei, 1987) . The 11 dendrograms (Fig. 4) are closely similar in topology, especially those of the most slowly evolving proteins (i.e., E, NSl, NS3 and NS5), but their total branch lengths differ, thereby confirming the relative differences shown in Fig. 3 . The dendrograms only differ in minor ways, mostly in their fine "twigs" (e.g., the relative positions of the dengue- As the analyses indicated that the flaviviruses had evolved by divergent mutational change, a single simple dendrogram was constructed (Fig. 5) by combining the distance matrices of the 11 proteins after correcting for their differences in size and relative rates of change. This dendrogram places the viruses into four clusters; the two TBE isolates, the two YF isolates, the six encephalitis viruses, and the seven dengue virus isolates. This classification is similar to that reported by Mandl et al. (198913) who used modified percentage identities of aligned sequences of the NSl proteins of 1 1 flaviviruses to calculate a dendrogram by an undisclosed method. Our dendrograms, obtained by the neighbor-joining tree method, have given a more detailed binary branching pattern than that of Mandl eta/. (1989b) , which included two unresolved trifurcations. We examined the sequence similarities of the conserved flavivirus proteins (NS5 and NS3), with those of -ASIBI FIG. 5. Combined dendrogram illustrating the relationships of 17 flaviviruses (whose full names are shown in Table 3 ). Scale units = 5%. other viruses and other organisms, to find the possible relatives of flavivirus genes. We also used the related proteins as "outgroups" to establish the position of the "root," and hence perhaps the "oldest" branching point, of the combined dendrogram of flaviviruses (Fig. 5) . The motif which includes the characteristic sequence -GDD-(or a close derivative of it) is found in all viral replicases (Argos, 1988; Bruenn, 1991; Diamond et al., 1989; Gorbalenya and Koonin, 1988; Habili and Symons, 1989; lcho and Wickner, 1989; Kamer and Argos, 1984; Koonin et al., 1989; Pietras et a/., 1988; Poch et al., 1989) including the NS5 protein of flaviviruses. We classified the -GDD-regions of 63 viral genomes including those of 12 flaviviruses. Each -GDD-region compared consisted of 300 amino acid residues, stretching from 240 residues on the N-terminal side of the -GDD-to 60 residues to its C-terminal side. These sequences were aligned pairwise using the Needleman-Wunsch algorithm (MDM78 matrix and gap penalty 6, Needleman and Wunsch, 1970) and the number of identical residues was recorded, as well as the difference between the alignment score and the mean score of 100 randomizations of the sequences expressed in standard deviation units (StDUs) calculated from the randomizations. Both sets of data were converted to distance matrices; the identical residue scores were converted into the percentages of nonidentical residues ('O/o non-ident'), including gaps as a "21st" amino acid, and the standard deviation measures were converted into "standard deviation distances" (StDDs') between each pair of sequences using the formula StDD = 2.0 -log1 0 (StDU + 3.0). The neighbor-joining tree method was then used to compute dendrograms from the distance measures. Both dendrograms placed the "root" of the flavivirus tree on the link joining the TBE branch to that of the other flaviviruses. Figure 6 shows the % non-ident dendrogram. It placed the viruses into seven major groups. The flaviviruses were a sister group to the pestiviruses which then joined sequentially to the carmo-luteo(BYD) cluster and hepatitis C. These viruses then had the sobemo-luteo(BWY) cluster and black beetle nodavirus as a sister group, before being linked to the other viruses. The StDD dendrogram placed the viruses (except black beetle nodavirus and Q/3 levivirus) into the same seven major groupings as the YO non-ident dendrogram, but the groupings were linked differently so Table 3 for full names of viruses); pesti-: bovine viral diarrhea and hog cholera pestiviruses; carmo-/uteo(BYD)-: barley yellow dwarf (PAV) and soybean dwarf luteoviruses, red clover necrotic mosaic dianthovirus, carnation mottle, melon necrotic spot and turnip crinkle carmoviruses, maize chlorotic mottle virus, cucumber necrosis, cymbidium ringspot and tomato bushy stunt tombusviruses, tobacco necrosis necrovirus; sobemo-/uteo(BW')-: beet western yellows, potato leaf roll (two isolates) luteoviruses, southern bean mosaic sobemovirus (two isolates); Sindbis-: alfalfa mosaic virus, brome mosaic bromvirus, cucumber mosaic cucumovirus, barley stripe mosaic hordeivirus, tobacco rattle and pea early browning tobraviruses, tobacco mosaic tobamovirus, Sindbis alphavirus, eggplant mosaic, kennedya yellow mosaic, ononis yellow mosaic and turnip yellow mosaic tymoviruses, potato X and white clover mosaic potexviruses, potato M and potato S carlaviruses, beet necrotic yellow vein furovirus; retroid-orfhomyxo-/evil: human immunodeficiency lentivirus, influenza A orthomyxovirus, mouse hepatitis coronavirus, cauliflower mosaic caulimovirus, Q3 levivirus; picornapory-coma-: rhinovirus 2, poliovirus, encephalomyocarditis cardiovirus, foot-and-mouth disease aphthovirus, cowpea mosaic comovirus, plum pox, potato Y, tobacco etch and tobacco vein mottling potyviruses. Scale units = 10%. that although the pestiviruses and hepatitis C were sister groups, they were further from the flaviviruses. The reasons for the differences between these dendrograms are not known, but it could have resulted from the decision to use a segment of fixed length from each sequence, rather than attempting to align subjectively some very tenuously homologous sequences. The nucleotide binding and helicase motifs, which have the core sequence -GKT/S-, are found in many proteins including those of viruses (Walker et al., 1982; Higgins et al., 1986; . This motif is found in the C-terminal part of the NS3 protein of flaviviruses, and the region has been shown Miller and Purcell, 1990) to have sequence similarities to the NS3-like protein of hog cholera pestivirus and hepatitis virus C and, more distantly, to the cylindrical inclusion (Cl) proteins of the potyviruses (Bazan and Fletterick, 1989) . Recently, Lain et a/. (1989) have confirmed that the potyviral Cl proteins have helicase activity, and Chambers et al, (1990) have shown that the NS3 protein of YF is a viral protease for specific post-translation processing of the viral polyprotein. A search of the Swissprot database (Release 16) by the FASTA technique using the NS3 proteins of D2PDK and YF failed to identify any other proteins with sequences of more than 100 amino acid residues with significant similarity; several much shorter sequences with clear similarity were found. Therefore a classification was computed of five flavivirus NS3 proteins (D2PDK, D3, D4, YF, and TBE), the NS3-like proteins of the two pestiviruses and hepatitis C virus, and the Cl proteins of plum pox (strain D; Teycheney et al., 1989) potato virus Y (strain N; Robaglia et al., 1989) tobacco etch (Allison et al., 1986) and tobacco vein mottling (Domier et a/., 1986) viruses using the progressive-alignment/FJD distance/neighborjoining, tree method. The classification further confirmed that the flavivirus tree is rooted near the base of the TBE branch. The classification showed clearly that the potyvirus Cl proteins are only very distantly related to the NS3 and NS3-like proteins. The dendrogram was, in essence, H-shaped with a short central link of 20% with branches at one end to the pestivirus and hepatitis C proteins of 78 and 98% respectively, and, from the other end, of 31 and 155% to the flavivirus and potyvirus proteins. Searches of the Swissprot database (Release 16) and the NSl and E proteins of D2PDK and YF using the FASTA technique found only a few short sequences (~100 residues) with similarity to the NSl proteins, but a clear sequence similarity between the flavivirus E proteins and the spike glycoprotein precursors of vesicular stomatitis rhabdovirus, strains Orsay and Glasgow (Gallione and Rose 1985; Vandepol and Holland, 1986) was found. The alignment scores of the E and spike proteins by the Needleman-Wunsch (1970) method were 2.3-2.9 standard deviations from the mean alignments scores of 100 randomizations of the same sequences. Differences in the nucleotide and encoded amino acid sequences occurred throughout the genomes of the virulent (16681) and attenuated (16681 -PDK53) dengue-2 virus strains. Certain genes (M, NS4A, and NS4B) displayed nucleotide changes which all led to amino acid substitutions, while others (C, E, NSl, and NS5) underwent a majority of silent nucleotide changes. However, it is not possible to predict from these changes which particular protein(s) and/or region(s) of the genome are involved in the reported attenuation of 16681 -PDK53. Monoclonal antibodies to at least three dengue virus proteins, namely, prM, E, and NSl , have been shown to protect mice against challenge with dengue viruses (Brandiss eta/., 1986; Henchal eta/., 1988; Kaufman et al., 1987 Kaufman et al., , 1989 Schlesinger et a/., 1987; Zhang et al., 1988) . There are nucleotide and amino acid differences between the 16681 and 16681-PDK53 viruses in these proteins but their significance is not obvious. The prM-specific monoclonal antibodies which protected mice against challenge with both homologous and heterologous dengue viruses (Kaufman et al., 1989) did not display neutralizing activity in vitro. The locations of the protective epitopes within the prM peptide have not yet been identified and it is therefore not known whether the amino acid differences (Table 1) found between the 16681 and 16681-PDK53 viruses in this protein are related to protection. The E protein is the major dengue virus antigen and antibodies to this protein neutralize the virus. Proposed three-dimensional structures for the E protein of related flaviviruses, WN virus (Nowak and Wengler, 1987) and TBE virus (Mandl et al., 1989a) have been developed based on monoclonal antibody-binding and proteolytic cleavage studies. Adapting the dengue-2 envelope protein sequence to the flavivirus models shows that the three amino acid changes are in stem regions. It seems unlikely that the differences in the E protein are significant as two of the changes (Val to Ile and Met to lie) are conservative changes which occur in the stem structures of domain A, rather than in the more accessible loops (based on the model of the E protein of TBE virus, Mandl eta/., 1989a) . Interestingly, the other amino acid substitution (Thr to Ile) is in the same position as an antigenic variant reported for TBE virus (Lys to Glu) (Mandl et al., 1989a) ; and this Thr to Ile substitution occurs in a region of dengue-2 which reacts with sera from a range of dengue-2-infected hosts (Aaskov et a/., 1989) . The positions of the three amino acid changes are also outside any proposed flavivirus group reactive or dengue-2 specific continuous epitopes proposed by lnnis et al. (1989) . Twelve amino acid differences were detected between the E proteins of the virulent (ASIBI) and an attenuated 17D yellow fever viruses . This contrasts with five amino acid differences observed in the E protein of a virulent and an attenuated strain of JE virus (Nitayaphan et al,, 1990 ) and the three amino acid differences reported here for the dengue-2 16681 (virulent) and dengue-2 16681 -PDK53 (attenuated) viruses. This difference in number of amino acid changes between the parent and attenuated strains (12 for YF pair, 5 for JE pair, and 3 for the dengue-2 pair) may simply reflect the different clinical syndromes and pathogenesis of these viruses, or the different passage histories of these viruses [234 passages for YF pair (Theiler and Smith, 1937) , 122 passages for the JE pair and 53 passages for the dengue-2 pair (Yoksan et al., 1986) ], or the fact that the YF and JE viruses were plaque purified before cDNA cloning and sequencing whereas the dengue-2 viruses used for cloning and sequencing represent the virus population as a whole. Monoclonal antibodies to the NSl protein protected mice against lethal challenge in experiments with YF virus (Gould et al., 1986; Schlesinger et a/., 1985 ) and dengue-2 (Henchal eta/., 1988; Schlesinger et al., 1987) indicating that NSl may be important in flavivirus infections. There are three amino acid substitutions (Table 1 ) and one of these (Gly to Asp) may be significant because this Gly is present in all other flaviviruses sequenced to date, i.e., Dl, D2JAM, D2NGC, D2S1, D3, D4, JEV, JE, KUN, MVE, SLE, TBE, TBEV, WNV, ASIBI, and YF viruses; see Table 3 for full virus names and references. However, the three-dimensional structure of NSl is not known and, as no model has been proposed, it is difficult to assess the significance of this substitution. The function of the NS2 and NS4 proteins in virus replication is unknown but their hydrophobic profiles are remarkably conserved among both the dengue viruses and flaviviruses in general, although their amino acid sequences are not. The amino acid substitutions (Table 1) in the NS4 region would have only marginal effects on this hydrophobicity profile but the NS2 regions involve substitutions of hydrophilic residues for hydrophobic ones, e.g., Met to Arg and Ala to Glu in NS2A. The significance of these results is difficult to determine since the function of these hydrophobic regions is unknown. The NS3 and NS5 proteins are important in viral replication; and NS3 has been shown to contain sequences characteristic of a trypsin-like serine protease (Bazan and Fletterick, 1989) and is used in post-translational processing of the YF polyprotein (Chambers et al., 1990) . As these proteins possess important enzymatic functions, it can be expected that they can tolerate fewer amino acid substitutions. This is indeed what was observed with only 0.49 and 0.33% amino acid differences for NS3 and NS5, respectively, although there were substitutions which altered the charge (e.g., Ala to Glu or Asp to Ala) in NS5 protein. Now that infectious virus from a cloned full-length cDNA copy of YF virus (Rice eta/., 1989 ) and a dengue-4 virus (Lai et al., 199 1) have been rescued from cells, it should be possible to do the same with the dengue-2 virus. ln vitro mutagenesis can then be used to determine which of the above change or changes in the dengue virus genome and its encoded proteins relates to attenuation of the virus. Analyses of the 11 dengue-2 16681-PDK53 encoded proteins (C, prM, M, E, NSl, NS2A, NSZB, NS3, NS4A, NS4B, and NS5) with the 16 published flavivirus sequences have shown that the dendrograms of the separate proteins (Fig. 4) are very similar in topology. The dendrograms of the most slowly evolving proteins (i.e., E, NSl , NS3, and NS5) give almost identical classifications although some branch points are in slightly different positions. There were however, no clear differences in the branching order of the different proteins which would indicate the occurrence of genetic recombination. It is therefore justifiable to calculate a combined dendrogram of these flaviviruses (Fig. 5) as it seems that the flaviviruses have evolved by divergent mutational change and there is no evidence of genetic recombination which has been shown to be a major factor in the evolution of RNA genome viruses; especially the retroviruses (McClure et a/., 1988) but also the luteo-like viruses whose replicase is so similar to that of the flaviviruses (see Fig. 6 ; Bruenn, 1991) . We believe that features of the combined dendrogram accurately reflect the flavivirus relationships, and hence the likely phylogeny, although there is at present no published method for testing the statistical significance of the details of neighbor-joining trees. These consistent features are that (i) the replicate isolates of individual viruses [i.e., three viruses (Japanese encephalitis, yellow fever, and tick-borne encephalitis) represented by two isolates each, and one (dengue-2) represented by four] classify together before they are linked in higher order relationships with other viruses; thus, the sequence data relationships are confirmed by the prior serological identification of individual isolates. (ii) The separate dendrograms of the 11 proteins (i.e., in-dependent subsamples of the data) give closely similar dendrograms, indeed a dendrogram with the same topology could have been obtained by qualitatively drawing a consensus dendrogram from the eleven separate dendrograms. The analyses of -GDD-motif of the polymerases of 63 viruses using different measures gave slightly different results and clearly show that, as expected, the results of analyzing distant sequence relationships must be interpreted with caution. Bruenn (1991) calculated a classification of the -GDD-motif regions which placed the flaviviruses together with hog cholera pestivirus as a sister group of the luteoviruses and relatives and, in order of increasing distance, black beetle nodavirus, then the tobamo-ARs, the alphaviruses, all the viruses with dsRNA genomes, and finally the picornaviruses and the leviviruses. Our dendrogram (Fig. 6 ) shares some features with that of Bruenn (1991) , but has the more agreed arrangements of the tymo-, tricorna-, poty-, and picornaviruses and confirms the report of Miller and Purcell (1990) of a shared sequence in the replicase proteins of hepatitis C virus and carnation mottle carmovirus (Guilley et al., 1985) . Both dendrograms of the NS5 proteins, and that of the NS3 proteins, agreed in placing the root of the flavivirus tree close to the point where the TBE branch joined the other flaviviruses. Thus the "balance" of the unrooted combined tree (Fig. 5) is probably correct; it is rooted at its leftmost vertical link. This provides some evidence that the primary division of the flaviviruses was between those transmitted by mosquitoes and those transmitted by ticks, but this needs to be confirmed by the genomic sequencing of more of the tickborne flaviviruses. The distant relationships of the flavivirus proteins to proteins of other seemingly unrelated viruses add further to the evidence that most major virus groups have arisen by the recombination of viral genes from diverse sources. For instance: (i) the flavivirus E glycoprotein shows some sequence similarity to the spike glycoprotein of the vesicular stomatitis rhabdovirus; (ii) the flavivirus replicase (NS5) is close to that of the carmoviruses, yet their major virion proteins are quite different; that of the carmoviruses is an 8-stranded antiparallel P-barrel protein (a gene family that has provided the virion proteins of most small isometric virions); and (iii) the protease-like NS3 protein of flaviviruses shares a common ancestor with a protein of the picornaviruses and potyviruses that have unrelated coat proteins (Rossman and Rueckert, 1987; Dolja et al., 1991) . Serologically defined linear epitopes in the envelope protein of dengue 2 (Jamaica strain 1409) Nucleotide sequence coding for the "signal peptide" and the N-terminus of the hemagglutinin from an Asian (H2N2) strain of influenza virus The nucleotide sequence of the coding region of tobacco etch virus genomic RNA: Evidence for the synthesis of a single polypeptide A sequence motif in many polymerases Detection of a trypsin-like serine protease domain in flaviviruses and pestiviruses Immunization with a live-attenuated dengue-2 virus candidate vaccine (16681.PDK53): Clinical, immunological and biological responses in adult volunteers Comparison of dengue viruses and some other flaviviruses by cDNA-RNA hybridization analysis and detection of a close relationship between dengue virus serotype 2 and Edge Hill virus Lethal 17D yellow fever encephalitis in mice. I. Passive protection by monoclonal antibodies to the envelope proteins of 17D yellow fever and dengue 2 viruses Relationships among the positive strand and double-strand RNA viruses as viewed through their RNA-dependent RNA polymerases Sequence analysis of the viral core protein and the membrane-associated proteins Vl and NV2 of the flavivirus West Nile virus and of the genome sequence for these proteins Primary structure of the West Nile flavivirus genome region coding for all nonstructural proteins Evidence that the N-terminal domain of nonstructural protein NS3 from yellow fever virus is a serine protease responsible for site-specific cleavages in the viral polyprotein Techniques for hemagglutination and hemagglutination-inhibition with arthropod-borneviruses Nucleotide and complete amino acid sequences of Kunjin virus: Definitive gene order and characteristics of the virusspecified proteins Partial nucleotide sequence of the Murray Valley encephalitisvirus genome. Comparison of the encoded polypeptides with yellow fever virus structural and non-structural proteins Atlas of Protein Sequence and Structure" (M. 0. Dayhoff Nucleotide sequence and deduced amino acid sequence of the structural proteins of dengue type 2 virus, Jamaica genotype Nucleotide sequence and deduced amino acid sequence of the nonstructural proteins of dengue type 2 virus, Jamaica genotype: Comparative analysis of the full-length genome A comprehensive set of sequence analysis programs for the VAX Overlapping genes in a yeast double-stranded RNA virus Phylogeny of capsid proteins of rod-shaped and filamentous RNA plant viruses: Two families with distinct patterns of sequence and probably structure conservation The nucleotide sequence of tobacco vein mottling virus RNA Japanese encephalitis virus liveattenuated vaccine, Chinese strain SA,-14-2; Adaptation to primary canine kidney cell cultures and preparation of a vaccine for human use Progressive sequence alignment as a prerequisite to correct phylogenetic trees Progressive alignment and phylogenetic tree construction of protein sequences Aligning amino acid sequences: Comparison of commonly used methods A single amino acid substitution in a hydrophobic domain causes temperature-sensitive cellsurface transport of a mutant viral glycoprotein Birnavirus RNA polymerase is related to polymerases of positive strand RNA viruses Two related superfamilies of putative helicases involved in replication, recombination, repair and expression of DNA and RNA genomes Neutralizing (54K) and non-neutralizing (54K and 48K) monoclonal antibodies against structural and non-structural yellow fever virus proteins confer immunity in mice Nucleotide sequence and genome organization of carnation mottle virus RNA Evolutionary relationship between luteoviruses and other RNA plant viruses based on sequence motifs in their putative RNA polymerases and nucleic acid helicases Comparison of the virulent Asibi strain of yellow fever virus with the 17D vaccine strain derived from it Nucleotide sequence of dengue 2 RNA and comparison of the encoded proteins with those of other flaviviruses The pathogenesis of dengue: Molecular epidemiology in infectious disease Observations related to the pathogenesis of dengue hemorhaggic fever. Il. Antigenic and biologic properties of dengue viruses and their association with disease response in the host Synergistic interactions of anti-NSl monoclonal antibodies protect passively immunized mice from lethal challenge with dengue 2 virus Afamilyof related ATP-binding subunits coupled to many distinct biological processes in bacteria The double-stranded RNA genome of yeast virus L-A encodes its own putative RNA polymerase by fusing two open reading frames Isolation of a Singh's Aedesalbopicfus cell clone sensitive to dengue and Chikungunya viruses Identification of continuous epitopes of the envelope glycoprotein of dengue type 2 virus Sequence analysis of cloned dengue virus type 2 genome Primary structural comparison of RNA-dependent polymerases from plant, animal and bacterial viruses Monoclonal antibodies against the dengue 2 virus E-glycoprotein protect mice against lethal dengue infection Monoclonal antibodies for dengue virus prM glycoprotein protect mice against lethal dengue infection Tentative identification of RNA-dependent RNA polymerases of dsRNA viruses and their relationship to positive strand RNA viral polymerases Infectious RNA transcribed from stably cloned full-length cDNA of dengue type 4 virus Homologous potyvirus and flavivirus proteins belonging to a superfamily of helicase-like proteins The nucleotide sequence of dengue type 4 virus: Analysis of genes coding for nonstructural proteins Sequence of the structural proteins of tick-borne encephalitis virus (Western subtype) and comparative analysis with other flaviviruses Antigenic structure of the flavivirus envelope protein E at the molecular level, using tick-borne encephalitis virus as a model. 1. Viral Genome sequence of tick-borne encephalitis virus (Western subtype) and comparative analysis of nonstructural proteins with otherflaviviruses Molecular Cloning: A Laboratory Manual Sequence of the dengue-1 virus genome in the region encoding the three structural proteins and the major nonstructural protein NSl Partial nucleotide sequence of the Japanese encephalitis virus genome Sequence comparisons of retroviral proteins: Relative rates of change and general phylogeny Hepatitis C virus shares amino acid sequence similarity with pestiviruses and flaviviruses as well as members of two plant virus supergroups A general method applicable to the search for similarities in the amino acid sequence of two proteins Nucleotide sequence of the virulent SA-14 strain of Japanese encephalitis virus and its attenuated vaccine derivative, SA-14-14-2 Analysis of disulfides present in the membrane proteins of the West Nile flavivirus. I/iro/ogy 156 High-efficiency cloning of fulllength cDNA Complete nucleotide sequence of dengue type 3 virus genome RNA Identification of a putative RNA dependent RNA polymerase encoded by a yeast double stranded RNA virus Transcription of infectious yellow fever virus RNA from full-length cDNA templates produced by in vifro ligation Nucleotide sequence of yellow fever virus: Implications for flavivirus gene expression and evolution Structure of the flavivirus genome Nucleotide sequence of potato virus Y (N strain) genomic RNA. 1 The Emperor's New Clothes revisited, or reflections on the pathogenesis of dengue hemorrhagic fever What does the molecular structure of viruses tell us about viral functions? The neighbour-joining method: A new method for reconstructing phylogenetic trees DNA sequencing with chain-terminating inhibitors Protection against 17D yellow fever encephalitis in mice by passive transfer of monoclonal antibodies to the nonstructural glycoprotein GP48 and by active immunization with GP48 Protection of mice against dengue 2 virus encephalitis by immunization with the dengue 2 virus non-structural glycoprotein NSl Protection against yellow fever in monkeys by immunization with yellow fever virus nonstructural protein NSl Dengue viruses Gene mapping and positive identification of the nonstructural proteins NS2A, NS2B, NS3, NS4B and NS5 of the flavivirus Kunjin and their cleavage sites A note on the neighbour-joining tree algorithm of Saitou and Nei Complete nucleotide sequence of the Japanese encephalitis virus genome RNA The complete nucleotide sequence of plum pox virus RNA (strain D) Use of yellow fever virus modified by in vitro cultivation for human immunization Partial nucleotide sequence of St. Louis encephalitis virus RNA: Structural proteins, NSl nsila, and ns2b Evolution of vesicular stomatitis virus in athymic nude mice: Mutations associated with natural killer cell selection Distantly related sequences in the (Y and p subunits of ATP synthase. myosin, kinases and other ATP-requiring enzymes and a common nucleotide binding fold Terminal sequences of the genome and replicative form RNA of the flavivirus West Nile virus: Absence of the poly (A) and possible role in RNA replication Sequence analysis of the membrane protein V3 of the flavivirus West Nile virus and of its gene Nucleotide sequence of the genome region encoding the structural proteins and the NSl protein of the tick borne encephalitis virus Dengue virus vaccine development: Study on biological markers of uncloned dengue 1-4 viruses serially passaged in primary kidney cells Immunization of mice with dengue structural proteins and nonstructural protein NSl expressed by baculovirus recombinant induces resistance to dengue virus encephalitis Cloning full-length dengue 4 viral DNA sequences: Analysis of genes coding for structural proteins The authors thank Dr. Barry Gorman for useful comments. This research was supported by the World Health Organization as part of its Program of Vaccine Development, and by the National Health and Medical Research Council of Australia, the Rockefeller Foundation, and the Sir Albert Sakzewski Foundation.