key: cord-0729445-9mkmpwo4 authors: Lorusso, Alessio; Desario, Costantina; Mari, Viviana; Campolo, Marco; Lorusso, Eleonora; Elia, Gabriella; Martella, Vito; Buonavoglia, Canio; Decaro, Nicola title: Molecular characterization of a canine respiratory coronavirus strain detected in Italy() date: 2009-01-20 journal: Virus Res DOI: 10.1016/j.virusres.2008.12.011 sha: c4a69b3423b111b1fac2552e0ec8701869c95f0f doc_id: 729445 cord_uid: 9mkmpwo4 Coronaviruses (CoVs) are positive-stranded, non-segmented RNA viruses generally responsible for the emergence of respiratory and enteric disease in humans, companion animals and livestock. Their aptitude to evolve by genetic recombination and/or point mutation is recognized, thus giving rise to new viral genotypes and mutants with different tissues or host tropism. In particular, a probable origin from the strictly related bovine coronavirus (BCoV) or, alternatively, from a common ancestor has been suggested for some group 2a CoVs, including canine respiratory coronavirus (CRCoV). In this study, we report the sequence analysis of the viral RNA 3′-end of an Italian CRCoV, strain 240/05, together with the sequence comparison with extant bovine-like viruses including the sole CRCoV strain 4182 previously described. Interestingly, although the structural proteins show the same features of CRCoV 4182, the genomic region between the spike and the envelope protein genes of CRCoV 240/05 encodes for three distinct products, including the equivalent bovine 4.9 kDa non-structural protein and a truncated form of the 4.8 kDa protein, whereas CRCoV 4182 has a unique 8.8 kDa protein. Coronaviruses (CoVs) are enveloped (+) strand RNA viruses and are generally responsible for respiratory and enteric diseases affecting mammals and birds. With genome sizes ranging from 27.6 kb to 31.6 kb, they represent the largest RNA viruses known to date (Gorbalenya et al., 2006; Siddell and Snijder, 2008) . After the emergence of SARS in humans, concern regarding animal CoVs has arisen in the scientific community, since animal CoVs can be assumed as models for human CoVs. In addition, as a consequence of the implemented epidemiological survey programs, new members of the family Coronaviridae have been detected in animals and humans. Currently, CoVs are organized into three antigenic groups with group 2 including bovine-like (group 2a) and SARS-like (group 2b) viruses (Gorbalenya, 2008) . It has been suggested that human coronavirus (HCoV) OC43 has arisen as consequence of a transspecies infection caused by bovine coronavirus (BCoV) (Vijgen et al., 2005) , and this may be also true for some animal group 2a CoVs, including porcine hemagglutinating encephalomyelitis virus (PHEV), buffalo coronavirus (BuCoV), giraffe coronavirus (GiCoV), alpaca coronavirus (ApCoV), and canine respiratory coronavirus (CRCoV) (Hasoksuz et al., 1999; Vijgen et al., 2006; Erles et al., 2007; ଝ The GenBank accession number of the sequence of CRCoV 240/05 reported in this paper is EU999954. * Corresponding author. Tel.: +39 0804679832; fax: +39 0804679843. E-mail address: n.decaro@veterinaria.uniba.it (N. Decaro). Jin et al., 2007; Genova et al., 2008) . Alternatively, BCoV and bovine-like CoVs may have arisen from a common ancestor (Vijgen et al., 2006) . Unlike the group 1a enteric canine coronaviruses (CCoVs) types I and II (for a review, see CRCoV has been associated with mild respiratory signs and has been proposed as an etiological agent of canine infectious respiratory disease (CIRD) together with Bordetella bronchiseptica, canine adenovirus types 1 and 2, canine parainfluenza virus, canine herpesvirus, reoviruses and influenza viruses (Erles et al., 2004; Buonavoglia and Martella, 2007; Erles and Brownlie, 2008) . CRCoV has been detected firstly in UK in 2003 (Erles et al., 2003) and subsequently in Italy (Decaro et al., 2007) . A group 2a CoV has also been identified in dogs in Canada (Ellis et al., 2005) and Japan (Kaneshima et al., 2006) , and there is serological evidence that a bovine-like CoV is circulating in the canine population in other countries, including USA, Ireland and Greece (Priestnall et al., 2006) . CRCoV possesses the canonical genome organization of the group 2 CoVs (Erles et al., 2007) . Briefly, proceeding from the 5end of the genome are the genes encoding for the replicase complex and for the structural proteins (hemagglutinin-esterase, HE; spike, S; envelope, E; membrane, M; nucleocapsid, N), expressed through a 3 -coterminal nested set of subgenomic (sg) mRNAs by a process of discontinuous transcription (de Vries et al., 1997) . In addition, interspersed among the structural genes, CRCoV, analogously to BCoV, carries some accessory genes, encoding for non-structural proteins (nsp), located between the replicase and the HE protein genes and between the S and E protein genes that are also expressed from sg mRNAs (Narayanan et al., 2008) . These sg mRNAs each contain a short leader sequence at their 5 -end, followed by a transcription regulatory sequence (TRS). TRS sequences are also present in the genome upstream of each gene and they serve as signals for the generation of the sg mRNAs (for a review, see Britton and Cavanagh, 2008) . Despite the serological evidence that BCoV-like particles are circulating in dogs, few partial genomic sequences of CRCoVs are publicly available, mostly regarding the S and HE proteins, except the English isolate CRCoV 4182 whose entire genomic 3 -end has been determined and analyzed (Erles et al., 2007) . In the present study, we report the sequence and phylogenetic analyses of the entire genomic 3 -end of the CRCoV 240/05 strain previously detected in the lungs of a dog in Italy (Decaro et al., 2007) . A 9.6-kb region encompassing the complete 3 -end of the viral genome of CRCoV 240/05 was determined through RT-PCR amplifications of overlapping fragments by using primers previously published and the kit SuperScript TM One-Step RT-PCR for Long Templates (Life Technologies, Invitrogen, Milan, Italy) which contains a highfidelity DNA polymerase (Platinum ® Taq Hi Fi). The RNA was extracted from a fragment of the original lung sample stored in RNAlater RNA Stabilization Reagent (Qiagen S.p.A., Milan, Italy) by using the QIAamp ® RNeasy Mini Kit (Qiagen S.p.A.), following the manufacturer's instructions, with the template RNA being eluted in 50 l of RNase-free water. DNA samples generated from two different RT-PCR runs were sequenced in both directions by Cogenics Europe (Meylan, France). Additional RT-PCR assays and sequencing attempts were performed to close gaps between assembled contigs using strain-specific primers. The analysis of the 240/05 3 -end genomic sequence by means of the NCBI graphical analysis ORF Finder tool (http://www.ncbi.nlm.nih.gov/gorf/gorf.html) identified nine ORFs that can be deduced to encode the non-structural and structural proteins of the virus (Fig. 1 ). All the predicted ORFs but the 2.7 kDa gene were preceded by a TRS sequence (CUAAAC or CCAAAC) which is assumed to interact with the viral polymerase during the discontinuous transcription of the negative sg RNAs (for a review, see Britton and Cavanagh, 2008) . The ORFs encoding structural proteins are ORF 2b (nt 1017-2291), ORF 3 (nt 2306-6397), ORF 5 (nt 7087-7341), ORF 6 (nt 7356-8048), ORF 7 (nt 8058-9404). The predicted translation products are the HE (47.7 kDa), S (151.1 kDa), E (9.2 kDa), M (26.4 kDa) and N (49.3 kDa) proteins, respectively. Analogously to other bovine-like CoVs, the genome of strain 240/05 contained three accessory genes encoding for four group-specific non-structural proteins, namely ORF 2a, located between pol1b and HE genes and encoding for the 32 kDa, and, between the S and E genes, ORF 4a encoding for the 4.9 kDa and the 2.7 kDa proteins and ORF 4b encoding for the 12.8 kDa protein. Additionally, the N protein gene displayed an accompanying internal ORF 8 encoding for the I protein of 23.3 kDa. CRCoV 240/05 and 4182 differed in 88 positions (excluding the putative 4.9 kDa and the truncated 4.8 kDa proteins), 38 of which were non-synonymous and scattered over all the proteins apart from the E protein. The group 2a-specific accessory structural HE protein was 424 amino acids long with its TRS located nine nucleotides upstream of the AUG protein initiation codon. The HE protein gene showed eight synonymous and two non-synonymous nucleotide variations when compared with that of CRCoV 4182. Interestingly, strain 240/05 shared the same amino acid mutations with all BCoV strains and derivatives described to date, so therefore these mutations are unique to CRCoV 4182. Amino acid sequence identity was 99.5% between 240/05 and 4182, whereas it ranged from 93.4% to 97.7% between 240/05 and other BCoV/bovine-like CoVs (Table 1) . By analysis with the NetNglyc server (http://www.cbs.dtu.dk/ services/NetNGlyc/), the HE of 240/05 was found to contain eight potential glycosylation sites analogously to CRCoV 4182. Accordingly, a potential signal peptide was identified by SignalP 3.0 Server (http://www.cbs.dtu.dk/services/SignalP/) at amino acids 1-18 with a potential cleavage site between amino acids 18 and 19 and the predicted site for neuraminidase-O-acetylesterase activity, FGDS, was detected at the N-terminus. The S protein was long 1363 amino acids and its AUG codon was directly preceded by the TRS sequence. Amino acid identity in comparison with the S protein of CRCoV 4182 was 98.4% (Table 1) showing 22 amino acid substitutions, of which nine were found to be unique to 240/05 strain whereas 10 were present in other BCoV and derivatives. Twenty-one synonymous nucleotide variations were encountered in the S gene. Amino acid identity between 240/05 and BCoV/bovine-like viruses S protein ranged from 91.2% to 95.7% (Table 1) . A potential N-terminal signal peptide was identified at amino acids 1-17 and 21 potential glycosylation sites were detected throughout the protein. No deletions or insertions were observed in strain 240/05 compared with bovine-like CoVs. Moreover, the amino acid stretch KRRSRR, responsible for the proteolytic cleavage of the S protein at residue 768 into subunits S1 and S2, was conserved. The E protein was 84 amino acids long and identical, in terms of amino acid sequence, to that of CRCoV 4182. Only one synonymous nucleotide change differentiated the E protein encoding gene of strains 240/05 and 4182. Two residues were found to be different from the same protein of BCoV-Mebus, only one if compared with the extant BCoVs. The TRS sequence was located 123 nucleotides upstream the AUG codon as already shown in 4182 strain. SignalP assigned a signal anchor at position 1-34 and no glycosylation sites were found. The M protein was 230 amino acids in length and the AUG translation initiation codon was separated from the TRS stretch by three nucleotides. Five synonymous and five non-synonymous nucleotide variations were found with respect to the M gene of CRCoV 4182, three of which were unique to 240/05 strain while the remaining two changes were shared with extant BCoV strains. Moreover, four potential O-glycosylation sites (http://www.cbs.dtu.dk/services/ NetOGlyc/) and one N-glycosylation site were identified at the Nterminus of the protein as previously described for all bovine-like CoVs. The N protein of strain 240/05 had a length of 448 amino acids and was highly conserved among the bovine and bovine-like viruses. Six synonymous and two non-synonymous changes were present with respect to strain 4182. The same amino acid changes were present in extant BCoV-like viruses. The putative 32 kDa nsp showed the same length (278 amino acids) as most other bovine-like CoVs, including GiCoV, ApCoV and sable antelope coronavirus (SACoV), as well as HCoV-OC43 and CRCoV 4182. Three amino acid changes were found in comparison with 4182 strain, of which one was unique to 240/05 whereas the remaining two substitutions were shared with other bovinelike strains. Five were the synonymous nucleotide substitutions. The corresponding TRS sequence was located seven nucleotides upstream of the AUG codon confirming the genome organization of CRCoV 4182. In the majority of their genome, CRCoV 240/05 and 4182 possessed the same genomic arrangement, but they displayed a diverse genome structure in the accessory genes encoding for nsp located between the S and E genes. In this region, most BCoV-like CoVs display three accessory genes, namely the 4.9 kDa, 4.8 kDa and 12.8 kDa protein genes. In CRCoV 4182 only two accessory genes were detected (Erles et al., 2007) , whereas the genomic arrangement of other bovine-like CoVs was partially conserved in strain 240/05. Indeed, CRCoV 4182 showed a unique 8.8 kDa protein fashioned by the fusion of the 4.9 kDa protein and a truncated form of the 4.8 kDa protein. This was the result of a two nucleotide deletion that inactivated the stop codon of the 4.9 kDa protein encoding sequence and introduced additional 12 amino acids not found in any BCoV strains (Erles et al., 2007) . In contrast, in the 240/05 genome the terminating codon of the 4.9 kDa encoding sequence is conserved exactly as in most group 2a CoVs, including BCoVs-Mebus, Quebec and DB2, HECV-4408 and BuCoV. The corresponding TRS sequence is located 317 nucleotides upstream of the initiation codon according to previous observations for other ruminant CoVs. However, the 4.8 kDa protein of most BCoV and bovine-like CoVs is replaced in strain 240/05 by a 2.7 kDa protein (25 amino acids in length) due to the presence of an early stop codon. An identical truncated protein has been already described for CRCoV strain G9142 (Erles et al., 2007) , whereas a slightly longer truncated protein is present in GiCoV and SACoV (Hasoksuz et al., 1999) . The corresponding TRS sequence was not detected in any bovine-like viruses with the exception of CRCoV G9142 (Erles et al., 2007) . The 12.8 kDa protein was 109 amino acids in length according to the extant BCoV and bovine-like CoVs except BCoV Quebec that exhibits a truncated form of the protein. The related TRS was located 73 nucleotides upstream of the AUG codon. Two synonymous and three non-synonymous nucleotide substitutions with respect to strain 4182 were detected in the coding sequence, with one change being unique if compared to other bovine-like CoVs previously described. The I protein was 207 amino acids in length according to CRCoV 4182. Only one amino acid change and two synonymous nucleotide variations differentiated the two CRCoV strains and strain 240/05 showed almost 95% average amino acid identity with most of the bovine-derivative viruses. The rooted phylogenetic analysis (Fig. 2 ) performed on the S (a) and M/N (b) proteins confirmed the high genomic relatedness of CRCoV with BCoV and its derivatives, thus supporting data obtained previously (Decaro et al., 2007 Erles et al., 2007) . Noteworthy, in the S and M/N phylogenetic trees, CRCoV 240/05 and 4182 formed a separate bunch among the group 2a CoVs, most likely as a consequence of the speciation of BCoV in the canine host. This study provides useful data for the molecular comprehension of bovine-like CoVs in the canine host. The CoV ecology is intricate due to the number of apparently frequent cross-species jumps that entail its evolution. Indeed, the presence of bovinelike CoVs in dogs was clearly demonstrated in previous studies by serological methods, sequence comparison and phylogenetic analysis, but molecular data available to date is incomplete and limited in number. Sequence analysis of the 3 -end of the viral genome showed that strain 240/05 has a genomic organization similar to BCoV, including the presence of the ORFs encoding for the nsp between the S and E protein genes. Although the structural genes are highly conserved among CRCoV strains 240/05 and 4182, the location and number of the accessory genes differ between the two viruses. Moreover, their function in the viral life cycle has not been established yet. In general, as already demonstrated for other CoVs, accessory genes are not essential for replication and their expression could even decrease viral fitness in vitro (Schwarz et al., 1990; Yokomori et al., 1991; Curtis et al., 2002; Ortego et al., 2003; Youn et Yount et al., 2005) . Nevertheless, in field conditions those genes are constantly maintained (Herrewegh et al., 1995; Smits et al., 2005; Dijkman et al., 2006) and their loss is often accompanied by the decline of virulence in the natural host (de Haan et al., 2002; Ortego et al., 2003; Haijema et al., 2004) . CRCoV 240/05 was directly sequenced from the lung sample of a dog allowing the molecular characterization of viral RNA coming straight from field conditions with no adaptation to cell culture. Apparently, in natural conditions the 4.8 kDa protein, strictly maintained in other bovine-like CoVs, was truncated in CRCoV 240/05, and this may be potentially associated to the cross-species transmission and subsequent adaptation of the ancestor BCoV to a different host. However, directly downstream of the S protein gene, CRCoV 4182 possessed a unique 8.8 kDa protein gene, whereas CRCoV G9142 displayed the canonical set of BCoV accessory genes but with the equal truncated form of the 4.8 kDa protein gene as in strain 240/05. Noteworthy, the identical truncated 4.8 kDa terminating codon is present in 240/05, G9142 and in the unique 8.8 kDa corresponding nucleotide sequence of CRCoV strain 4182 (Erles et al., 2007) . Accordingly, more than one BCoV strain or ancestor virus was likely involved in the origin of CRCoV, thus leading to the emergence of different canine strains with a different organizations of the accessory genes. It has been hypothesized that the 4.9 kDa and 4.8 kDa proteins of BCoV may have arisen through mutation from a bovine 11 kDa protein (Abraham et al., 1990) . According to this scenario, CRCoV 4182 may descent from a mutation of an ancestral bovine-like strain that exhibited the full-length 11 kDa protein, whereas CRCoV 240/05 and G9142 presumably descended from a different ancestor that showed the two distinct non-structural proteins. Furthermore, nsp 4.9 and 4.8 are not present in the bovine-derivative HCoV-OC43 (Vijgen et al., 2005) and their function is yet unknown in BCoV itself where the 4.9 kDa protein could not been expressed due to the absence of a start codon in its mRNA (Hofmann et al., 1993) . Obviously, it cannot be ruled out that the elevated level of genomic differences among the accessory genes is due to the high frequency of mutations/deletions that occur during CoV evolution, rather than to RNA recombination events that characterize CoV ecology (Lai et al., 1985; Makino et al., 1986; Banner and Lai, 1991) . Accordingly, all CRCoV strains analyzed so far constantly cluster at the phylogenetic level with extant BCoV strains (Fig. 2) , reinforcing previous suggestions that CRCoV apparently originated as a host variant of BCoV or both viruses descended from a common ancestor. Nevertheless, the elevated level of amino acid similarity with extant bovine-like CoVs characterizing the 12.8 kDa protein suggests a more important role of this protein in viral pathobiology (Erles et al., 2007) . Unfortunately, CRCoV 240/05 could not be propagated in tissue culture probably as a consequence of the long storage of the tissue sample, thus preventing further analysis of viral mRNAs in the context of the infected cells. However, several studies are warranted in order to investigate the effective functionality of the accessory genes in BCoV and CRCoV, thus elucidating the relationship and variations in their expression and the interconnected evolution of highly similar viruses in different hosts. Sequence and expression analysis of potential nonstructural proteins of 4.9, 4.8, 12.7, and 9.5 kDa encoded between the spike and membrane protein genes of the bovine coronavirus Random nature of coronavirus RNA recombination in the absence of selection pressure Nidovirus genome organization and expression mechanisms Canine respiratory viruses Heterologous gene expression from transmissible gastroenteritis virus replicon particles The groupspecific murine coronavirus genes are not essential, but their deletion, by reverse genetics, is attenuating in the natural host The genome organization of the nidovirales: similarities and differences between arteri-, toro-, and coronaviruses Serological and molecular evidence that canine respiratory coronavirus is circulating in Italy Biological and genetic analysis of a bovine-like coronavirus isolated from water buffalo (Bubalus bubalis) calves An update on canine coronaviruses: viral evolution and pathobiology Human coronavirus 229E encodes a single ORF4 protein between the spike and the envelope genes Detection of coronavirus in cases of tracheobronchitis in dogs: a retrospective study from 1971 to Canine respiratory coronavirus: an emerging pathogen in the canine infectious respiratory disease complex Longitudinal study of viruses associated with canine infectious respiratory disease Isolation and sequence analysis of canine respiratory coronavirus Detection of a group 2 coronavirus in dogs with canine infectious respiratory disease Detection of antigenic group 2 coronavirus in an adult alpaca with enteritis Genomics and evolution of the Nidovirales Nidovirales: evolving the largest RNA virus genome Live, attenuated coronavirus vaccines through the directed deletion of group-specific genes provide protection against feline infectious peritonitis Isolation of bovine respiratory coronaviruses from feedlot cattle and comparison of their biological and antigenic properties with bovine enteric coronaviruses The molecular genetics of feline coronaviruses: comparative sequence analysis of the ORF7a/7b transcription unit of different biotypes Leader-mRNA junction sequences are unique for each subgenomic mRNA species in the bovine coronavirus and remain so throughout persistent infection Analysis of the genome sequence of an alpaca coronavirus The prevalence of a group 2 coronavirus in dogs in Japan Recombination between nonsegmented RNA genomes of murine coronaviruses High-frequency RNA recombination of murine coronaviruses Coronavirus accessory proteins Transmissible gastroenteritis coronavirus gene 7 is not essential but influences in vivo virus replication and virulence Serological prevalence of canine respiratory coronavirus Murine coronavirus nonstructural protein ns2 is not essential for virus replication in transformed cells An introduction to Nidovirus Nidovirus sialate-O-acetylesterases: evolution and substrate specificity of coronaviral and toroviral receptor-destroying enzymes Evolutionary history of the closely related group 2 coronaviruses: porcine hemagglutinating encephalomyelitis virus, bovine coronavirus, and human coronavirus OC43 Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event Heterogeneity of gene expression of the hemagglutinin-esterase (HE) protein of murine coronaviruses In vitro assembled, recombinant infectious bronchitis viruses demonstrate that the 5a open reading frame is not essential for replication Severe acute respiratory syndrome coronavirus groupspecific open reading frames encode nonessential functions for replication in cell cultures and mice This work was supported by grants from the Italian Ministry of Health, Ricerca finalizzata 2008, project "Mammalian coronaviruses: molecular epidemiology, vaccine development and implications for animal and human health". The authors are grateful to P.J. Collins (CIT, Department of Biology, Cork, Ireland) for the English revision of the manuscript.