key: cord-0767270-190jmgcq authors: Liu, Yong-sheng; Zhou, Jian-hua; Chen, Hao-tai; Ma, Li-na; Pejsak, Zygmunt; Ding, Yao-zhong; Zhang, Jie title: The characteristics of the synonymous codon usage in enterovirus 71 virus and the effects of host on the virus in codon usage pattern date: 2011-07-31 journal: Infection, Genetics and Evolution DOI: 10.1016/j.meegid.2011.02.018 sha: 519e91ceb931116e05466800ee014dd297b96d7b doc_id: 767270 cord_uid: 190jmgcq Abstract To give a new perspective on the evolutionary characteristics shaping the genetic diversity of enterovirus 71 (EV71) and the effects of natural selection from its host on the codon usage pattern of the virus, the relative synonymous codon usage (RSCU) values, codon usage bias (CUB) values, effective number of codons (ENCs) values and nucleotide contents were calculated to implement a comparative analysis to evaluate the dynamics of the virus evolution. The characteristics of the synonymous codon usage patterns and nucleotide contents of EV71 and the comparison between ENC values for the whole coding sequence of EV71 and that of coding sequences for viral proteins of EV71 all indicate that the interaction between mutation pressure from virus and natural selection from host exists in the processes of evolution of EV71. The synonymous codon usage pattern of EV71 is a mixture of coincidence and antagonism to that of host cell. In addition, the genetic diversity of EV71 strains and the preferential selection of some synonymous codons in EV71 strains based on the different epidemic areas were observed, suggesting that geographic and social factors may play roles in influencing the evolution of this virus. Hand-foot-and-mouth disease (HFMD) is a general illness in children which usually is caused by some human enteroviruses (Cherry, 1992) . There were many reports which indicated some pandemics of HFMD were associated with EV71 infection in the Asia-Pacific area (AbuBakar et al., 1999; Chumakov et al., 1979; Fujimoto et al., 2002; Ho et al., 1999; Lin et al., 2005; Liu et al., 2000; Zheng et al., 1995) . EV71 belongs to members of the Enterovirus genus of the Picornaviridae family and is a positivestrand RNA virus with a genome size of about 7500 bp. The two non-translated regions (5 0 -NTR and 3 0 -NTR) flank the single open reading frame (ORF) of EV71 virus genome. The coding sequence encodes one polyprotein that is cleaved by viral proteases to generate 11 proteins, namely VP4, VP2, VP3, VP1, 2A, 2B, 2C, 3A, 3B, 3C and 3D. The structural proteins VP1-3 are exposed on EV71 surface. The VP1 gene contained the major antigenic sites and genetic diversity associated with serotypes (Oberste et al., 1999a,b) . Non-structural proteins were involved in polyprotein processing, RNA replication and the shut-down of host cell protein synthesis. In addition, recombinations are well known to result in a genetic diversity and evolution of enteroviruses (Chan and AbuBakar, 2004; Chen et al., 2010; Yoke-Fun and AbuBakar, 2006) . Due to various genetic diversities of EV71, the effect of the vaccine is limited to prevent children from EV71. This situation has made researchers aware of the importance of analysis of EV71 genetic diversity (Bible et al., 2008; Cardosa et al., 2003; Herrero et al., 2003; Huang et al., 2008; Lewis-Rogers et al., 2009; McMinn, 2002; Sanders et al., 2006) . It is noticed that nucleotide composition comprising of EV71 coding sequence with various genetic diversities is selective rather than random, because the natural selection from host is responsible to select various strains shaped by mutation. In previous reports, translation selection and compositional constraints under the mutational pressure are thought to be the major factors accounting for codon usage variation among genomes in microorganisms (Gu et al., 2004; Karlin and Mrá zek, 1996; Lesnik et al., 2000; Liu et al., 2010; Zhou et al., 2005 Zhou et al., , 2006 Zhou et al., , 2010 . In some RNA viruses, compared with natural selection, mutation pressure plays a more important role in synonymous codon usage pattern (Jenkins and Holmes, 2003; Levin and Whittome, 2000) . Although it is known that compositional constraints and translation selection are the more generally accepted mechanisms accounting for codon usage bias (Coleman To give a new perspective on the evolutionary characteristics shaping the genetic diversity of enterovirus 71 (EV71) and the effects of natural selection from its host on the codon usage pattern of the virus, the relative synonymous codon usage (RSCU) values, codon usage bias (CUB) values, effective number of codons (ENCs) values and nucleotide contents were calculated to implement a comparative analysis to evaluate the dynamics of the virus evolution. The characteristics of the synonymous codon usage patterns and nucleotide contents of EV71 and the comparison between ENC values for the whole coding sequence of EV71 and that of coding sequences for viral proteins of EV71 all indicate that the interaction between mutation pressure from virus and natural selection from host exists in the processes of evolution of EV71. The synonymous codon usage pattern of EV71 is a mixture of coincidence and antagonism to that of host cell. In addition, the genetic diversity of EV71 strains and the preferential selection of some synonymous codons in EV71 strains based on the different epidemic areas were observed, suggesting that geographic and social factors may play roles in influencing the evolution of this virus. ß 2011 Elsevier B.V. All rights reserved. Karlin et al., 1990; Zhi et al., 2010; Zhou et al., 1999) , other selection forces have also been proposed such as fine-tuning translation kinetics selection as well as escape of cellular antiviral responses (Aragones et al., 2008 (Aragones et al., , 2010 Karlin et al., 1994; Sugiyama et al., 2005) . Thus, the codon usage pattern may be important to disclose the molecular mechanism and evolutionary process of EV71 avoiding host cell response. To our knowledge, it is the first study that the synonymous codon usage pattern and evolutional dynamics of EV71 were systemically analyzed and the relationship between codon usage pattern of EV71 and that of its host was also analyzed. The 74 complete RNA sequences of EV71 were downloaded from the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/Genbank/) and detailed information about the viruses were listed in Table S1 . Each general nucleotide composition (U%, A%, C% and G%) and each nucleotide composition in the third site of codon (U 3 %, A 3 %, C 3 % and G 3 %) in EV71 coding sequence were calculated by biosoftware DNAStar 7.0 for windows. To investigate the characteristics of synonymous codon usage without the confounding influence of amino acid composition among different sequences, the relative synonymous codon usage (RSCU) values among different codons in the EV71 ORF was calculated according to the published equation (Sharp et al., 1986 ). The 'effective number of codons' (ENCs), the useful estimator of absolute codon usage bias, was a measure quantifying the codon usage bias of the whole coding sequence of EV71. The ENC value ranges from 20 (when only one synonymous codon is chosen by the corresponding amino acid) to 61 (when all synonymous codons are used equally) (Wright, 1990) . In this study, this measure was used to evaluate the degree of codon usage bias of coding sequences for proteins of EV71 and to calculate the degree of the codon bias for the whole coding sequence of EV71 and other picornaviruses. Additionally, there is a simple method which is supposed that statistically equal and random usage of all available synonymous codons was the ''neutral point'' (RSCU 0 = 1.00) for the development of group-specific codon usage . This method was introduced in the study to investigate the discrepancy of the synonymous codon usage pattern among of EV71 strains based on the different isolated areas. Principal component analysis (PCA), which was a commonly used multivariate statistical method (Jolliffe, 2002; Mardia et al., 1979) , was carried out to analyze the major trend in codon usage pattern among different strains of EV71. PCA involves a mathematical procedure that transforms some correlated variable (RSCU values) into a smaller number of uncorrelated variables called principal components. Each strain was represented as a 59 dimensional vector, and each dimension corresponded to the RSCU value of each sense codon, which only included several synonymous codons for a particular amino acid, excluding the codon of AUG, UGG and three stop codons. In addition, PCA was also performed for analyzing the discrepancy between codon usage pattern of EV71 and that of host cell. The relationship between each general nucleotide composition (U%, A%, C% and G%) and each nucleotide composition in the third site of codon (U 3 %, A 3 %, C 3 % and G 3 %) in EV71 coding sequence and the relationship between U 3 %, A 3 %, C 3 %, G 3 % and the codon usage pattern of EV71 were evaluated by the Pearson's rank. All statistical processes were carried out by statistical software SPSS11.5 for windows. The A% and U% were higher than C% and G%, but A 3 % and U 3 % were lower than C 3 % and G 3 % in EV71 (Table S2 ). The overall nucleotide composition never affects the nucleotide contents in the third site of codon in EV71 coding sequence, suggesting that composition constraints may be one of the factors in affecting the codon usage pattern of EV71. The optimal codons of Ala, Arg, Asp, Cys, Glu, Gly, Ile, Phe, Pro, Ser were A-ended or U-ended, while those of Asn, Glu, His, Leu, Lys, Thr, Tyr, Val were C-ended or G-ended (Table 1) . EV71 does not depends on all optimal codons with either A/U-end or C/G-ended like influenza A virus subtype H5N1, sever acute respiratory syndrome Coronavirus or foot-and-mouth disease virus (FMDV) (Gu et al., 2004; Zhao et al., 2008; Zhou et al., 2010) , but shapes the optimal codons with any types of nucleotide-ended. It is noted that although Asn, Leu, Tyr, Glu, Lys and Thr possessed optimal codon with C-or G-ended, they also contained some favored codon with U-or A-ended. Similarly, Asp, Phe and Cys also had favored codons with C-or G-ended (Table 1 ). These amino acids which choose optimal codons with any nucleotide-ended are affected under both mutation pressure by itself and natural selection from host, since natural selection from host ultimately allows those strains with good-fitness to possess a special codon usage patterns. The PCA detected the first principal component (f 1 0 ) which can account for 13.73% of the total synonymous codon usage variation, and the second principal component (f 2 0 ) for 11.81% of the total variation. It appeared to be a little complex with some overlapping plots representing different epidemic areas (Fig. S1 ). The plots for strains isolated from China-Mainland, compared with that of strains isolated from other areas, could aggregate highly, while the plots for strains isolated from Malaysia and China-Taiwan scattered largely, the plots for strains from Singapore, USA, Japan and Switzerland did not indicate the genetic diversity obviously, due to the limited samples. For strains circulating in China-Mainland, social factors (public health, interpersonal communication, etc.) may play a role in influencing genetic diversity of these strains. However, for strains in China-Taiwan and Malaysia, geographic factors likely influence genetic diversity of those strains except for social factors. The nucleotide contents of the whole coding sequence of EV71 were analyzed. In Table 2 , the significant positive correlations between A% and A 3 %, U% and U 3 %, C% and C 3 % and significant negative correlations among most of heterogeneous nucleotide contents indicated that composition constraints play a role in codon usage pattern of EV71; however, significant positive correlation between G% and A 3 %, C% and G 3 % and no correlation between G% and G 3 % might suggest that natural selection from host plays a role in codon usage pattern of EV71 as well. In addition, there were significant correlations between each nucleotide content in the third site of codon and codon usage indices (f 1 and f 2 ) ( Table 3) . Although the positive and negative correlations existed between C 3 % and f 1 , and between C 3 % and f 2 , respectively, the positive correlation play an important role in affecting the codon usage pattern due to f 1 being the first principal component. The strong discrepancy of the synonymous codon usage in strains based on the different isolated areas was observed. In details, in strains from China-Mainland, CGC for Arg, GAC for Asp, CUU for Leu, UUC for Phe were chosen by EV71 strains preferentially, while CGG for Arg, GAC for Asp, UUU for Phe were chosen poorly; in Singapore, AUU for Ile, CUG for Leu, UCA for Ser were chosen preferentially, while UUG for Phe and UCG for Ser were chosen poorly; in USA, AGA for Arg was used preferentially, while GGU for Gly was used poorly; in Japan, GCU for Ala, AAU for Asn, CAA for Gln, GAA for Glu were chosen preferentially, while GCG for Ala and CAG for Gln were poorly used; in Switzerland, AGG for Arg, UGC for Cys, CUC for Leu, UUG for Leu, AGU for Ser were chosen preferentially; while CGA for Arg, UGU for Cys, CUG for Leu, CUU for Leu, AGC for Ser were poorly used (Fig. S2) . These results may suggest that with the development of evolution of EV71 strains, the discrepancy of some synonymous codon usage probably is formed in different epidemic regions. In order to analyze whether the evolution of CUB was controlled by mutation effect or natural selection from host, the CUB values had been calculated based on data listed in Table 1 . The transition from maximum-negative to maximum-positive values was smooth and there was no obvious or unambiguous border between the so-called dominant and prohibited codons (Fig. 1) , namely, all synonymous codons were used. This result implied that the interaction between mutation pressure from EV71 and natural selection from host exists in the evolution of EV71. By comparing between the patterns of synonymous codon usage of human cell and that of EV71 virus, we found that the pattern of synonymous codon usage of EV71 strains is partially antagonistic to that of human cells. In detail, optimal codons of nine amino acids in EV71, including Ala, Asp, Cys, Gln, Gly, Leu, Phe, Pro, Ser, are the disfavored codons of the corresponding amino acids in its host. Among these non-coincidence patterns of synonymous codon usage of amino acids, the synonymous codon usage of Asp, Cys, Gln, Gly, Phe has evolved to be complementary to that of host cells (Table 1 ). In addition, the optimal and rare synonymous codon usage patterns of Arg, Asn, Glu, His, Lys, Thr, Table 2 Summary of correlation analysis between the A%, U%, C%, G% and A 3 %, U 3 %, C 3 %, G 3 % in the whole coding sequences of 74 EV71 strains a . A 3 % U 3 % C 3 % G 3 % ( C 3 + G 3 )% A% r = 0.869 ** r = À0.346 ** r = À0.316 ** r = À0.316 ** r = À0.102 NS U% r = À0.307 ** r = 0.918 ** r = À0.703 ** r = À0.703 ** r = À0.882 ** C% r = À0.467 ** r = À0.084 NS r = 0.875 ** r = 0.875 ** r = 0.341 ** G% r = 0.316 ** r = À0.751 ** r = À0.027 NS r = 0.027 NS r = 0.671 ** (C + G)% r = À0.198 NS r = À0.665 ** r = 0.832 ** r = 0.832 ** r = 0.884 ** a r value in this table is calculated in each correlation analysis. NS means non-significant (p > 0.05). ** Means p < 0.01. Summary of correlation analysis between the first two axes in principle and nucleotide contents in EV71. Base compositions f 1 0 f 2 0 A 3 % r = 0.359 ** r = 0.542 ** U 3 % r = À0.513 ** r = À0.516 ** C 3 % r = 0.238 * r = À0.355 ** G 3 % r = 0.238 * r = À0.355 ** (C 3 + G 3 )% r = 0.439 ** r = 0.250 * * Means 0.01 < p < 0.05. ** Means p < 0.01. Tyr and Val of EV71 virus were in agreement with those of human cells (Table 1) . Additionally, PCA was performed to examine the whole coding sequence of EV71 in this study. The method detected one major trend in the first axis (f 1 0 ) which can account for 11.79% of the total synonymous codon usage variation, and another major trend in the second axis (f 2 0 ) for 10.69% of the total variation. The plots for codon usage pattern of human are far from the plots for that of EV71 (Fig. S3) . The ENC values were calculated for FMDV, Cardiovirus, Hapatitis A virus (HAV), Poliovirus (PV) and compared with that of EV71 (Table S3 ). Among these virus examined, the ENC value for EV71 is highest, suggesting that EV71 has a most weak codon usage bias. In addition, we set up a plot which showed the relationship between GC 3 % and ENC values of all viral proteins (excluding the very small 3B protein) of EV71 virus, and found that the plots of coding sequences for VP1, VP2, VP3, 2A, 2C, 3A, 3C and 3D aggregated around the expected curve, but the plots of coding sequences for VP4 and 2B scattered highly under the expected curve (Fig. S4a-4c) . It may be explained that the codon usage bias of VP4 and 2B genes is influenced by their small size. In addition, there is no obvious geographic factor in influencing codon usage bias of the coding sequences of EV71, implying that the natural selection from the geographic factor does not affect the codon usage patterns of specific coding sequences of EV71, but shape the pattern of the whole sequence of this virus. Furthermore, we found that some specific non-optimal codons are preferentially chosen in some coding sequences of EV71. In details, three non-optimal codons (UUA, CUA and GUU) are chosen in the VP4 gene, UUG in the VP3 gene, CUU in the VP1 and 3A genes, UUU, CUU and GUU in the 3C gene. It is also found that all coding sequences of EV71 contain some preferential codons. In details, GUG, CCC, ACA, GCC and GAC are preferentially chosen in the VP4 gene, GUG, CCA and AGG in the VP2 gene, CUG, UCA, ACC and AGA in the VP3 gene, UCA in the VP1 gene, CUC, CCA and AGA in the 2A gene, CCU, AGA and AGG in the 2B gene, GUG, UCU, CCA, ACA and AGA in the 2C gene, CCA, ACU, AGU, AGC, AGA and AGG in the 3A gene, AUU, CCU, ACA, GCA, AGU and AGG in the 3C gene, GUG and AGA in the 3D gene. Taken together, there is no obvious relationship between the distribution of non-optimal codons and the deviation of ENC value from the theoretical value. The pattern of codon usage is a genetic characteristic of various organisms. Previous reports have been focused on viruses in Picornavirdae family, such as FMDV, HAV, Poliovirus (Aragones et al., 2008 (Aragones et al., , 2010 Coleman et al., 2008; Zhong et al., 2007) . Because A%, U%, G 3 % and C 3 % play roles in the formation of the different optimal codons with any nucleotide-ended, the codon usage pattern of EV71 is likely influenced by composition constraints. The codon usage pattern of PV is mostly coincident with that of its host, while the codon usage pattern of HAV is antagonistic to that of its host (Mueller et al., 2006; Sá nchez et al., 2003) . The codon usage pattern of EV71 is a mixture of the two types of codon usage. The coincident portion of codon usage pattern of EV71 enable the corresponding amino acids to be translated efficiently, the other antagonistic portion of codon usage pattern of EV71 may enable viral proteins to be folded properly, although the translation efficiency of the corresponding amino acids decreased. In Epstein-Barr virus latent genes deoptimize codon usage in order to evade competition for host protein translation (Karlin et al., 1990 ) and attenuation of PV activity was performed by rare codon pairs inducing poor translation for sequences of viral proteins (Coleman et al., 2008) . These results suggest that disfavored codons coding for amino acids may not be deleterious factor for viruses to adapt to host cells. For codon usage patterns of the coding sequences of EV71, the VP2, 2A, 2B, 2C and 3D genes possess only some preferential codons and none of non-optimal codons is preferentially used, implying that translation of the whole coding sequence of EV71 is possibly regulated under the translation selection. Furthermore, the alternative translation is the possibility of fine-tuning the kinetics of protein translation by a combination of rare and optimal codons (Aragones et al., 2010; Komar, 2009 ). For codon usage patterns of VP4, VP3, VP1, 3A and 3C genes of EV71, these genes possess combination of some non-optimal codons and optimal ones which are preferentially used, implying that translation of the coding sequences of EV71 is possibly regulated under fine-tuning translation kinetics selection. The sequences 5 0 NTR and VP1 are often used to analyze the genetic diversity of EV71 (Hagiwara et al., 1984; Hsu et al., 2007; Li et al., 2005) . By analyzing the codon usage pattern of the whole coding sequence of strains from different areas, genetic diversity resulting from geographic and social factors is likely observed. The genetic diversity of the most strains from China-Mainland could indicate that a relatively independent area with geographic, public health and personal communication that enables the genetic diversity of EV71 to be sustained with little outside influence. Compared with the genetic diversity of strains from China-Mainland, that of strains from China-Taiwan and Malaysia also indicated that social factors play an important role in shaping the codon usage patterns of these strains. Based on the genetic diversities of China-Taiwan and Malaysia, social factors may play important roles in shaping codon usage patterns of EV71 from the two areas. These genetic diversities of EV71 strains from different areas give a sign that geographic and social factors should be noticed at genetic diversity of virus from different areas. The ENC values calculated for some picornaviruses indicate that a significantly lower bias of codon usage exists in EV71 than in the other viruses. As for RNA viruses, previous study reported that the major factor in shaping codon usage patterns appears to be mutation pressure rather than natural selection Zhao et al., 2008; Zhong et al., 2007; Zhou et al., 2005 Zhou et al., , 2010 . However, the genetic characteristics of EV71 suggest the interaction between mutation pressure and natural selection, although ENC values for the whole coding sequence of EV71 Fig. 1 . Distribution of the CUB of a codon for each amino acid. CUB was taken from Table 1 and sorted in ascending order. suggest mutation pressure is a factor in influencing codon usage pattern. Furthermore, in Fig. S4a-4c , the relationship between ENC data for EV71 proteins and CG 3 % indicated that natural selection probably play roles in genetic diversity of EV71 strains except for mutation pressure in order to adapt to host. A general mutational pressure, which affects the whole genome would certainly account for the majority of the codon usage among some RNA viruses (Jenkins and Holmes, 2003) . The genetic diversity and codon usage patterns results we proposed here are useful to understand the processes influencing the evolution of EV71, especially the roles played by natural selection from host and mutation pressure from virus. Additionally, such information might be helpful to understand the roles of geographic and social factors in influencing genetic diversity of EV71. Identification of enterovirus 71 isolates from an outbreak of hand, foot, and mouth disease (HFMD) with fatal cases of encephalomyelitis in Malaysia Hepatitis A virus mutant spectra under the selective pressure of monoclonal antibodies: codon usage constrains limit capsid variability Fine-tuning translation kinetics selection as the driving force of codon usage bias in the Hepatitis A virus capsid Molecular epidemiology of human enterovirus 71 in the United Kingdom from 1998 to Molecular epidemiology of human enterovirus 71 strains and recent outbreaks in the Asia-Pacific region: comparative analysis of the VP1 and VP4 genes Human enterovirus 71 in hand, foot and mouth disease patients Analysis of recombination and natural selection in human enterovirus 71 Enteroviruses: polioviruses (poliomylitis), coxsackieviruses, echoviruses and enteroviruses Enterovirus 71 isolated from cases of epidemic poliomyelitis-like disease in Bulgaria Virus attenuation by genome-scale changes in codon pair bias Outbreak of central nervous system disease associated with hand, foot, and mouth disease in Japan during the summer of 2000: detection and molecular epidemiology of enterovirus 71 Analysis of synonymous codon usage in SARS coronavirus and other viruses in the Nidovirales Genetic and phenotypic characteristics of enterovirus 71 isolates from patients with encephalitis and with hand, foot and mouth disease Molecular epidemiology of enterovirus 71 in peninsular Malaysia An epidemic of enterovirus 71 infection in Taiwan Genetic diversity of epidemic enterovirus 71 strains recovered from clinical and environmental samples in Taiwan Appearance of intratypic recombination of enterovirus 71 in Taiwan from The extent of codon usage bias in human RNA virus and its evolutionary origin Principal Component Analysis, 2nded What drives codon choices in human genes? Constrasts in codon usage of latent versus productive genes of Epstein-Barr virus: data and hypotheses Why is CpG suppressed in the genomes of virtually all small eukaryotic viruses but not in those of large eukaryotic viruses? A pause for thought along the co-translational folding pathway Ribosome traffic in E. Coli and regulation of gene expression Codon usage in nucleopolyhedroviruses Phylogenetic relationships and molecular adaptation dynamics of human rhinoviruses Genetic characteristics of human enterovirus 71 and Coxsackievirus A16 circulating from 1999 to 2004 in Shenzhen, People's Republic of China Genetic characteristics of human enterovirus 71 and cosackievirus A16 circulating from 1999 to 2004 in Shenzhen, People's Republic of China An outbreak of enterovirus 71 infection in Taiwan, 1998: epidemiologic and clinical manifestations Analysis of synonymous codon usage in porcine reproductive and respiratory syndrome virus Multivariate Analysis An overview of the evolution of enterovirus 71 and its clinical and public health significance Reduction of the rate of poliovirus protein synthesis through large-scale codon deoptimization causes attenuation of viral virulence by lowering specific infectivity Molecular evolution of the human enterovirus: correlation of serotype with VP1 sequence and application to picornavirus classification Typing of human enteroviruses by partial sequencing of VP1 Codon usage and replicative strategies of hepatits A virus Genome variability and capsid structural constraints of Hepatitis A virus Molecular epidemiology of enterovirus 71 over two decades in an Australian urban community Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes CpG RNA: identification of novel single-stranded RNA that stimulates human CD14+ CD11c+ monocytes The 'effective number of codons' used in a gene Phylogenetic evidence for inter-typic recombination in the emergence of human enterovirus 71 subgenotypes Analysis of synonymous codon usage in 11 Human Bocavirus isolates Enterovirus 71 isolated from China is serologically similar to the prototype EV71 BrCr strain but differs in the 5 0 -noncoding region Codon optimization of human parvovirus B19 capsid genes greatly increases their expression in nonpermissive cells Mutation pressure shapes codon usage in the GC-Rich genome of foot-and-mouth disease virus Papillomavirus capsid protein expression level depends on the match between codon usage and tRNA availability Analysis of synonymous codon usage in foot-and-mouth disease Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses Synonymous codon usage in environmental Chlamydia UWE25 reflects an evolution divergence from pathogenic chlamydiae This work was supported in parts by grants from National Science & Technology Key Project (2009ZX08007-006B) and International Science & Technology Cooperation Program of China (No. 2010DFA32640) and Science and Technology Key Project of Gansu Province (No. 0801NKDA034). This study was also supported by National Natural Science foundation of China (No. 30700597 and No. 31072143). Supplementary data associated with this article can be found, in the online version, at doi:10.1016/j.meegid.2011.02.018.