key: cord-0705537-suo9qiym
authors: Chen, Wei-Ju; Yang, Jyh-Yuan; Lin, Jih-Hui; Fann, Cathy S. J.; Osyetrov, Valeriy; King, Chwan-Chuen; Chen, Yi-Ming Arthur; Chang, Hsiao-Ling; Kuo, Hung-Wei; Liao, Fong; Ho, Mei-Shang
title: Nasopharyngeal Shedding of Severe Acute Respiratory Syndrome-Associated Coronavirus Is Associated with Genetic Polymorphisms
date: 2006-06-01
journal: Clin Infect Dis
DOI: 10.1086/503843
sha: f24e5aa3647cb9c63dc391290a7c6690c01b99cc
doc_id: 705537
cord_uid: suo9qiym

Background. A high initial or peak severe acute respiratory syndrome (SARS)-associated coronavirus (SARS-CoV) load in nasopharyngeal specimens was shown to be associated with a high mortality rate. Because all infected individuals were devoid of preexisting protective immunity against SARS-CoV, the biological basis for the variable virus burdens in different patients remains elusive.
Methods. The nationwide SARS database in Taiwan was analyzed, and genotyping of 281 single-nucleotide polymorphisms (SNPs) of 65 genes was performed for 94 patients with SARS, to identify SNPs for which distribution between patients with or without detectable nasopharyngeal shedding of SARS-CoV was biased.
Results. Titers of SARS-CoV shed in nasopharyngeal specimens varied widely, ranging from nondetectable to 10^8 SARS-CoV RNA copies/mL, and they were correlated positively with a high mortality rate (P < .0001, by trend test) and with early death (i.e., death occurring within 2 weeks of the onset of illness) (P = .0015, by trend test). Virus shedding was found to be higher among male patients (P = .0014, by multivariate logistic regression) and among older patients (P = .015, by multivariate logistic regression).
Detectable nasopharyngeal shedding of SARS-CoV was associated with polymorphic alleles of interleukins 18 (P = .014) and 1A (P = .031) and a member of NFκB complex (reticuloendotheliosis viral oncogene homolog B [RelB]) (P = .034), all of which are proinflammatory in nature, as well as the procoagulation molecule fibrinogen-like protein 2 (P = .008).
Conclusion. The SARS-CoV load is a determinant of clinical outcomes of SARS, and it is associated with polymorphisms of genes involved in innate immunity, which might be regulated in an age- and sex-dependent manner. The findings of the present study provided leads to genes involved in the host response to SARS-CoV infection; if substantiated with functional studies, these findings may be applicable to other newly emerged respiratory viruses (e.g., the influenza pandemic strain).

Severe acute respiratory syndrome (SARS) is a recently identified infectious disease caused by a novel coronavirus, SARS-CoV, that causes significant mortality, especially among elderly patients and patients with comorbid factors [1-4]. The clinical course of SARS consists of initial onset of fever, followed by the development of respiratory symptoms that may progress to acute respiratory distress syndrome in some patients [2, 5] but may be quite self-limiting in others. Serial chest radiographs of patients with SARS have disclosed distinct patterns of disease progression [6] and have suggested prolonged recovery in correlation with the clinical severity of the disease. These findings, in conjunction with those of clinical studies of cytokines during the acute phase of SARS [7-9], have suggested that the development of acute respiratory distress syndrome in patients with SARS is due to an immunopathological response related to strong activation of T helper type 1 cell-mediated immunity and a hyperinnate inflammatory response, rather than to uncontrolled virus growth [2]. However, a high initial or peak nasopharyngeal virus load is associated with a high mortality rate [10, 11], and a high serum virus load correlates with an increased risk for admission to an intensive care unit [12], highlighting the importance of virus burden to the clinical outcome. High SARS-CoV titers have also been recovered from saliva samples and mouth wash specimens, raising the possibility that the virus also replicates in the upper airway [13]. Autopsy findings for patients with SARS who died within 2 weeks after the onset of illness showed that SARS-CoV was widespread in a number of tissues and organs [14, 15], and, in some surviving patients, the virus remained for prolonged periods (up to 80 days) [16, 17].
Detection of virus in airway aspirates, serum samples, or fecal specimens beyond the 10th day of illness suggested continuing, unabated viral replication and indicated a poor prognosis [18]. In a small study of a series of postmortem lung tissues, detection of a high virus load in lung tissue was associated with a survival time of <21 days [19]. Given the assumption that all patients had neither preexisting protective immunity against SARS-CoV nor access to antiviral drugs, which can effectively alter the level of virus shedding, the biological basis for the differences in the virus burden in different patients is intriguing and has not yet been elucidated. In this report, the virus load of patients with confirmed SARS-CoV infection at the time of admission to the hospital was analyzed in relation to sex, age, and the genetic polymorphism of genes involved in proinflammation and innate immunity responses.

The database. The nationwide SARS database in Taiwan includes all laboratory-confirmed cases of SARS reported from March through June 2003. The details of data compilation have been described elsewhere [20, 21]. Our analysis focused on the initial nasopharyngeal SARS-CoV load at the time of admission to the hospital and included 154 patients whose virus loads were quantified using a commercially available RT-PCR assay (Artus Real Art HPA-Coronavirus; Artus), as well as 111 patients who had successive negative results of testing of nasopharyngeal samples and for whom diagnoses of SARS were based on a positive finding of seroconversion of SARS-neutralizing antibody.

The genetic study. The genetic study was designed to compare the genetic background of patients who had SARS with or without detectable nasopharyngeal SARS-CoV. All surviving patients with SARS were followed up, and 108 genetically unrelated patients with laboratory-confirmed SARS consented to participate in this study.
A reference group of 94 Taiwanese subjects was selected, on the basis of an age- and sex-stratified random sampling scheme, from a biobank compiled as part of a nationwide population study that enrolled 3312 Han Chinese descendants residing in Taiwan [22]. The study protocol, including an informed consent statement that conforms to the fifth edition of the Helsinki Declaration, was approved by the Medical Research Ethics Committee of the Institute of Biomedical Sciences (Taipei, Taiwan) (application #AS-IBMS-MREC-92-11).

Genes and single-nucleotide polymorphism (SNP) selection. We focused on genes whose products are either known to or have been predicted to interact with SARS-CoV in antiviral responses, inflammation, or immunostimulation (table 1) [23]. SNPs of these genes were identified in 5 databases: the SNP Consortium, the National Center for Biotechnology Information, GeneSNPs Public Internet Resource, GeneCards, and the Japan SNP Database. A total of 281 SNPs in 65 genes ( [25] .

SNP genotyping. Genotyping was performed using either (1) the Sequenom MassArray MALDI-TOF system (Sequenom) at the Taiwan National Genotyping Center core facility, by use of primers for SNP typing that were designed using Spectro-Designer software (Sequenom), or (2) the ABI 7000 system (Applied Biosystems), if the Sequenom system could not design the primers. For quality control, genotyping of up to 20% of the samples was repeated; any discrepancy was verified by sequence analysis, and the error rate was <1%.

Statistical analysis. Data analysis was performed, unless stated otherwise, by use of SAS, version 8, or SAS/Genetics (SAS Institute). Hardy-Weinberg equilibrium was tested for each SNP result of the Han reference population, and the genotyping assay was repeated or another method was used for testing if the results violated Hardy-Weinberg equilibrium. Haplotypes for all SNPs of each gene were predicted using the Phase 2.0.2 program [26].
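The Hardy-Weinberg check described above can be illustrated with a short sketch. The analysis in this study was run in SAS/Genetics; the Python below is only an assumed, minimal version of a standard 1-degree-of-freedom chi-square goodness-of-fit test for one biallelic SNP. The function name `hwe_chi_square` and the genotype counts in the usage note are hypothetical and are not taken from the study data.

```python
from math import erfc, sqrt


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square goodness-of-fit test (1 df) for Hardy-Weinberg equilibrium.

    n_aa, n_ab, n_bb are observed genotype counts for one biallelic SNP.
    Returns (chi-square statistic, P value).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotyped subject is required")

    # Allele frequencies estimated from the observed genotype counts.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p

    # Genotype counts expected under Hardy-Weinberg proportions.
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)

    # Cells with zero expectation (monomorphic SNP) contribute nothing.
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = erfc(sqrt(chi2 / 2.0))
    return chi2, p_value
```

Calling `hwe_chi_square(25, 50, 25)` on a hypothetical reference sample of 100 subjects gives a statistic of 0, because those counts match Hardy-Weinberg proportions exactly. A SNP with a small P value in the reference population would, under the protocol above, be re-genotyped or typed by another method.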
For each SNP, univariate analysis was performed by comparing the frequencies of alleles and genotypes between patients with or without detectable SARS-CoV in nasopharyngeal specimens. Tests for models of dominant/recessive or codominant traits were performed to identify the susceptible genotypes of the SNP. The empirical P value was calculated using the permutation test of 10,000 rounds for each SNP for which an association with the virus load was demonstrated. Tests of significance that were performed included the χ² test, the Cochran-Armitage trend test, Fisher's exact test, and the exact Mantel-Haenszel test. The continuous variables (i.e., age and virus load) were compared using the Wilcoxon rank sum test, to analyze factors that could have potentially affected the virus load, including demographic characteristics, the source of infection, and the duration of illness. Associations between nasopharyngeal SARS-CoV shedding and multiple variables were analyzed simultaneously in a cumulative logistic model. Interactions between genes were tested in a cumulative logistic model for virus detection.

[Table 1 footnote: Detailed information on the SNPs is provided in [23]. SNP, single-nucleotide polymorphism. a: HUGO Gene Nomenclature Committee-approved gene symbol [24].]

The virus load in the respiratory tract at the time of admission to the hospital or diagnosis of SARS ranged widely (from undetectable to 10^8 SARS-CoV RNA copies/mL) (figure 1). Although a longitudinal study of patients with SARS indicated that the nasopharyngeal virus load increased between the fifth and the 10th days of illness and decreased by the 15th day of illness [2], the virus loads at the time of admission of all patients with SARS in our data set did not reflect this trend of a rise and fall occurring during the first 2 weeks of the clinical course of SARS. Rather, on any given day within the first week of the clinical course of SARS, the nasopharyngeal virus load in different patients ranged widely.
Overall, 111 (41.9%) of 265 patients with SARS had an undetectable level of nasopharyngeal virus shedding, and this lack of detection of virus did not correlate with the time of specimen collection, because successive samples were obtained from these patients, and the test results for these specimens remained negative.

Factors influencing the level of virus shedding. Among patients with SARS, undetectable levels of nasopharyngeal SARS-CoV shedding were found more often among female patients (50.0%) than among male patients (27.4%), and they were also found more often among younger patients (53.1% of patients <30 years of age) than among older patients (37.7% of patients ≥30 years of age) (table 2 and figure 2). When only the virus titer of patients with detectable virus shedding was the focus of investigation, female patients consistently shed lower titers of virus than did male patients (P = .045, by the multivariate logistic regression model). Furthermore, in a multivariate cumulative logistic regression model in which (1) the date of sample collection was controlled, and (2) any underlying diseases and the source of infection (identifiable or not) were considered simultaneously, female sex (P = .001) and younger age (P = .005) were independently associated with a lower virus load, whereas having an underlying disease or a known source of infection did not influence virus shedding. Because the lack of virus detection might simply reflect lower levels of virus shedding at the time of sample collection during the early (<5 days of illness) or later (>15 days of illness) clinical phase of disease [2], the timing of collection was examined: the multivariate model confirmed that nasopharyngeal specimens in which levels of virus shedding were undetectable were collected at the same time relative to disease onset as were specimens in which virus levels were detectable; lack of detection of virus did not correlate with early sample collection.

Clinical significance of virus shedding.
We further analyzed the mortality rate relative to the levels of virus shedding, to validate the biological significance of detection of virus in nasopharyngeal specimens (table 2). The mortality rate was 46.2% among patients with >10^5 SARS-CoV RNA copies/mL detected in nasopharyngeal specimens, whereas the mortality rate was only 9.9% among patients with undetectable levels of virus in nasopharyngeal specimens. Furthermore, the duration of survival for the patients whose cases of SARS were fatal was significantly shortened (to <2 weeks), with increasing levels of virus shedding occurring during the acute phase of the disease (table 3). Of the patients with fatal cases of SARS who had >10^5 SARS-CoV RNA copies/mL detected in nasopharyngeal specimens, 77.8% died within the first 2 weeks of SARS illness, whereas, of the patients with fatal cases of SARS who did not have virus detected in nasopharyngeal specimens, only 18% died within the first 2 weeks of illness (P = .001, by Cochran-Armitage trend test).

Polymorphic genes associated with the detection of SARS-CoV in nasopharyngeal specimens. With mortality data validating the biological significance of undetectable nasopharyngeal virus shedding, we then compared the genotype distribution of SNPs between the 2 groups of patients with detectable or undetectable virus shedding (table 4 and figure 3). Of the 281 SNPs tested, 4 SNPs of 4 genes showed a significant association with virus load. The alleles IL-18 (IL18) -607T (rs1946518) in the promoter, IL-1A (IL1A) -889T (rs1800587) upstream from the ATG, RelB (reticuloendotheliosis viral oncogene homolog B) +23962T (rs2288918) in intron 5, counted from the ATG, and fibrinogen-like protein 2 (FGL2) +158A (rs2075761) (resulting in an amino acid substitution from glycine to glutamic acid) were overrepresented in patients with detectable levels of SARS-CoV.
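The empirical P values used to check these associations come from the permutation procedure named under Statistical analysis. The sketch below shows one common way such a test can be set up; it is an assumption for illustration and not the authors' SAS code. It shuffles the detectable/undetectable labels, recomputes a Pearson chi-square statistic on the label-by-genotype table each round, and reports the fraction of rounds that match or exceed the observed statistic. The function names, the add-one estimator, and the fixed seed are all choices made here, not details given in the paper.

```python
import random
from collections import Counter


def chi_square_stat(labels, genotypes):
    """Pearson chi-square statistic for the label-by-genotype table."""
    n = len(labels)
    row_totals = Counter(labels)
    col_totals = Counter(genotypes)
    cells = Counter(zip(labels, genotypes))
    stat = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n  # always > 0: both margins are non-empty
            stat += (cells[(r, c)] - expected) ** 2 / expected
    return stat


def empirical_p(labels, genotypes, rounds=10_000, seed=0):
    """Permutation P value: shuffle shedding labels, keep genotypes fixed."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    observed = chi_square_stat(labels, genotypes)
    shuffled = list(labels)
    hits = 0
    for _ in range(rounds):
        rng.shuffle(shuffled)
        # Small tolerance guards against floating-point ties.
        if chi_square_stat(shuffled, genotypes) >= observed - 1e-12:
            hits += 1
    # Add-one estimator keeps the estimate away from exactly zero.
    return (hits + 1) / (rounds + 1)
```

Here `labels` would hold one entry per patient (for example, 1 for detectable and 0 for undetectable shedding) and `genotypes` the matching genotype strings for a single SNP. Because the null distribution is built from the data themselves, this approach does not rely on large-sample approximations, which is useful with a cohort of this size.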
For the genotype distribution of each SNP, empirical P values were calculated using permutation tests of 10,000 rounds and confirmed the statistical significance of P < .05 (table 4). Analysis performed by further stratification of these 4 genes did not suggest any interactions between these genes that contributed to the level of virus shedding. Multivariate analysis confirmed that all loci were independently associated with virus load (table 4). Furthermore, the homozygous individuals possessing 2 alleles of -607T IL18 had increased risk of virus shedding, compared with the heterozygous individuals possessing only 1 allele when genotypes of the other 3 alleles were controlled. A total of 13 SNPs in the 4 genes were typed, and polymorphisms were found at 2 of 6 IL18 loci (-607 and -137), at 1 of 2 RelB loci, at 1 IL1A locus, and at 3 of 4 FGL2 loci (-1285, -656, and +158). The haplotypes were analyzed for IL18 and FGL2; only a single locus of IL18 (-607) and FGL2 (+158) showed an association with virus load. We compared both the allelic and genotypic distributions of these 4 loci between the 94 patients with SARS and the 94 healthy Taiwanese in the reference group, and we found no statistical difference (table 4).

The clinical significance of the virus burden is intuitive, but it has been difficult to directly demonstrate. In the present report, we showed the dramatic interindividual difference in virus burden and the direct correlation of such virus burden with clinical outcomes. Our analysis further indicated that a high SARS-CoV load is associated with early death, in addition to a high mortality rate, as previously reported elsewhere [10-12].
Therefore, our results suggest that (1) the level of nasopharyngeal virus shedding during the acute phase of SARS (<2 weeks of illness) can be monitored as an index for therapeutic and clinical progress, and (2) identification of genes associated with virus load can potentially provide leads to pathways involved in virus clearance during the early phase of infection. To the best of our knowledge, this is the first study to probe into the factors influencing the SARS-CoV load. The exploratory nature of this genetic study necessitated the screening of a large number of SNPs; applying the conventional Bonferroni correction for multiple tests (n = 281) would not be most appropriate. However, the internal validity of results based on a small sample size in an epidemic setting was further substantiated by the results of permutation tests. On the basis of a preset level of statistical significance, 4 SNPs were singled out for further analysis and discussion. An additional 10 SNPs of 6 genes, which showed differences of >10% in the allele frequency between patients with or without detectable SARS-CoV but for which the differences did not reach the level considered to be significant (figure 3), were also highlighted and should be investigated if the opportunity arises. These candidate genes were selected on the basis of the biological plausibility of innate immune responses to viral infections in a broad sense, rather than specific to SARS-CoV; thus, similar considerations may be pertinent and applicable to a potential influenza pandemic in which no individuals have preexisting immunity and in which the effects of the innate immunity of the hosts would be most conspicuous. Three of the 4 SNPs (IL18, IL1A, and RelB) belong to proinflammatory genes. Both IL18 and IL1 signal through pathways involving the transcription factor NFκB [27-29], on which several proinflammatory pathways converge and of which RelB is a subunit.
RelB belongs to the Rel proteins that are evolutionarily conserved mediators of immune responses and are especially important to cytokine expression and the differentiation and survival of cells in the immune system [30]; RelB mutant mice display a severe and fatal multiorgan inflammation. The induction of NFκB, of which RelB is a subunit, is reduced in T lymphocytes from old mice [31] and is a potential mechanism to explain the senescence of immunity. Whether the genetic variant of RelB, which is associated with higher virus load, represents a suppressed or an overly reactive function of RelB (i.e., inflammation signaling) remains elusive. IL18, which is a member of the IL1 family and also is a pleiotropic proinflammatory cytokine in activating natural killer cells and enhancing the T helper type 1 cell immune response, is constitutively expressed in the lung [32]. IL18 knockout mice were shown to clear influenza virus more effectively [33], and the polymorphic alleles of IL1A (-889) have been reported to predict control of plasma viremia in patients with HIV-1 infection who are undergoing antiretroviral therapy [34], suggesting that both IL18 and IL1A participate in the clearance of viral infection in a broad sense. Estrogen has been shown to reduce IL18 mRNA levels in the mouse uterus [35], suggesting that regulation of IL18 expression could be sex dependent. The outcomes of infection and the level of response to vaccination have been noted to demonstrate sex differences (see Beagley et al. [36] and Morales-Montor et al. [37] for reviews); the innate and adaptive immunity may be regulated through sex hormones [38]. A higher mortality rate among male patients with SARS was suggested in an analysis of the Hong Kong and Taiwan databases [39]; it would be more informative if the analysis had included information on nasopharyngeal virus burden, which is currently a commonly used method of diagnosing SARS and other viral infections.
FGL2, an IFN-γ-inducible protein expressed by lymphocytes, macrophages, and endothelium, is a prothrombinase that has been reported to contribute to fibrin deposition during viral hepatitis, and its expression is associated with a number of pathological conditions [40-42]. The inclusion of FGL2 in the present study was based on reports that FGL2 expression could be induced by the nucleocapsid protein of virulent mouse hepatitis virus, also a coronavirus [43]. The FGL2 SNP at +158 also shows an association with the clinical severity of SARS (W. Human FGL2 mRNA was shown to be preferentially expressed in memory T lymphocytes (CD3+/CD45R0+) in one study [44]. Patients with SARS who were of advanced age had a higher mortality rate [1-4] and higher virus loads than did younger patients in this study; how these observations are related to the preferential expression of FGL2 in memory T lymphocytes remains to be investigated. Most patients with the highest level of virus shedding (>10^6 SARS-CoV RNA copies/mL) were not included in our genetic analysis because of their early death. Therefore, it was not possible to conduct a multivariate analysis to simultaneously examine the effects of age, sex, and genetic polymorphism on death. The biological basis of the increased virus load in older patients remains elusive, be it the senescence of immunity (see Stout and Suttles [45] for a review), which undermines the efficacy of both innate and adaptive immunity, or the enhanced virus infection, for which age is a confounder [21]. Because the Han descendants, who constitute >98% of the population of Taiwan, are genetically homogeneous, according to a recent study of the SNPs of the major histocompatibility complexes [46], population stratification that confounds our results is not of major concern.
Furthermore, this work constitutes a population-based study in that all surviving patients with SARS were solicited, and no selection bias was involved in the classification of participating patients with SARS into case and control groups that were based on results of tests to determine virus burden. In conclusion, the SARS-CoV load during the initial phase of infection, which is crucial to the clinical outcome of patients with SARS, is shown, for what we believe is the first time, to be associated with polymorphisms of genes involved in innate immunity. These genes, although not claimed to be a comprehensive list, have exemplified potential pathways of innate immunity in response to SARS-CoV or, more broadly, to other viral infections. It is imperative to proceed with functional studies to test whether these genes are indeed functionally active in signaling pathways to the direct suppression of virus replication or to the clearance of virally infected cells. Understanding how the interplay between the virus load during the early stage (<2 weeks of illness) and the later stage (>2 weeks of illness) of host response might contribute to the clinical outcome of SARS should be helpful in the clinical management of patients with SARS.

Clinical features and short-term outcomes of 144 patients with SARS in the greater Toronto area
Clinical progression and viral load in a community outbreak of coronavirus-associated SARS pneumonia: a prospective study
The spectrum of severe acute respiratory syndrome-associated coronavirus infection
Severe acute respiratory syndrome: clinical outcome and prognostic correlates
The spectrum of pathological changes in severe acute respiratory syndrome (SARS)
Severe acute respiratory syndrome: radiographic appearances and pattern of progression in 138 patients
Plasma inflammatory cytokines and chemokines in severe acute respiratory syndrome
Characterization of cytokine/chemokine profiles of severe acute respiratory syndrome
An interferon-gamma-related cytokine storm in SARS patients
Initial viral load and the outcomes of SARS
Viral replication in the nasopharynx is associated with diarrhea in patients with severe acute respiratory syndrome
Quantitative analysis and prognostic implication of SARS coronavirus RNA in the plasma and serum of patients with severe acute respiratory syndrome
Detection of SARS-associated coronavirus in throat wash and saliva in early diagnosis
Study on etiology and pathology of severe acute respiratory syndrome
Fatal severe acute respiratory syndrome is associated with multiorgan involvement by coronavirus
Persistence of lung inflammation and lung cytokines with high-resolution CT abnormalities during recovery from SARS
Duration of RT-PCR positivity in severe acute respiratory syndrome
Viral loads in clinical specimens and SARS manifestations
Severe acute respiratory syndrome-associated coronavirus in lung tissue
Serologic and molecular biologic methods for SARS-associated coronavirus infection
Neutralizing antibody response and SARS severity
Chinese cell and genome bank in Taiwan: purpose, design and ethical considerations
Table 1: the 64 candidate genes
Strategies for characterizing highly polymorphic markers in human gene mapping
IL-18 and IL-12 signal through the NF-κB pathway to induce NK-1R expression on T cells
IFN-alpha and IL-12 induce IL-18 receptor gene expression in human NK and T cells
Pulmonary epithelial cells are a source of IL-8 in the response to Mycobacterium tuberculosis: essential role of IL-1 from infected monocytes in a NF-κB-dependent network
NF-κB and Rel proteins: evolutionarily conserved mediators of immune responses
Induction and regulation of NFκB during aging: role of protein kinases
Interleukin-1 and neuronal injury
Enhanced viral clearance in interleukin-18 gene-deficient mice after pulmonary infection with influenza A virus
Alleles of the gene encoding IL-1α may predict control of plasma viraemia in HIV-1 patients on highly active antiretroviral therapy
Estrogen inhibits interleukin-18 mRNA expression in the mouse uterus
Regulation of innate and adaptive immunity by the female sex hormones oestradiol and progesterone
Host gender in parasitic infections of mammals: an evaluation of the female host supremacy paradigm
Estrogen regulates CCR gene expression and function in T lymphocytes
Do men have a higher case fatality rate of severe acute respiratory syndrome than women do?
The fgl2 prothrombinase/fibroleukin gene is required for lipopolysaccharide-triggered abortions and for normal mouse reproduction
Molecular and functional analysis of the human prothrombinase gene (HFGL2) and its role in viral hepatitis
The Fgl2/fibroleukin prothrombinase contributes to immunologically mediated thrombosis in experimental and human viral hepatitis
Induction of prothrombinase fgl2 by the nucleocapsid protein of virulent mouse hepatitis virus is dependent on host hepatic nuclear factor-4α
Characterization of human fibroleukin, a fibrinogen-like protein secreted by T lymphocytes
Immunosenescence and macrophage functional plasticity: dysregulation of macrophage function by age-associated microenvironmental changes
A comparison of major histocompatibility complex SNPs in Han Chinese residing in Taiwan and Caucasians

We thank all the public health nurses who helped collect epidemiological data; the Division of Surveillance staff at the Taiwan Center for Disease Control who compiled the epidemiological database; and Yu-Jen Liang, who assisted in data analysis.