key: cord-0754130-z0s5g1fp authors: Schindler, Emily; Dribus, Marian; Duffy, Brian F.; Hock, Karl; Farnsworth, Christopher W.; Gragert, Loren; Liu, Chang title: HLA genetic polymorphism in patients with Coronavirus Disease 2019 in Midwestern United States date: 2021-08-10 journal: HLA DOI: 10.1111/tan.14387 sha: 29152a54c43c896a1677b9f50c963ec5231917f8 doc_id: 754130 cord_uid: z0s5g1fp The experience of individuals with Coronavirus Disease 2019 (COVID‐19) ranges from asymptomatic to life threatening multi‐organ dysfunction. Specific HLA alleles may affect the predisposition to severe COVID‐19 because of their role in presenting viral peptides to launch the adaptive immune response to severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2). In this population‐based case–control study in the midwestern United States, we performed high‐resolution HLA typing of 234 cases hospitalized for COVID‐19 in the St. Louis metropolitan area and compared their HLA allele frequencies with those of 22,000 matched controls from the National Marrow Donor Program (NMDP). We identified two predisposing alleles, HLA‐DRB1*08:02 in the Hispanic group (OR = 9.0, 95% confidence interval: 2.2–37.9; adjusted p = 0.03) and HLA‐A*30:02 in younger African Americans with ages below the median (OR = 2.2, 1.4–3.6; adjusted p = 0.01), and several candidate alleles with potential associations with COVID‐19 in African American, White, and Hispanic groups. We also detected risk‐associated amino acid residues in the peptide binding grooves of some of these alleles, suggesting the presence of functional associations. These findings support the notion that specific HLA alleles may be protective or predisposing factors to COVID‐19. Future consortium analysis of pooled cases and controls is warranted to validate and extend these findings, and correlation with viral peptide binding studies will provide additional evidence for the functional association between HLA alleles and COVID‐19. 
and 435,151, respectively, as of January 31, 2021 (www.cdc.gov). In addition to known COVID-19 cases, 86% of all infections are predicted to be undocumented and likely to be mild. 3 Patients with moderate to severe COVID-19 likely represent only a fraction of the total number of infected individuals. 4 SARS-CoV-2 appears to elicit highly heterogeneous innate and adaptive immune responses, leading to drastically different outcomes. Although older age and certain comorbidities are known to contribute to increased mortality, [5] [6] [7] younger and seemingly healthy patients are not completely protected from severe COVID-19. 8 The broad demographics and range of severity among COVID-19 patients have been reported by multiple epidemiologic studies from different regions in the United States. [7] [8] [9] Importantly, some self-reported symptoms of COVID-19 show increased correlation among monozygotic twins, 10 suggesting that the predisposition to symptomatic COVID-19 may be heritable. Therefore, it is imperative to elucidate the immunogenetic underpinning of the diverse host responses to determine who is at risk for COVID-19 and why. HLA molecules play an essential role in the defense against viral infections. They present peptides derived from viral proteins to T cell receptors to initiate adaptive immunity mediated by pathogen-specific T and B cells. 11 Class I HLA molecules, encoded by HLA-A, -B, and -C genes, are ubiquitously expressed and present peptides to CD8+ T cells; class II HLA molecules, heterodimers encoded by HLA-DRA/DRB1/3/4/5, -DQA1/DQB1, -DPA1/DPB1, are primarily expressed on antigen-presenting cells and present peptides to CD4+ T cells. To accommodate peptides derived from a broad spectrum of pathogens, diverse HLA molecules are encoded by thousands of different alleles in the human population. 12 However, as each individual has at most two alleles per locus, an individual's peptide repertoire is more limited than that of the population.
Some HLA molecules may be disadvantageous, compared with others, in presenting peptides derived from SARS-CoV-2, as suggested by in silico modeling. 13 However, there are no published data on HLA allele frequencies in COVID-19 patients of different population categories in the US. Several HLA alleles have been associated with the susceptibility to and different outcomes of SARS caused by SARS-CoV in 2003. [14] [15] [16] [17] [18] The association of HLA alleles and COVID-19 has been examined in several populations, primarily in China 19 and Italy. [20] [21] [22] This study aims to identify HLA alleles associated with moderate to severe COVID-19 among several patient populations in the St. Louis metropolitan area, as compared with matched population controls. We hypothesize that HLA alleles predisposing to symptomatic infection by SARS-CoV-2 are enriched in patients hospitalized for COVID-19. We conducted a population-based case-control study, focusing on classical HLA genes typed by next-generation sequencing, and performed analyses at the allele and protein sequence levels. The study was approved by the Human Research Protection Office of Washington University in St. Louis (IRB ID #: 202004002) and the Institutional Review Board of Mercy Hospital (IRB ID #: 1599032-2). Cases consisted of adult inpatients between the ages of 18 and 90 years who were hospitalized for COVID-19 at Barnes-Jewish Hospital or Mercy Hospital, two of the largest medical centers serving the St. Louis metropolitan area. All cases had a remnant, EDTA-anticoagulated blood specimen available in the clinical laboratories. SARS-CoV-2 infections were confirmed by real-time reverse transcriptase-polymerase chain reaction (RT-PCR) testing of nasopharyngeal swabs. A total of 234 cases were enrolled and HLA typed during the study period from March 26 to July 7, 2020.
A total of 22,000 population controls were randomly selected from the National Marrow Donor Program (NMDP) volunteer adult donor registry recruited since 2005. Controls were matched for one of four self-identified "race/ethnic" population categories on the NMDP registry donor recruitment form (African American, Asian Pacific Islander, Whites, and Hispanics), gender, age quartiles, and the first digit of zip codes. A total of 10,000 controls were retrieved for the Whites. Because of the limited availability of minority donors in the NMDP, 4000 controls were retrieved for each of the other groups. Because of the registry recruitment policy of NMDP, the maximum control age (60 years) was younger than the oldest cases. Considering this caveat and the increased comorbidities of elderly patients, ad hoc analysis was performed for younger African Americans (n = 76) and Whites (n = 27) with ages below the medians of 64 and 68 years, respectively, and their matched population controls (4000 for the African Americans and 10,000 for the Whites); the sample sizes of Asian Pacific Islanders and Hispanics were too small to be dichotomized. Population controls from NMDP were HLA typed at high resolution, primarily by next-generation sequencing and also by sequence-specific oligonucleotide or Sanger sequence-based typing. 23 The following clinical data were collected for all cases by retrospective chart review and entered into a REDCap database 24 : demographics (age, gender, self-reported population category, zip code, and BMI), comorbidities (diabetes mellitus, chronic lung diseases, and cardiovascular diseases), duration of hospitalization, ICU admission, mechanical ventilation, and time of last encounter or death. Genomic DNA was extracted from remnant peripheral blood specimens using the EZ1 DNA Blood 350 μl Kit (Qiagen, Hilden, Germany). A total of 192 samples were typed by the AllType assay (One Lambda, West Hills, CA) on the Ion Chef/S5 Ion Torrent platform.
25 A total of 42 samples were amplified using the NGS LR kit (One Lambda, West Hills, CA) and sequenced following the SQK-LSK109 protocol on the R10.3 MinION flow cells (FLO-MIN111, Oxford Nanopore Technologies). 26 Genotypes of HLA-A, -B, -C, -DPA1, -DPB1, -DQA1, -DQB1, -DRB1, -DRB3/4/5 genes were assigned based on key-exon sequences (G groups) and limited to the 2-field resolution. The demographics of cases and controls were reported with standard descriptive statistics, including counts, proportions, and medians and ranges, as appropriate. All association analyses were performed for each population separately. For the allele association analysis at the HLA-A, -B, -C, -DRB1, and -DQB1 loci, frequencies of alleles in cases were compared with those in controls by Fisher's exact test using the PyHLA package (version 1.1.1). 27 The default allelic genetic model was used to compare one allele against other alleles grouped together, and the default minimum allele frequency of 0.05 was applied. Associations with unadjusted p value <0.05 were reported as candidate alleles of interest. Multiple comparisons of alleles with frequencies of 0.05 or higher were adjusted in the above analyses by controlling the false discovery rate at 5% using the Benjamini-Hochberg procedure, 28 and an adjusted p value <0.05 was considered statistically significant. The amino acid association analysis was also performed with the PyHLA package using default options. Amino acid associations with unadjusted p value below 0.05 were further examined if they were carried by protective or predisposing alleles. The locations of these residues were visualized within available crystal structures using PyMOL (Molecular Graphics System, Version 2.4.1, Schrödinger, LLC.) to determine their relevance to peptide presentation. Cases consisted of 167 African Americans, 56 Whites, 7 Asian Pacific Islanders, and 4 Hispanics.
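The allele-level test described in the methods (one allele against all other alleles grouped together, Fisher's exact test, then Benjamini-Hochberg adjustment) can be illustrated with a minimal Python sketch. This is not the PyHLA implementation: the function names `fisher_exact_two_sided` and `benjamini_hochberg`, the allele labels, and every count in `example_counts` are hypothetical and chosen only for illustration.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p value for the 2x2 table [[a, b], [c, d]].

    a, b: copies of the allele of interest / all other alleles in cases.
    c, d: the same two counts in controls.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of observing x copies in cases.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every possible table that is no more likely than the observed one.
    total = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical data: allele -> (copies in cases, copies in controls).
example_counts = {"allele_1": (12, 300), "allele_2": (30, 1500), "allele_3": (5, 240)}
n_case_alleles = 2 * 100    # assumed 100 cases, two alleles per locus each
n_ctrl_alleles = 2 * 4000   # assumed 4000 controls

names = list(example_counts)
raw_p = [
    fisher_exact_two_sided(k, n_case_alleles - k, m, n_ctrl_alleles - m)
    for k, m in (example_counts[name] for name in names)
]
adj_p = benjamini_hochberg(raw_p)
for name, p, q in zip(names, raw_p, adj_p):
    print(f"{name}: unadjusted p = {p:.4g}, adjusted p = {q:.4g}")
```

In this sketch an allele would be flagged as a candidate when its unadjusted p value is below 0.05 and as statistically significant when its adjusted p value is below 0.05, mirroring the thresholds stated in the methods.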
The baseline demographics for cases and controls are shown for each population in Table 1. While the geographic location, gender ratio, and median age were well matched between the cases and controls, the cases were skewed toward older ages because of the maximum age of 60 years in the NMDP controls (Table 1). All COVID-19 cases were confirmed by positive RT-PCR testing and hospitalized for treatment. A total of 121 patients (51.7%) were admitted to intensive care units (ICU), and 75 patients (32.1%) received mechanical ventilation. The overall mortality rate was 21.8% among the cases. A broad range of comorbidities was documented, with chronic cardiac disease (73.5%) and diabetes (41.9%) being the most common. The clinical characteristics of cases are listed for each population in Table 2. Because of limited sample size and statistical power, we examined alleles with overall frequencies above 5% in the primary analysis. We identified one protective allele in African Americans, and five predisposing alleles in Whites and Hispanics. Table 3 shows the counts and frequencies of these alleles in cases and controls, overall frequencies, unadjusted and adjusted p values, odds ratios (OR), and the 95% confidence intervals of ORs. Only HLA-DRB1*08:02 in Hispanics remained statistically significant after adjusting for multiple comparisons (OR = 9.0, adjusted p = 0.03). HLA-DRB1*08:02 was detected in three of the four cases, all heterozygous, giving an allele frequency of 37.5%, while its frequency in the matched population controls was 6.2% (Table 3). Among other groups of the cases, the frequencies of HLA-DRB1*08:02 were 0%, 0.9%, and 7.1% in the African Americans, Whites, and Asian Pacific Islanders, respectively; the allele frequencies in the corresponding population control groups were 0.3%, 0.1%, and 0.7%. Results for all alleles analyzed in the four populations are provided in Table S1.
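The odds ratio reported for HLA-DRB1*08:02 in Hispanics can be approximately reconstructed from the frequencies quoted above. The sketch below takes 3 copies among 8 case alleles (four cases, 37.5%) and back-calculates roughly 496 copies among 8000 control alleles from the 6.2% control frequency and the 4000 Hispanic controls; these control counts are an assumption, not values read from Table 3. The confidence interval uses the Woolf (log odds) method, which may differ from the method used by PyHLA, and `odds_ratio_with_ci` is a hypothetical helper name.

```python
from math import exp, log, sqrt


def odds_ratio_with_ci(case_with, case_without, ctrl_with, ctrl_without, z=1.96):
    """Odds ratio and Woolf (log-based) confidence interval for a 2x2 table.

    All four cells must be non-zero; z = 1.96 gives an approximate 95% interval.
    """
    odds_ratio = (case_with * ctrl_without) / (case_without * ctrl_with)
    # Standard error of log(OR) is the root of the summed reciprocal cell counts.
    se = sqrt(1 / case_with + 1 / case_without + 1 / ctrl_with + 1 / ctrl_without)
    lower = exp(log(odds_ratio) - z * se)
    upper = exp(log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


# Cases: 4 Hispanic patients -> 8 alleles, 3 of them HLA-DRB1*08:02 (37.5%).
case_with, case_without = 3, 5
# Controls: ASSUMED counts, back-calculated as 6.2% of 2 * 4000 alleles.
ctrl_total = 2 * 4000
ctrl_with = round(0.062 * ctrl_total)
ctrl_without = ctrl_total - ctrl_with

or_value, ci_low, ci_high = odds_ratio_with_ci(
    case_with, case_without, ctrl_with, ctrl_without
)
print(f"OR = {or_value:.1f} (95% CI {ci_low:.1f} to {ci_high:.1f})")
```

Under these assumptions the odds ratio comes out near 9 with a wide interval, in line with the reported OR = 9.0 (2.2-37.9). The width of the interval is driven almost entirely by the three observed copies in cases, which is why the finding is best read as suggestive.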
In the ad hoc analysis of younger African Americans with ages below the median against their matched population controls, HLA-A*30:02 was associated with an increased risk of COVID-19 (OR = 2.2, unadjusted p = 0.0017, adjusted p = 0.01). The frequencies of HLA-A*30:02 were 13.8% and 6.7% in the patients and population controls, respectively (Table 3). Among other groups of the cases, HLA-A*30:02 was not detected in the Whites, Asian Pacific Islanders, or Hispanics; the allele frequencies in the corresponding population control groups were 1%, 0.1%, and 2.2%. Among younger Whites, one predisposing allele, HLA-A*11:01, was detected (OR = 2.4, unadjusted p = 0.04); however, it was no longer statistically significant after adjusting for multiple comparisons (Table 3). Results for all alleles analyzed in the younger African Americans and Whites are provided in Table S2. In African Americans, we identified two potentially protective amino acid residues in the peptide-binding groove of HLA-B, a serine at position 24 and a threonine at position 163 (Table 4 and Figure 1A), which are carried by the protective candidate allele HLA-B*42:01 (Table 3). No amino acid residues were associated with COVID-19 in the Whites that were carried by the two potential predisposing alleles identified in the allele association analysis. In Hispanics, we found nine potentially predisposing residues carried by HLA-C*04:01, -DQB1*04:02, and -DRB1*08:02 (Table 4); five of these residues were located in the peptide-binding grooves of the respective molecules (Figure 1B-D). Finally, in the ad hoc analysis of younger African Americans, we identified two predisposing residues located in the peptide-binding groove of HLA-A*30:02 (Table 4; Figure 1E). In this study, we identified HLA-DRB1*08:02 and HLA-A*30:02 as potential risk factors for symptomatic SARS-CoV-2 infection in the Hispanics and younger African Americans, respectively, relative to their matched population controls.
We also report several potentially protective and predisposing candidate alleles found in the African Americans, Whites, and Hispanics as well as several amino acid residues with potential implications in altered peptide presentation during the immune response to SARS-CoV-2. The study followed a prespecified protocol for case enrollment and data analysis, and ad hoc analysis was performed for younger African Americans and Whites. High-quality genotyping data of cases and controls enabled the analysis at allele and amino acid levels. Despite the modest sample size, this is the first report on the HLA and COVID-19 associations in cases of diverse populations in the United States. The preliminary findings in the Hispanics and younger African Americans are novel. It is of paramount importance to examine the immunogenetics of COVID-19 in these minority populations, whose rates of hospitalization and mortality are two to three times those of White, non-Hispanic populations. 29 Our findings support the notion that specific HLA alleles may contribute to the protection from or predisposition to severe COVID-19. Although experimental evidence for a functional association remains lacking, the discovery of several associated amino acids in the peptide-binding grooves of both class I and II molecules is consistent with the role of HLA-restricted peptide presentation in the susceptibility to symptomatic SARS-CoV-2 infection. Our findings add to the growing literature on the interaction between HLA and COVID-19 from studies that vary in their approaches and study designs. Several epidemiology studies investigated the correlation between HLA genotype frequencies and regional prevalence or fatality rates of COVID-19.
The frequencies of HLA-A*11:01 in 21 countries correlated negatively with the fatality rates of COVID-19 in corresponding countries, 30 while HLA-A*02:01 was reported to be associated with increased risk for COVID-19. 31 In Italy, higher regional frequencies of HLA-B*44 and -C*01 independently correlated with a faster local growth rate of SARS-CoV-2 infections; at the haplotype level, Pisanti et al reported the positive correlation between regional frequencies of the HLA-A*01:01g-B*08:01g-C*07:01g-DRB1*03:01g haplotype and the local prevalence and mortality of COVID-19. 32 Although these epidemiological studies indicate protective or permissive roles of HLA-related factors in SARS-CoV-2 infection, their findings have not been replicated across studies. Case-control studies offered another opportunity to uncover the immunogenetic underpinning of COVID-19. While a recent genome-wide association study (GWAS) performed in Italian and Spanish populations did not identify significant signals for COVID-19 in the HLA region, 22 several case-control studies from China and Italy reported a few significant associations. Wang and colleagues reported significantly increased counts of HLA-B*15:27 (n = 8; 4.9% of cases) in 82 mild to severe COVID-19 cases in Zhejiang province, China, as compared with 3548 controls from a local marrow donor registry. HLA-C*07:29 also reached statistical significance in this study but occurred only once (0.6% of cases), so this association may not be reliable. Novelli et al observed increased frequencies of HLA-B*27:07, -DRB1*15:01, and -DQB1*06:02 in 99 Italian COVID-19 patients as compared with 1017 Italian individuals previously typed in the authors' laboratory. 21

FIGURE 1 Location of amino acid residues associated with COVID-19. Ribbon models were created using PyMOL and representative structures from the Protein Data Bank (PDB). The PDB ID is listed for each model. Amino acid residues of interest are highlighted as spheres in salmon or magenta. Class I α chain and β2-microglobulin are colored green and blue, respectively, in A, B, and E. Class II α chain and β chain are colored green and blue, respectively, in C and D.

Another case-control study by Amoroso et al reported that HLA-DRB1*08 was associated with almost doubled risk for COVID-19 in solid-organ transplant recipients and waitlisted candidates in Italy; HLA-DRB1*08-positive COVID-19 patients also had significantly increased mortality. 20 These results, despite being from a population of transplant recipients and candidates, were consistent with our suggestive finding of increased HLA-DRB1*08:02 among a small number of Hispanic COVID-19 patients. While HLA-DRB1*08:02 is more frequently found in Hispanics in North and South America, HLA-DRB1*08:01 is the dominant HLA-DRB1*08 allele found in Europe and is probably carried by most of the HLA-DRB1*08-positive cases in the Amoroso study (Figure S1). 33 Of note, HLA-DRB1*08:01 and 08:02 share the two risk-associated amino acid residues, 13Gly and 74Leu, which we hypothesize might be responsible for their poor binding affinity with SARS-CoV-2 peptides. Although our finding was limited to three observations among four Hispanics, these cases were not known to be related and did not share a common haplotype. HLA-A*30:02 has been associated with an increased risk for type 1 diabetes, 34 while its role in viral infection control has not been widely known. In a preprint article, HLA-A*30:02 was found to be enriched among COVID-19 patients compared with controls without COVID-19, although the association was not statistically significant after adjusting for multiple comparisons.
35 The study appeared to be underpowered with 100 COVID-19 patients and 26 controls, and the descent or ethnicity of the study population was not reported. Our finding of HLA-A*30:02 as a risk factor for COVID-19 among younger African Americans is consistent with this earlier report and needs to be confirmed by further studies. To establish a functional association between HLA and susceptibility to severe COVID-19, future research will need to demonstrate the unproductive presentation of viral peptides by specific HLA molecules. However, because of the large size of the SARS-CoV-2 peptide repertoire and the diversity of HLA in the human population, in silico modeling has been frequently used to narrow down risk-associated alleles and to identify peptide targets for vaccine development. 13, [36] [37] [38] [39] As the modeling strategies and tools differ among studies, 36 various predisposing alleles have been reported, as expected. 13, 31, 40, 41 Some studies also correlated lower class I peptide binding capacity with disease severity among COVID-19 patients, 42 or predicted altered binding of variant viral peptides and specific HLA alleles. 43 In one of the most comprehensive analyses of HLA binding affinities of viral peptides, a list of the strongest and weakest binders of SARS-CoV-2 peptides was predicted. 40 One predisposing candidate allele found in our study, HLA-C*04:01, was among the weakest binders of all viral peptides in that analysis. 40 Finally, the HLA-DRB1*08 allele group was predicted to be unable to bind SARS-CoV-2 peptides at high affinities by Amoroso et al 20 and our own modeling (data not shown), which supports the finding of HLA-DRB1*08:02 as a potential predisposing allele in our study. Our study has important limitations.
Although the sample size is comparable to most single-center case-control studies in the literature, ideally thousands of cases would be pooled in a consortium setting for each population to maximize the power for detailed mapping of HLA-disease associations. Therefore, protective and risk alleles with low to moderate effect sizes might have been missed in our study. We also used gender- and geography-matched population controls from a donor registry to demonstrate an enrichment of risk alleles among hospitalized COVID-19 cases. Age matching was limited because stem cell donors older than 60 years are not recruited; however, little evidence suggests that HLA frequencies vary substantially by age within self-reported ethnic categories in the general population. The comorbidities of the population controls were not available, so we could not control for comorbidities in this study, and any potential interactions with HLA would remain undetected. The disease status of the controls was unknown; compared with similar-sized studies that use known disease-free controls, this study design therefore has a higher likelihood that true associations would remain undetected. However, our approach of using stem cell donors has the benefit that a larger number of controls is available. An alternative study design would enroll COVID-19 patients with no or mild symptoms as controls from the same location as the cases, which could allow the identification of HLA alleles associated with moderate to severe COVID-19. However, this strategy may limit the sample size of controls available for the study. Additionally, the disease severity of COVID-19 may be dynamic, requiring longitudinal follow-up for symptoms.
In summary, we conducted a population-based case-control study involving multiple populations in the Midwestern United States and identified HLA-DRB1*08:02, HLA-A*30:02, and several other candidate alleles with increased or decreased frequencies among hospitalized COVID-19 patients compared with matched population controls. As the suggestive finding in Hispanics was based on a small number of cases, caution is needed in its interpretation. We also determined the amino acid residues in these alleles that may be involved in peptide presentation during the immune response to SARS-CoV-2. Future consortium analysis of pooled cases and controls is warranted to validate and extend these findings, and correlation with peptide binding studies will provide additional evidence supporting the functional association between HLA alleles and severe COVID-19. Research fund from Washington University School of Medicine Department of Pathology and Immunology (C.L.) and NIH NIAID award number R41 AI142919-01 (C.L.). We thank Megan Arb of the Center for Clinical Studies of Washington University School of Medicine for helping with the data collection. We thank National Marrow Donor Program/Be The Match for providing control data and Martin Maiers for kindly reviewing the manuscript. We thank One Lambda Inc for providing the AllType and NGS LR kits and sequencing reagents for the Ion Chef/Ion Torrent S5 platform for this study. The authors have declared no conflicting interests. Designed the study: Emily Schindler, Christopher W. Farnsworth, Loren Gragert, and Chang Liu. The data that support the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.
Loren Gragert https://orcid.org/0000-0002-5945-6518
Chang Liu https://orcid.org/0000-0001-6624-1454

1. A pneumonia outbreak associated with a new coronavirus of probable bat origin
2. Disease burden and clinical severity of the first pandemic wave of COVID-19 in Wuhan, China
3. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2)
4. Characteristics of and important lessons from the Coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72,314 cases from the Chinese center for disease control and prevention
5. Mild or moderate Covid-19
6. Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China
7. Preliminary estimates of the prevalence of selected underlying health conditions among patients with Coronavirus Disease 2019 - United States
8. Characteristics and clinical outcomes of adult patients hospitalized with COVID-19 - Georgia
9. Hospitalization rates and characteristics of patients hospitalized with laboratory-confirmed Coronavirus disease 2019 - COVID-NET, 14 states
10. Self-reported symptoms of covid-19 including symptoms most predictive of SARS-CoV-2 infection, are heritable. bioRxiv
11. Janeway's Immunobiology
12. IPD-IMGT/HLA Database
13. Human leukocyte antigen susceptibility map for severe acute respiratory syndrome Coronavirus 2
14. Association of HLA class I with severe acute respiratory syndrome coronavirus infection
15. Association of human-leukocyte-antigen class I (B*0703) and class II (DRB1*0301) genotypes with susceptibility and resistance to the development of severe acute respiratory syndrome
16. Human-leukocyte antigen class I Cw 1502 and class II DR 0301 genotypes are associated with resistance to severe acute respiratory syndrome (SARS) infection
17. Epidemiological and genetic correlates of severe acute respiratory syndrome coronavirus infection in the hospital with the highest nosocomial infection rate in Taiwan in 2003
18. Association of human leukocyte antigen class II alleles with severe acute respiratory syndrome in the Vietnamese population
19. Distribution of HLA allele frequencies in 82 Chinese individuals with coronavirus disease-2019 (COVID-19)
20. HLA and AB0 polymorphisms may influence SARS-CoV-2 infection and COVID-19 severity
21. HLA allele frequencies and susceptibility to COVID-19 in a group of 99 Italian patients
22. Genomewide association study of severe Covid-19 with respiratory failure
23. HLA DNA typing: past, present, and future
24. Research electronic data capture (REDCap)-a metadata-driven methodology and workflow process for providing translational research informatics support
25. Performance of a multiplexed amplicon-based next-generation sequencing assay for HLA typing
26. High-resolution HLA typing by long reads from the R10.3 Oxford nanopore flow cells
27. PyHLA: tests for the association between HLA alleles and diseases
28. Controlling the false discovery rate: a practical and powerful approach to multiple testing
29. COVID-19 Hospitalization and Death by Race/Ethnicity
30. SARS-CoV-2 genomic variations associated with mortality rate of COVID-19
31. Association between HLA gene polymorphisms and mortality of COVID-19: an in silico analysis
32. Correlation of the two most frequent HLA haplotypes in the Italian population to the differential regional incidence of Covid-19
33. Balancing selection and heterogeneity across the classical human leukocyte antigen loci: a meta-analytic review of 497 population studies
34. The HLA class I A locus affects susceptibility to type 1 diabetes
35. Retrospective in silico HLA predictions from COVID-19 patients reveal alleles associated with disease prognosis. medRxiv
36. Identification and validation of 174 COVID-19 vaccine candidate epitopes reveals low performance of common epitope prediction tools
37. Preliminary identification of potential vaccine targets for the COVID-19 Coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies
38. In silico identification of vaccine targets for 2019-nCoV
39. A sequence homology and bioinformatic approach can predict candidate targets for immune responses to SARS-CoV-2
40. Binding affinities of 438 HLA proteins to complete proteomes of seven pandemic viruses and distributions of strongest and weakest HLA peptide binders in populations worldwide
41. Class I HLA allele predicted restricted antigenic coverages for spike and nucleocapsid proteins are associated with deaths related to COVID-19
42. Possible role of HLA class-I genotype in SARS-CoV-2 infection and progression: a pilot study in a cohort of Covid-19 Spanish patients
43. Mortality in COVID-19 disease patients: correlating the association of major histocompatibility complex (MHC) with severe acute respiratory syndrome 2 (SARS-CoV-2) variants

Additional supporting information may be found online in the Supporting Information section at the end of this article.