key: cord-0000306-tkxpjlyn
authors: Dermody, Terence S.; Kirchner, Eva; Guglielmi, Kristen M.; Stehle, Thilo
title: Immunoglobulin Superfamily Virus Receptors and the Evolution of Adaptive Immunity
date: 2009-11-26
journal: PLoS Pathog
DOI: 10.1371/journal.ppat.1000481
sha: 28cfe86a8fa13dacd3f447dcb095ac9a9f99a33a
doc_id: 306
cord_uid: tkxpjlyn

Obligate intracellular pathogens depend on cell-surface molecules to attach to and enter host cells. Pathogen receptors may be highly specialized proteins, such as complement receptors or neurotransmitter receptors, or more ubiquitous components of cell membranes, such as integrins or sialic acid-containing oligosaccharides. The immunoglobulin superfamily (IgSF) of molecules contains several members that are expressed at the cell surface, bind diverse ligands, and contribute to a variety of cellular activities, including adhesion and immune responses. Many viruses have usurped the adhesive properties of IgSF proteins to mediate attachment (Table 1). Strategies used by viruses to engage IgSF receptors provide clues to general mechanisms by which IgSF proteins bind different types of ligands, including antigens.

Members of the IgSF have diverged in sequence and function. However, all contain domains with the characteristic immunoglobulin fold, which is defined by two opposing antiparallel β-sheets connected in a unique manner [1, 2]. The core of the immunoglobulin fold is formed by four β-strands (B, C, E, and F) augmented with three to five additional β-strands (A, C′, C″, D, and G) to yield several distinct subtypes [1, 2]. Most common are the V-set and C-set immunoglobulin domains, which are named according to their occurrence in the variable and constant regions of immunoglobulins, respectively. A third type, the I-set, is an intermediate structure between the V- and C-sets found frequently in cell-surface receptors.
Immunoglobulin domains rarely occur in isolation but typically form concatenated chains, often with a V-set or I-set domain at the N-terminus.

Biochemical and structural analyses of interactions between viruses and their cognate IgSF receptors reveal several striking similarities. First, in cases in which structural information about virus-receptor complexes is available, the viral attachment proteins exclusively bind to the most membrane-distal, N-terminal domain (D1) of the IgSF receptors [3–10]. While structural information about complex formation is lacking for the IgSF receptors carcinoembryonic antigen-related cell adhesion molecule, nectin-1, nectin-2, and signaling lymphocyte-activation molecule (SLAM), biochemical studies also implicate their respective D1 domains in virus binding [11–14]. Second, virus-contacting residues lie towards the upper "tip" of the IgSF D1 domain. Third, the viral receptor-binding region engages the CC′FG β-sheet of the IgSF receptor D1 domain. Fourth and finally, almost all of the receptor domains interacting with viruses belong to the V-type IgSF fold. The single exception, the D1 domain of ICAM-1, belongs to the I-set type, which is structurally similar to the V-set domain.

Although the database of viral proteins in complex with IgSF receptors is still quite small, interactions of viruses with their receptors parallel the recognition mode of immunoglobulins, which also recognize their cognate antigens via residues at the tip of their N-terminal, V-set domains. The case of the receptor-binding head domain of reovirus attachment protein σ1 in complex with the D1 domain of its receptor, junctional adhesion molecule-A (JAM-A) [9], serves to illustrate this point (Figure 1A). The JAM-A homodimer strikingly resembles the dimer formed by the V-set domains of the light and heavy chains of immunoglobulins. In both structures, the two V-set domains face each other with similar orientations.
Moreover, residues in the receptor required for virus attachment reside in β-strands and intervening loops that juxtapose the complementarity-determining regions (CDRs) of antibody molecules. Thus, residues known to interact with ligands map to corresponding regions near the tip and one side of the V-set domains.

These similarities extend beyond reovirus receptor JAM-A. Other IgSF virus receptors, such as the coxsackievirus and adenovirus receptor (CAR) [5] and HIV receptor CD4 [4], also recognize their viral ligands via residues that partially overlap with the CDR region of immunoglobulins (Figure 1B–F). CAR forms a homodimer via its D1 domain that is very similar to the JAM-A homodimer [15]. CD4 also forms homodimers, albeit via its D4 domain [16].

The immunoglobulin fold predates the evolution of vertebrates. Genomes of invertebrate organisms encode numerous molecules that belong to two families with homologs in vertebrates: the JAM/cortical thymocyte marker of Xenopus (CTX) family and the nectin family [17]. Vertebrate counterparts of these genes are found in discrete blocks, and many are now diversified to encode molecules that function in adaptive immunity, including CD3 and SLAM [17]. Invertebrates do not encode recombination-activating genes (RAGs) and generally display only limited antigen-specific immunity. Therefore, the core structural element of adaptive immunity, the immunoglobulin fold, evolved prior to a mechanism to generate a highly diversified antigen-specific repertoire.

Similarities in mechanisms of ligand engagement by IgSF pathogen receptors and immunoglobulins, coupled with the evolution of the immunoglobulin fold prior to the existence of the vertebrate adaptive immune system, suggest the possibility that primitive members of the JAM/CTX and nectin families evolved to become soluble adaptive immune mediators in modern vertebrates.
One attractive hypothesis is that soluble forms of pathogen receptors served as precursors to molecules of the adaptive immune system. Soluble receptors would neutralize viral [18]. Expression of a soluble pathogen receptor followed by duplication within the primitive genome and acquisition of mutations that permitted recognition of additional pathogens could confer a strong selective advantage. Upon introduction of RAGs into the vertebrate genome, such a gene family would have been primed to express molecules akin to present-day immunoglobulins. Alternatively, membrane-anchored forms of IgSF molecules that arose in primitive invertebrates may have been maintained in the genome due to their cell-adhesion functions, followed by the serendipitous introduction of mechanisms for the secretion and generation of diversity. In this scenario, pathogens may have contributed to the evolution of the modern adaptive immune system at much later evolutionary times.

Is there evidence that favors either of these potential evolutionary mechanisms? In addition to similarities in their ligand-binding strategies, many of the closest structural homologs of JAM-A are immunoglobulins, which raises the possibility that immunoglobulins are more closely related to JAM-A than to other IgSF molecules. A search for structural homologs of the JAM-A D1 domain using the Dali algorithm [19] provides support for this hypothesis. The closest structural homologs of the JAM-A D1 domain are immunoglobulin domains, with the highest Dali Z-score of 14.6 for an IgAκ variable domain (PDB code 2FBJ) (Table 2). Other IgSF proteins with similarity to JAM-A D1 have significantly lower Z-scores. The Z-scores correlate well with root mean square deviations for superpositions of JAM-A D1 with immunoglobulins, which also are lower (i.e., more similar) than the corresponding values for superpositions of JAM-A D1 with other IgSF proteins.
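The root mean square deviation used in such comparisons can be illustrated with a minimal sketch. It assumes that the two structures have already been superimposed and that equivalent atoms (for example, Cα atoms) have been paired, as a structural alignment program would do. The function name `rmsd` and all coordinates are hypothetical toy values chosen for illustration, not data from Table 2.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between paired (x, y, z) points.

    Assumes the two coordinate sets are already superimposed and that
    point i in coords_a is structurally equivalent to point i in coords_b.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


# Hypothetical toy coordinates (in angstroms), not taken from any PDB entry.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
close_match = [(0.1, 0.0, 0.0), (1.0, 0.1, 0.0), (0.0, 1.0, 0.1)]
distant_match = [(1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (0.0, 1.0, 1.0)]

# A lower value means the paired atoms lie closer together after superposition.
print(rmsd(reference, close_match))
print(rmsd(reference, distant_match))
```

In this toy case the first comparison yields the smaller deviation, which corresponds to the more similar pair of structures.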
This homology search can be extended to CAR, neural cell adhesion molecule, and nectin-like molecule 1, for which the Z-scores are generally higher for the superposition of their D1 domains with immunoglobulins than with other cell adhesion molecules.

In urochordates (Ciona) and cephalochordates (Branchiostoma), evolutionarily close relatives of the vertebrates, there are homologs of JAM/CTX and nectin IgSF molecules with features of membrane receptors. Ciona encodes only a single JAM/CTX-like molecule and two nectin-like molecules [20]. In humans, these molecules are all part of a single linkage group involved in immune function [17, 20]. Taken together, these results suggest that relatively few JAM/CTX and nectin family IgSF molecules were maintained in invertebrates, and the expansion and duplication resulting in the evolution of immunoglobulins may have occurred after the introduction of these molecules into the vertebrate genome.

There also is evidence of expansion of IgSF molecules in invertebrates. For example, like many immunoglobulins, chitin-binding protein (CBP) of Branchiostoma is a close structural homolog of JAM-A (Table 2). Variable region-containing CBPs (VCBPs) contain a V-type immunoglobulin domain with extensive sequence diversity in the N-terminal region [21, 22]. This diversity is thought to result from high haplotype variation, including variable copy number, polymorphisms, and potential for alternative splicing [23]. Another of the closest structural homologs of JAM-A is Down syndrome cell adhesion molecule (Dscam), an IgSF member of the more evolutionarily distant invertebrate Drosophila (Table 2). Dscam is an immune mediator whose gene contains clusters of variable exons flanked by constant exons [24, 25]. Thousands of different Dscam molecules can be generated via alternative splicing, a mechanism that is highly conserved across insect orders [26].
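The scale of this diversity follows from simple multiplication: if one alternative is chosen independently from each variable exon cluster, the number of possible isoforms is the product of the cluster sizes. The sketch below works through that arithmetic. The exon labels and the counts per cluster (12, 48, 33, and 2) are not stated in this article; they are assumed here from the published description of the Drosophila Dscam gene and should be checked against the primary literature.

```python
from math import prod

# Number of mutually exclusive alternatives in each variable exon cluster.
# Assumed values for Drosophila Dscam; not given in the text above.
variable_exon_alternatives = {
    "exon 4": 12,
    "exon 6": 48,
    "exon 9": 33,
    "exon 17": 2,  # assumed to encode alternative transmembrane segments
}

# Isoforms that differ in the extracellular immunoglobulin domains only.
ectodomain_isoforms = prod(
    count
    for exon, count in variable_exon_alternatives.items()
    if exon != "exon 17"
)

# All combinations, assuming one alternative is chosen independently per cluster.
total_isoforms = prod(variable_exon_alternatives.values())

print(ectodomain_isoforms)
print(total_isoforms)
```

Under these assumptions the product reaches the tens of thousands, which is consistent with the statement that thousands of distinct molecules can be produced from a single gene.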
Secreted isoforms of Dscam circulating in insect hemolymph contribute to phagocytic uptake of bacteria. While the structural similarities between JAM-A and VCBP or Dscam may not indicate a direct evolutionary relationship, these examples make clear that diversification and secretion of soluble forms of IgSF molecules can occur in invertebrates, and they raise the possibility that pathogens have had selective influence on the diversification and secretion of these molecules. Thus, IgSF proteins that served as precursors to soluble adaptive immune effectors may have diversified both prior to and following their introduction into the vertebrate genome. A more thorough examination of IgSF members in invertebrates may clarify mechanisms that led to the evolution of modern adaptive immune mediators and the role of JAM/CTX family molecules in this evolutionary process.

The evolution of JAM family members prior to the biochemical means to efficiently and extensively diversify antigen receptor genes, along with the structural similarities in the binding surfaces of virus receptors and immunoglobulins, provides strong support for the contention that viruses and perhaps other pathogens that engage IgSF receptors contributed to the selection of humoral mediators of adaptive immunity. These observations provide a new framework for understanding how pathogen-host interplay during a prolonged period of evolutionary struggle may have led to the development of antigen-specific immune responses in vertebrates.

We thank Jim Chappell and the PLoS Pathogens reviewers for insightful suggestions and critique of the manuscript.
[1] The immunoglobulin fold: structural classification, sequence patterns and common core
[2] Many of the immunoglobulin superfamily domains in cell adhesion molecules and surface receptors belong to a new structural set which is close to that containing variable domains
[3] The structure of the two amino-terminal domains of human ICAM-1 suggests how it functions as a rhinovirus receptor and as an LFA-1 integrin ligand
[4] Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing antibody
[5] Structural analysis of the mechanism of adenovirus binding to its human cellular receptor
[6] Three-dimensional structure of poliovirus receptor bound to poliovirus
[7] Interaction of coxsackievirus B3 with the full length coxsackievirus-adenovirus receptor
[8] Interaction of coxsackievirus A21 with its cellular receptor, ICAM-1
[9] Structure of reovirus σ1 in complex with its receptor junctional adhesion molecule-A
[10] Crystal structure of CD155 and electron microscopic studies of its complexes with polioviruses
[11] Mouse hepatitis virus strain A59 and blocking antireceptor monoclonal antibody bind to the N-terminal domain of cellular receptor
[12] The first immunoglobulin-like domain of HveC is sufficient to bind herpes simplex virus gD with full affinity, while the third domain is involved in oligomerization of HveC
[13] V domain of human SLAM (CDw150) is essential for its function as a measles virus receptor
[14] Soluble V domain of Nectin-1/HveC enables entry of herpes simplex virus type 1 (HSV-1) into HSV-resistant cells by binding to viral glycoprotein D
[15] Structural similarities in the cellular receptors used by adenovirus and reovirus
[16] Dimeric association and segmental variability in the structure of human CD4
[17] Immunoglobulin superfamily receptors in protochordates: before RAG time
[18] Human rhinovirus selectively modulates membranous and soluble forms of its intercellular adhesion molecule-1 (ICAM-1) receptor to promote epithelial cell infectivity
[19] Searching protein structure databases with DaliLite v
[20] Speculations on the origin of the vertebrate immune system
[21] Identification of diversified genes that contain immunoglobulin-like variable regions in a protochordate
[22] Ancient evolutionary origin of diversified variable regions demonstrated by crystal structures of an immune-type receptor in amphioxus
[23] Genomic complexity of the variable region-containing chitin-binding proteins in amphioxus
[24] Structural basis of Dscam isoform specificity
[25] Drosophila Dscam is an axon guidance receptor exhibiting extraordinary molecular diversity
[26] Extensive diversity of Ig-superfamily proteins in the immune system of insects
[27] Refined structure of an intact IgG2a monoclonal antibody
[28] Crystal structure of human junctional adhesion molecule 1: implications for reovirus binding
[29] Kinetic and structural analysis of mutant CD4 receptors that are defective in HIV gp120 binding
[30] Dimeric structure of the coxsackievirus and adenovirus receptor D1 domain at 1.7 Å resolution
[31] Isolation of a common receptor for Coxsackie B viruses and adenoviruses 2 and 5
[32] HCAR and MCAR: the human and mouse cellular receptors for subgroup C adenoviruses and group B coxsackieviruses
[33] Receptor for mouse hepatitis virus is a member of the carcinoembryonic antigen family of glycoproteins
[34] Mouse hepatitis virus utilizes two carcinoembryonic antigens as alternative receptors
[35] Several members of the mouse carcinoembryonic antigen-related glycoprotein family are functional receptors for the coronavirus mouse hepatitis virus-A59
[36] Entry of alphaherpesviruses mediated by poliovirus receptor-related protein 1 and poliovirus receptor
[37] A cell surface protein with herpesvirus entry activity (HveB) confers susceptibility to infection by mutants of herpes simplex virus type 1, herpes simplex virus type 2, and pseudorabies virus
[38] The CD4 (T4) antigen is an essential component of the receptor for the AIDS retrovirus
[39] The T4 gene encodes the AIDS virus receptor and is expressed in the immune system and the brain
[40] SLAM (CDw150) is a cellular receptor for measles virus
[41] Cellular receptor for poliovirus: molecular cloning, nucleotide sequence, and expression of a new member of the immunoglobulin superfamily
[42] The neural cell adhesion molecule is a receptor for rabies virus
[43] Junction adhesion molecule is a receptor for reovirus
[44] The major human rhinovirus receptor is ICAM-1
[45] A cell adhesion molecule, ICAM-1, is the major surface receptor for rhinoviruses
[46] cDNA cloning reveals that the major group rhinovirus receptor on HeLa cells is intercellular adhesion molecule 1