key: cord-1021758-0262bm3z authors: Wang, Denong; Wu, Albert M. title: Chapter 7 Carbohydrate Microarrays for Lectin Characterization and Glyco-Epitope Identification date: 2007-12-31 journal: Lectins DOI: 10.1016/b978-044453077-6/50008-9 sha: 22ad89ce56ad3854bd07b0e695723b4f194581d9 doc_id: 1021758 cord_uid: 0262bm3z Publisher Summary Lectins can also be classified according to their binding specificity or selectivity with carbohydrates. This type of classification is helpful for selection of lectins as structural probes in biomedical applications. This chapter summarizes the concept and the updated information regarding the specificity-based lectin classification. Lectins are functionally classified based on their relative binding reactivities with the structural units of carbohydrate or glyco-epitopes. They are grouped according to their monosaccharide specificities and then further sub-grouped based on their reactivities with more complex structures. Carbohydrate specificities of biomedically important lectins are classified into six groups according to their specificities to monosaccharides. The chapter introduces a practical platform of carbohydrate microarrays that is useful for lectin characterization and classification. Finally, the chapter discusses a few examples to illustrate the application of this technology in lectin-related experimental investigations. Lectins are an important class of proteins or glycoproteins of non-immune origin that bind non-covalently to characteristic carbohydrate structures with specificity or selectivity. There are many ways to classify lectins. Given the fact that a wide range of living organisms, from microbes to mammals, produce lectins, they can be divided according to species origin, such as microbial lectins, plant lectins, invertebrate lectins and vertebrate lectins. Based on the similarities in sequence of homology and activity, lectins can also be grouped into subfamilies. There are at least five groups of vertebrate lectins, such as C-type (calcium dependent), P-type, S-type, I-type, and pentraxins [1] . Cellular lectin molecules or receptors with lectin-like carbohydrate-binding domains play important roles in cell signaling, protein intracellular trafficking, and cell-cell communication [2] [3] [4] [5] . Lectins can also be classified according to their binding specificity or selectivity with carbohydrates. This type of classification is helpful for selection of lectins as structural probes in biomedical applications. By the early seventies, the carbohydrate-binding activity of a lectin was commonly described in terms of monosaccharide specificity. Experimentally, a type of inhibition assay, such as agglutination inhibition, was performed to identify the monosaccharide that most effectively inhibited the binding reaction. Many lectins are able to cross-react with a panel of sugar chains with a common terminal sugar residue. Lately, disaccharides are considered to be a better way to define a lectin's specificity. However, some lectin-binding sites accommodate glyco-epitopes larger than mono-and disaccharides and recognize conformational epitopes. The latter may consist of much longer oligosaccharide or polysaccharide chains. There is also evidence that lectins recognize certain cluster configurations of complex carbohydrates with high selectivity and avidity. X-ray crystallographic studies identified some common characteristics in the binding pockets of lectins and antibodies [2, 6, 7] . Aromatic amino acid residues appear frequently in carbohydrate-binding sites of various fine specificities. Such residues are large and participate in a wide variety of van der Waals and electrostatic interactions. The majority of crystal structures so far resolved are associated with a network of H-bonds, frequently with water molecules, in their combining sites. Such bound water molecules play important roles at the contacting interface between carbohydrate ligands and receptors. In addition, polyamphiphilic surfaces were identified in the interfaces of a number of carbohydrate ligands and binding sites. Presence of relatively rigid structures is required to produce an amphiphilic surface in solution. Carbohydrate molecules with branched termini, such as many blood group substances, which are formed by oligosaccharide chains, are favorable for the generation of such contact surfaces. Much remains to be learned regarding the specificity and cross-reactivity of the carbohydrate-lectin interactions and their functional outcome in biological systems. Challenging issues in lectin characterization may include but are not limited to (a) the spectrum or repertoire of carbohydrate structures that are reactive with given lectins, (b) conformational properties of glyco-epitopes that are selectively recognized by lectins, and (c) the cluster effect in lectin-carbohydrate interactions, especially those that take place on cell surfaces and trigger the events of intracellular cell signaling, cell differentiation and proliferation. Recent development of microarray-based broad-range binding assays offers new tools to facilitate these investigations. In this chapter, we summarize the concept and the updated information regarding the specificity-based lectin classification (Section 2). Then, we introduce a practical platform of carbohydrate microarrays that is likely useful for lectin characterization and classification (Section 3). Lastly, we discuss a few examples to illustrate the application of this technology in lectin-related experimental investigations (Section 4). Lectins are functionally classified based on their relative binding reactivities with the structural units of carbohydrate or glyco-epitopes. They are grouped according to their monosaccharide specificities and then further sub-grouped based on their reactivities with more complex structures. As listed below, carbohydrate specificities of biomedically important lectins are classified into six groups according to their specificities to monosaccharides. They are further subgrouped based on the binding affinities to (a) GalNAc 1 ; O to Ser(Thr) of the peptide chain; (b) disaccharides; (c) trisaccharides; and (d) the number and the location of LFuc 1 ; linked to oligosaccharides. These structures are frequently found in soluble glycoproteins and as cell surface glycoconjugates in mammals [8] [9] [10] [11] . This scheme of lectin classification is, thus, practically useful in biomedical application of lectins. (1) F/A, GalNAc 1 ; 3GalNAc (Forssman) and GalNAc 1 ; 3Gal (blood group A determinant disaccharide) specific agglutinins -Dolichos biflorus (DBA) and Helix pomatia (HPA). (2) A, GalNAc 1 ; 3Gal (blood group A determinant disaccharide) -specific agglutinins -Soybean (SBA) and Lima bean (LBA). (3) Tn, GalNAc 1 ; Ser/Thr-specific agglutinins -Vicia villosa B 4 (VVL-B 4 ) and Salvia sclarea (SSA). (1) T, Gal 1 ; 3GalNAc (T , Gal 1 ; 3GalNAc 1 ; the mucin-type sugar sequence on human erythrocyte membrane or T , Gal 1 ; 3GalNAc 1 ; at the terminal non-reducing end of the gangliosides) specific agglutinins -Peanut (PNA), and Bauhinia purpurea alba (BPA). (2) I/II, Gal 1 ; 3(4)GlcNAc 1 ; (Lacto-N-biose/N-acetyllactosamine) specific agglutinins (Human blood group type I (Gal 1 ; 3GlcNAc) and type II (Gal 1 ; 4GlcNAc) carbohydrate sequences [16] [17] [18] , the disaccharide residues at the non-reducing end of the carbohydrate chains derived from either N-glycosidic or O-glycosidic linkages.) -Ricinus communis agglutinin (RCA 1 ), Datura stramonium (thorn apple, TAA) and wheat germ agglutinin (WGA). (3) B, Gal 1 ; 3Gal (Human blood group B disaccharide-Griffonia (Bandeiraea) simplicifolia B 4 (GSI-B 4 ). (4) E, Gal 1;4Gal, blood group P k active disaccharide -Abrin-a and Mistletoe lectin-I (ML-I). Man-linked oligosaccharide-specific agglutinins -Concanavalin ensiformis (Jack bean, ConA) and Lens culinaris (LCA). Chitin oligosaccharide-specific agglutinins -WGA and Griffonia (Bandeiraea) simplicifolia II (GSA-II). (1) Monofucosyl-specific agglutinins -Ulex europaeus I (UEA-I) and UEA-II. (2) Difucosyl-specific agglutinins -Griffonia (Bandeiraea) simplicifolia IV (GSA-IV). ) for Gal-and GalNAc-specific lectins, which are shown in Fig.1 as part of carbohydrate structures, are shown in Fig. 2 and Table 1 . Most of the lectin F and T determinants are found in glycosphingolipids [12] [13] [14] , and the other determinants are present in mammalian glycoproteins [9, [15] [16] [17] [18] [19] , especially in the human blood group A, B, H, Le a , Le b , and Ii active glycoproteins prepared from human ovarian cyst fluid [15, 18] and salivary glycoproteins [20] . The reactivities of lectin determinants represent a combined result of the binding of individual sugars. The contribution of each sugar to the binding is not necessarily equal, and is different among lectins, for example, both Amaranthus caudatus (ACL) and PNA are Gal 1;3GalNAc specific [8, 21] . However, the inhibitory profile of the monosaccharides with these two lectins is quite different; Figure 2 . Mammalian structural units (lectin determinants) present in glycoconjugates. Lectin F determinant is found in glycosphingolipid (the principal glycolipid of mammalian tissues) of the tissues of the guinea pig, horse, cat, and chicken. It can also be found on the surface of some bacteria, viruses, human gastric carcinoma, and colon tumors [14, 15, 18, 38] . All (F, A/A h , B, I/II, T, Tn) determinants are found in human blood groups A, B, H, Le a , Le b , and Ii active glycoconjugates prepared from human ovarian cyst fluid (HOC) [16, 18, 22, 42] . The Gal 1;3GalNAc group at the non-reducing end of asialo-GM 1 is also considered as a lectin T determinant. Determinants of lectins I and II discussed in this article are equivalent to human blood group type I (Lacto-N-biose, Gal 1;3GlcNAc) and type II (N-acetyllactosamine) carbohydrate sequences [39] . S determinant is found in human Tamm II β (1) Galβ1→ 3GlcNAcβ1→ 3Galβ1→ 3GlcNAcβ1→ 3Galβ1→ 3GalNAcα1→ Ser or Thr [40, 41] . C: Chitin disaccharide. (d) Representative carbohydrate chains of blood group glycoprotein from human ovarian cyst fluids. The four-branched internal structure (I-IV) shown is the representative internal portion of the carbohydrate moiety of blood group substances to which the residues responsible for A, B, H, Le a , and Le b activities are attached. This structure represents precursor blood group active glycoproteins [16] and can be prepared by Smith degradation of A, B, H active glycoproteins, purified from human ovarian cyst fluids. Numbers in parentheses indicate the site of attachment for the human blood group A, B, H, Le a , and Le b determinants. These determinants as well as the structural units at the non-reducing end are the sources of lectin A/A h , B/B h , I/II, T, and Tn determinants in (a) [9] . A megalosaccharide of 24 sugars has not been isolated. However, most of the carbohydrate chains isolated are parts of this structure. GalNAcϾϾϾGal (inactive) in ACL and GalϾϾGalNAc (poor) in PNA. Both RCA 120 and WGAs are Gal 1;4GlcNAc 1;6 specific, and the inhibitory profile of the monosaccharide residues with these two lectins is also very different: GalϾϾϾGlcNAc in RCA 120 and GlcNAc 1; ϾϾ Gal in wheat germ [8, 9] . From the data available, expressions of the Gal/GalNAc-specific lectins based on mammalian structural units [8] [9] [10] [11] [22] [23] [24] are shown in Table 2 . During the past two decades, it has been established that many multi-branched oligosaccharides exhibit a significant increase in lectin-binding reactivities as compared to their linear counterparts [25, 26] , "the glycoside cluster effect" in mammalian hepatic asialoglycoprotein receptors [25] is one of the important findings in this field. Effect of polyvalencies of glyco-epitopes on the binding of a lectin from the edible mushroom, Agaricus bisporus [27] is another important example, in which the disaccharide II monomer and tri-antennary Gal 1-4GlcNAc (Tri-II) glycopeptides, the major carbohydrate side chains of Pneumococcus type 14 ps, asialo fetuin and asialo human 1 -acid glycoprotein, were poor inhibitors, while their polyvalent carriers were very active, implying that there are other more complicated structural factors involved in binding besides the multi-antennary II sequences. To explain this phenomenon, the present concept of glycoside cluster effect has to be further defined and classified into two groups -(a) the "Multiantennary or simple glycoside cluster effect" as in galactosides with hepatic lectin (12) Cyst JS phenol insoluble H LFucα1→ 2 (5), (7), (9), (11) Tighe phenol insoluble LFucα1→ 2 (5), (7), (9) and/or (11) H, Le y H, Le x > Le b and others as in Le a (6) , (8), (12) A 1 or A 2 MSS, native GalNAcα1→3 (1), (2), (3) and/or (4) and others as in MSM H, Le b and Le y Cyst 9 in addition to Cyst 14 (5) to (12) Cyst Beach phenol insoluble B Galα1→3 and Same as H Le b and Le y (1), (2), (3) and/or (4), in addition to (5) to (12) [25, 26] and tri-antennary II with a galectin from chicken liver (CG-16) [28] as well as (b) the "High-density polyvalent or complex glycoside cluster effect" as in macromolecular interaction of high-density II cluster containing glycoproteins (Pneumococcus type 14 ps and asialo human 1 -acid glycoproteins) with ABA. The clustered glyco-epitopes present on macromolecules generate a great enhancement in affinity with ABA up to 4.7 ϫ 10 6 times as compared to their monomeric counterparts, and demonstrate the structural importance of complex carbohydrates. To obtain a complete description of the carbohydrate specificities of the applied lectins, the following information is suggested to be necessary -(1) monosaccharide specificity (Gal, GalNAc, GlcNAc, and/or Man); (2) expression of reactivities toward mammalian disaccharide structural units or their derivatives (Table 1) in decreasing order ( Table 2) ; (3) the most active ligand; (4) simple multivalent or cluster effect, such as glycopeptides and multi-antennary glyco-epitopes to inhibit binding; (5) complex multivalent or cluster effects present in macromolecules with known glyco-epitopes. In this report, we illustrate only, as shown in Table 2 , the abilities of mammalian disaccharide structural units or their derivatives to inhibit lectin-glycan binding and express it in decreasing order. During the past years, many cases of the effect of polyvalency on binding have been found [28] [29] [30] [31] [32] [33] [34] [35] . However, the available data are insufficient to make a solid conclusion, therefore, it is not included in this review. Due to the multiple reactivity of lectins toward mammalian glyco-epitopes (Table 2 ), the possible existence of different combining sites or subsites in the same molecule has to be examined, and the differential binding properties of these combining sites (if any) have to be characterized. Recent advances in characterization, cloning, and structural analysis have allowed us to classify plant lectins into seven families of structurally and evolutionarily related proteins. Within each lectin family the overall profile and structure of the carbohydratebinding site(s) are conserved. A closer examination of the carbohydrate specificity further indicates that most plant lectins are not targeted against plant carbohydrates but preferentially bind foreign glycans [36, 37] . Establishing the relationship between the amino acid sequences of the combining sites of plant lectins and mammalian glyco-epitopes should be an important direction to be addressed in lectinology. Our laboratories have established a high-throughput biochip platform for constructing carbohydrate microarrays [42] [43] [44] . Using this technology, carbohydratecontaining macromolecules of diverse structures, including polysaccharides, natural glycoconjugates, and mono-and oligosaccharides coupled to carrier molecules, can be stably immobilized on a glass chip without chemical modification. This technology takes advantage of existing cDNA microarray system, including spotter and scanner, for an efficient production and application of carbohydrate microarrays. We have demonstrated that this current platform is able to overcome a number of expected technical difficulties, by proving that (1) carbohydrate molecules can be immobilized on a nitrocellulose-coated glass slide without chemical conjugation, (2) the immobilized carbohydrates are able to preserve their immunological properties and solvent accessibility, (3) the system reaches the sensitivity, specificity, and capacity to detect a broad range of antibody specificities in clinical specimens, and (4) this technology allows highly sensitive detection, as compared to other existing technologies, of the broad range of carbohydrate-lectin/antibody interactions. In this section, we provide a practical protocol for this high-throughput carbohydrate microarray system. We focus on an eight-chamber sub-array system that is in active use in our laboratory's routine carbohydrate research. It is our wish that the readers who have access to a standard cDNA microarray facility would be able to explore this technology for their own carbohydrate research. The methods described below outline (1) design and construction of an eightchamber sub-array system for defined purposes, (2) microspotting carbohydratecontaining molecules onto nitrocellulose-coated glass slides, (3) immunostaining and scanning of microarrays, (4) analysis of microarray data, and (5) validation of microarray findings by conventional immunological assays. We have been applying an eight-chamber sub-array system to construct customized carbohydrate microarrays for defined purposes. As illustrated in Fig. 3 , each microglass slide contains eight separated sub-arrays. The microarray capacity is ϳ500 microspots per sub-array. A single slide is, thus, designed to enable eight microarray assays. Similar sub-array designs with various array capacities are commercially available (Schleicher & Schuell, Keene, NH; ArrayIt, Sunnyvale, CA). • Each microglass slide contains eight sub-arrays of identical content. There is chip space for 500 microspots per sub-array, with spot sizes of ϳ200 m and at 300 m intervals, center to center. A single slide is, therefore, designed to enable eight detections. • Repeats and dilutions: We usually print carbohydrate antigens at the initial concentration of 0.1-0.5 g/ l. The absolute amount of antigens printed on chip substrate is in the range of 0.1-0.25 ng per microspot for the highest concentration. They are further diluted at 1:3, 1:9, and 1:27. A given concentration of each preparation is repeated at least three times to allow statistic analysis of detection of identical preparation at given antigen concentration. • Standard curves: For serological study, we include antibodies of IgG, IgA, and IgM isotype of corresponding species to serve as standard curves in microarray format. This design allows quantifying antibody signals that are captured by spotted carbohydrate antigens. In addition, such standard curves are useful for microarray data normalization and cross-chip scaling of microarray detection. For lectin study, we spot fluorescent conjugates, such as BSA conjugates of FITC, Cy5, or other dyes, to generate standard curves. • Adjust program so that carbohydrate antigens are printed at spot sizes of ϳ150 m and at 300 m intervals, center to center. • Each antigen or antibody is spotted as triplet replicates in parallel. • The printed carbohydrate microarrays are air-dried and stored at room temperature (RT) without desiccant before application. The staining procedure for carbohydrate microarrays is basically identical to the routine procedure for immunohistology. Immunostaining steps of carbohydrate arrays are listed below. • Rinse printed microarray slides with 1 ϫ PBS, pH 7.4 with 0.05% Tween 20 at RT for 5 min. • Block slides with 1% BSA in PBS containing 0.05% NaN 3 at RT for 30 min. • Stain each sub-array with 50 l of test sample, which is diluted in 1% BSA PBS containing 0.05% NaN 3 and 0.05% Tween 20. • Incubate the slide in a humidified chamber at RT for 60 min. • Wash slides by dipping them in washing buffer (1 ϫ PBS, pH 7.4 with 0.05% Tween 20) five times with at least one-time incubation at RT for 5 min. • Stain slides with 50 l of titrated secondary antibodies. Anti-human (or other species) IgG, IgM, or IgA antibodies with distinct fluorescent tags, Cy5, PE or FITC, are mixed and then applied on the chips. • Incubate the slide in a humidified chamber, protected from light at RT for 30 min. • Wash slides five times as specified above. • Place slide in a 50 ml falcon centrifuge tube and spin at 1,000 rpm for 5 min to remove washing buffer. • Cover slides in a histology slide box to prevent fluorescent quenching of signal by light. • Scan microarray with ScanArray Microarray Scanner (PerkinElmer Life Science) following the manufacturer's instructions. • Fluorescence intensity values for each array spot and its background were calculated using Packard Bioscience's QuantArray software analysis packages or the updated ScanArray Express software. A staining result is considered positive if the mean fluorescent intensity value of microspot is significantly higher than the mean background of an identically stained microarray with the same fluorescent color. • Microarray data processing and statistical analysis: a number of advanced software packages are available for microarray data normalization, statistical analysis, and pattern-recognition-based advanced data-processing (http:// genome-www5.stanford.edu/resources/restech.shtml). We have been using Stanford University's Significance Analysis of Microarrays (SAM, http://www-stat.stanford.edu/ϳtibs/SAM/) and SAS Institute's JMP-Genomics, Proteomics, and Microarrays (http://www.jmp.com/). It is important to conceptually understand the functions of these bioinformatics tools in order to correctly interpret the results. It is important to verify microarray findings by other experimental approaches. We usually confirm our results by at least one of the alternative immunoassays, such as ELISA, dot blot, western blot, flow cytometry, or immunohistology. Detailed examples of such investigations have been described in our recent publications [42, 44] . However, given the fact that microarray assays are highly sensitive, a positive detection in a microarray may not necessarily be reproduced by other assays that are less sensitive than microarray assays. Under such circumstances, we usually repeat the detection by microarray assays to confirm the reproducibility of a positive result. The most important validation is, however, to further characterize the molecular targets that are discovered or suggested by microarray assays in a relevant biological system. This is discussed in the subsequent section. Carbohydrate microarrays share the technical advantages of miniaturized multiplex binding assays [45] [46] [47] [48] [49] [50] [51] . It is suitable for displaying a diverse panel of carbohydrates in a limited amount of chip space. It is "economical" in using carbohydrates since each saccharide is spotted on a microarray substrate in an amount (approximately sub-nanograms) that is drastically smaller than that which is required for conventional molecular or immunological assays (approximately submicrograms or more). Its detection sensitivity is also higher than many conventional molecular and immunological assays [43, 45, 48] , which is an intrinsic property of the microarray-based miniaturized assays. Theoretically, the improved detection sensitivity can be attributed to the fact that the binding in a microarray assay meets the so-called "ambient analyte condition" [45] . A key factor for achieving this assay condition is the use of "tiny" amounts of carbohydrate ligand in microarrays. For the microarray platform we discuss here, only sub-nanogram amounts of carbohydrates are immobilized in the microarray substrate. Binding to these miniaturized carbohydrate spots has no or minimum reduction of the concentration of lectins in the solution phase. Under such conditions, the carbohydrate-lectin interactions reach equilibrium rapidly and result in highly sensitive detection of lectins or anti-carbohydrate antibodies. Thus, carbohydrate microarray is, in principle, well suited for measuring the relative binding reactivities of lectins with a spectrum of diverse carbohydrate structures. A few specific examples are discussed below to illustrate its application in lectin characterization. As summarized in Section 2, the binding property of a lectin is expressed as its specific or selective reactivity with a number of glyco-epitopes that share various degrees of structural similarities or antigenic cross-reactivities. For examples, PHA-L and GS-I are Gal/GalNAc-specific lectins but have significant differences in their fine specificities ( Table 2 ). The binding property of PHA-L is expressed as "Tri-II and mII clusters Ͼ Penta-2,6 Ͼ Tri-2,6 Ͼ Hepta-3,6 Ͼ II Ͼ GlcNAc 1,2Man". The specificity of GS-IA 4 is defined as "F penta-Ͼ A h (GalNAc 1;2 (LFuc 1;3)Gal) Ͼ GalNAc Ͼ E Ͼ B Ͼ I, T Ͼ L, II; GSI-B 4 isolectin defined as "B Ͼ E Ͼ A" ( Table 2) . These lectins may serve as good models for testing the potential of microarray technology in lectin characterization. Therefore, we constructed a carbohydrate array to display a panel of 22 Gal/GalNAc-containing glycoconjugates for a "proof-of-concept" study. The eight-chamber sub-array system described in the Section 3 above was applied to produce this carbohydrate array. Each of carbohydrate preparations were spotted in four dilutions and with triplicate microspots for each dilution. This design is helpful for the quantitation and statistical analysis of microarray results. It also facilitates the graphical presentation or visualization of the binding profiles of lectins. We assumed that if this microarray platform reached the sensitivity for measuring the lectin binding and if it reflected well the results of conventional binding assays, we would expect to visualize the lectin-binding profiles by glycan arrays. We applied lectin PHA-L and GS-I at 0.5 g/ml for microarray staining, which is a titrated condition for cell or tissue section staining using these reagents [44] . The images of the stained microarrays are shown in Fig. 4 (I and II) for lectin PHA-L and GS-I, respectively. The content and location of carbohydrates are listed in the table under the microarray images. A dye marker (Cy5 conjugate) was spotted in triplicates at the bottom right corner of each array. This assay shows clearly that PHA-L is highly specific for asialoorosomucoid (ASOR) (array location C3) that expresses glyco-epitopes Tri-II and mII. Given that PHA-L has no staining of AGOR (array location A4), which is a D-galactose derivative of ASOR, the terminal non-reducing end Gal-residues are likely key elements for the specificities. In striking contrast to the highly selective binding of PHA-L, lectin GS-I shows a broad spectrum of carbohydrate-binding activities. It is reactive with seven different Gal/GalNAc-containing glycoconjugates. They are Tn-HAS, Asialo-PSM, B-dimer-BSA, Asialo-OSM, Beach OH, Tij20%fr. 2nd 10%, and Beach P1 OH insoluble. These carbohydrate preparations display a number of well-characterized glyco-epitopes, including Tn, T, I, i, B, and Bh. It is interesting to compare the microarray-binding profile of GS-I with the specificities of its isolectins. GS-I is known as a tetramer consisting of subunit A and subunit B in different ratios. GS-IA 4 and GS-IB 4 have equal binding affinity for -galactose end groups. However, the A 4 isolectin has a greater affinity for -GalNAc than the B 4 isoform. The natural product of GS-I is a mixture of the five isolectins, A 4 , A 1 B 3 , A 2 B 2 , A 3 B 1 , and B 4 . Such molecular diversification may extend the spectrum of carbohydrate-binding profile of GS-I. Some but not all the GS-I reactivities detected by microarray assay can be directly attributed to its subunit A and B specificities that were determined by previous studies. For example, its binding to Tn-containing carbohydrates (location C1 and C2) reflects the affinity of A subunit for -GalNAc. Both A and B subunits are likely responsible for the binding to the terminal Gal in B, T, I, and i glyco-epitopes. However, this preparation of GS-I has no binding to cyst 9 (blood group A/location B 4 ) and Hog 4 (blood group A/location D 4 ). This is unexpected given that its A subunit is specific for the A-glyco-epitopes. It is also 186 D. Wang not clear why this lectin is strongly reactive with asialo-OSM that displays T-glyco-epitopes but is completely negative to T-HAS. A possible interpretation of this result is that asialo-OSM displays the T-glyco-epitopes in a specific cluster configuration that substantially facilitates the binding to GS-I. By contrast, the T-glyco-epitopes of T-HAS are in the unfavorable configuration for GS-I. It is important to further characterize the specificities of individual GS-I isolectins (A 4 , A 1 B 3 , A 2 B 2 , A 3 B 1 , and B 4 ). Using carbohydrate microarrays of extended structural diversity, in combination with saccharide inhibition or blocking assays, may provide more information to better understand the selective binding to specific cluster of glyco-epitopes. Table 1 or with its structural key elements. Lectins and anti-carbohydrate antibodies of defined specificities are useful probes for complex carbohydrates expressed by living organisms. We present, here, an integrated strategy using these probes to facilitate the identification of immunogenic sugar moieties that are expressed by microbial pathogens. This involves three steps of experimental investigation. The first step is to perform carbohydrate microarray characterization of the antibody responses to an infectious agent or antigen preparation in order to recognize the disease-or pathogenassociated anti-carbohydrate antibody specificities. The second step focuses on identification of lectins and/or antibodies that are specific for the glyco-epitopes that are recognized by the pathogen-elicited antibodies. This provides specific structural probes to enable the third step of investigation, that is, to identify the glyco-epitopes in the candidate pathogens using specific lectins or antibodies identified by Steps 1 and 2. We applied this strategy to explore the glyco-epitopes expressed by a previously unrecognized viral pathogen, SARS-CoV [52] [53] [54] . Our rationale was that if SARS-CoV expressed antigenic carbohydrate structures, then immunizing animals using the whole virus-based vaccines would elicit antibodies specific for these structures. In addition, if SARS-CoV displayed a carbohydrate structure that mimics host cellular glycans, then vaccinated animals may develop antibodies with autoimmune reactivity to their corresponding cellular glycans. By characterizing the SARS-CoV neutralizing antibodies elicited by an inactivated SARS-CoV vaccine, we detected autoantibody reactivity specific for the carbohydrate moieties of an abundant human serum glycoprotein ASOR [44] . This surveillance provides important clues for the selection of specific immunologic probes to further examine whether SARS-CoV expresses antigenic structures that imitate the host glycan. As shown in the Fig. 4 above, lectin PHA-L is specific for glyco-epitopes Tri-II or mII of ASOR. Using PHA-L as a structural probe, we confirmed that SARS-CoV expresses the PHA-L reactive antigenic structure (see Ref. [44] for details). We, thus obtained, immunological evidence that a carbohydrate structure of SARS-CoV shares antigenic similarity with host glycan complex carbohydrates. This viral component is probably responsible for the stimulation of the autoantibodies directed at a cellular glycan complex carbohydrate. These observations raise important questions about whether autoimmune responses are in fact elicited by SARS-CoV infection and whether such autoimmunity contributes to SARS pathogenesis. ASOR is an abundant human serum glycoprotein and the ASOR-type complex carbohydrates are also expressed by other host glycoproteins [55, 56] . Thus, the human immune system is generally non-responsive to these "self " carbohydrate structures. However, when similar sugar moieties were expressed by a viral glycoprotein, their cluster configuration could differ significantly from those displayed by a cellular glycan, and in this manner generate a novel "non-self " antigenic structure. A documented example of such antigenic structure is a broad-range HIV-1 neutralization epitope recognized by a monoclonal antibody 2G12. This antibody is specific for a unique cluster of sugar chains displayed by the gp120 glycoprotein of HIV-1 [57] . It is, therefore, important to examine whether naturally occurring SARS-CoV expresses the Tri-II or mII-type autoimmune reactive sugar moieties. During SARS epidemic, the viruses replicate in human cells. Their sugar chain expression may differ from the monkey cell produced viral particles. Scanning of the serum antibodies of SARS patients using glycan arrays or other specific immunologic tools may provide information that can shed light on this question. In summary, the recent establishment of carbohydrate-based microarrays, and especially the availability of different technological platforms to meet the multiple needs of carbohydrate research [42, [58] [59] [60] [61] [62] [63] [64] , marks an important developmental stage of postgenomic research. It is our prediction that microarray-based broad-range binding assays may substantially extend the scope of current carbohydrate research, including lectin characterization and classification. We have described here a practical platform of carbohydrate microarrays and discussed a few examples to illustrate the application of this technology in lectin characterization. We also present an integrated experimental strategy to facilitate the identification of the glyco-epitopes in biological systems, such as a newly emerged viral pathogen, SARS-CoV. This research strategy is likely applicable for the exploration of complex carbohydrates that are differentially expressed by host cells, including human cancers and stem cells at various stages of differentiation. Glycoproteins Biology of glycoproteins The lectins, properties, functions and applications in biology and medicine