key: cord-0221112-1kam85k2
authors: Chen, Jiahui; Gao, Kaifu; Wang, Rui; Nguyen, Duc Duy; Wei, Guo-Wei
title: Review of COVID-19 Antibody Therapies
date: 2020-06-18
journal: nan
DOI: nan
sha: 02e67120300a55f1cc418eeb2de5cfee9980c5ba
doc_id: 221112
cord_uid: 1kam85k2

Under the global health emergency caused by coronavirus disease 2019 (COVID-19), efficient and specific therapies are urgently needed. Compared with traditional small-molecular drugs, antibody therapies are relatively easy to develop and as specific as vaccines in targeting severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and thus attract much attention in the past few months. This work reviews seven existing antibodies for SARS-CoV-2 spike (S) protein with three-dimensional (3D) structures deposited in the Protein Data Bank. Five antibody structures associated with SARS-CoV are evaluated for their potential in neutralizing SARS-CoV-2. The interactions of these antibodies with the S protein receptor-binding domain (RBD) are compared with those of angiotensin-converting enzyme 2 (ACE2) and RBD complexes. Due to the orders of magnitude in the discrepancies of experimental binding affinities, we introduce topological data analysis (TDA), a variety of network models, and deep learning to analyze the binding strength and therapeutic potential of the aforementioned fourteen antibody-antigen complexes. The current COVID-19 antibody clinical trials, which are not limited to the S protein target, are also reviewed.

Coronavirus disease 2019 pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has rapidly spread around the world. By June 17, 2020, more than 8.2 million individuals were infected and 443,000 fatalities had been reported. Currently, there are neither specific drugs nor effective vaccines available [7] . Traditional drug discovery involves a long and costly process, requiring more than 10 years to put an average drug on the market. Vaccine development typically takes more than one year. In contrast, developing potent SARS-CoV-2 specified antibodies that are produced from blood B cells in response to and counteracting SARS-CoV-2 antigens is a less time-consuming and more efficient strategy for combating the ongoing pandemic [7, 29, 34, 47, 54, 62, 70, 71, 78, 82, 85] . Antibody (Ab), also called immunoglobulin (Ig), is a large, Y-shaped protein that typically consists of two identical heavy chains and two identical light chains. A heavy chain can be separated into two regions, the constant region and the variable region. Moreover, each light chain has two successive domains, the constant domain and the variable domain. The two heavy and two light chains of an antibody are connected through disulfide bonds within the constant region [50] . An antibody binds the antigenic determinant (also called epitope) through the variable regions in the tips of the heavy and light chains. Each of these chains contains three complementarity determining regions (CDRs), which located in the tips of each variable domain. Most of the diversities between antibodies are generated within the CDRs, which determine the specificity of individuals of antibodies.

Benefit from the broad specificity of antibodies; antibody therapies have been proven to be a promising way to fight against cancer, autoimmune disease, neurological disorders, and infectious viruses such as HIV, Ebola, and Middle Eastern respiratory syndrome (MERS) [10, 56, 73] . Recently, several studies have shown that the convalescent plasma of SARS-CoV-2 patients, which contains neutralizing antibodies created by the adaptive immune response, can effectively improve patient survival rate [6, 8, 57] . However, plasma-based therapeutics cannot be produced on a large scale. Therefore, seeking potent industrial-scale antibody therapies becomes one of the most feasible strategies to fight against SARS-CoV-2. The spike (S) protein, a multi-functional molecular machine that binds the human cell receptor angiotensin-converting enzyme 2 (ACE2), has been taking as the target of neutralizing antibodies and the focus of therapeutic and vaccine design efforts [64] . Many researchers have reported the binding affinities of SARS-CoV-2 S protein in complex with antibody candidates and ACE2. However, these reported values may vary by two orders of magnitude for a given antibody due to different experimental methods, conditions, calibrations, and human errors, which hinders the development of antibody therapies for SARS-CoV-2. Therefore, developing a unified paradigm for ranking the potency of SARS-CoV-2 antibodies is a pressing need.

In this work, we review seven existing SARS-CoV-2 antibody therapeutic candidates in the literature. As molecular structures are able to reveal the molecular mechanism of antibody-antigen interactions, we only focus on the SARS-CoV-2 S protein antibodies that have three-dimensional (3D) structures released on the Protein Data Bank (PDB). Since antibodies may directly compete with ACE2 for binding to the S protein receptor-binding domain (RBD), the structure and binding affinity of ACE2 and S protein complexes are studied to understand the efficiency of antibodies. Moreover, since the S proteins of SARS-CoV and SARS-CoV-2 share 80% amino acid sequence identity [67] , SARS-CoV S protein antibodies potential candidates for COVID-19. Therefore, we review five existing SARS-CoV S protein antibodies and analyze their binding affinities with the SARS-CoV-2 S protein. Furthermore, we employ topological data analysis (TDA), artificial intelligence (AI), and a variety of network models to address literature controversy and provide a unified paradigm for ranking the potency of all antibodies. Finally, we review the clinical trials of COVID-19 antibody candidates. An antibody can be divided into different parts according to its functions. Specifically, the "arms" of the Y-shaped protein contain sites that can recognize and bind to specific antigens. This region of the antibody is called a Fab (fragment, antigen-binding) region and composed of one constant domain and one variable domain from each heavy (VL) and light chain (VH) of the antibody [50] . Figure 1 illustrates the structure of the antibody. The variable domain (Fv region) is the most important region for binding to antigens. To be specific, on light and heavy chains, complementarity-determining regions (CDRs) composed of three variable loops of β-strands are responsible for binding to a specific antigen. The CDRs are incredibly variable, allowing a large number of antibodies with slightly different tip structures, or antigen-binding sites, to exist. Each of these variants can bind to a different antigen so that the enormous diversity of antibody paratopes on the antigen-binding fragments allows the immune system to recognize an equally wide variety of antigens [41] . This antibody paratope diversity is generated by random recombination events of a set of gene segments that encode different antigen-binding sites (or paratopes), followed by random mutations in this area of the antibody gene to create further diversity [14, 40] . It has been estimated that humans generate about 10 billion different antibodies [22] . The base of the Y plays a role in modulating immune cell activity. This region is named an Fc (Fragment, crystallizable) region and is composed of two heavy chains. The Fc region ensures each antibody generates an appropriate immune response for a given antigen, by binding to a specific class of Fc receptors or other immune molecules. This process activates different physiological effects, including recognition of opsonized particles, lysis of cells, and degranulation of mast cells, basophils, and eosinophils [31, 77] .

In addition to conventional antibodies, camelids also produce heavy-chain-only antibodies (HCAbs). HCAbs, also referred to as VHHs, or nanobodies, contain a single variable domain (VHH) that makes up the equivalent antigen-binding fragment (Fab) of conventional immunoglobulin G (IgG) antibodies [27] . This single variable domain typically can acquire affinity and specificity for antigens comparable to conventional antibodies. VHHs can easily be constructed into multivalent formats and have higher thermal stability and chemostability than most antibodies do [12, 15, 25, 38, 55, 65] . Another advantage of VHHs is that they are less susceptible to steric hindrances than larger conventional antibodies [23] .

In immune systems, antibodies are generated and secreted by B cells, mostly differentiated B cells, including plasma cells or memory B cells. Antibodies have two physical forms, a membrane-bound form called the B-cell receptor (BCR), which is found to attach to the surface of B cells, and a soluble form that moves freely in the blood plasma. The BCR facilitates the activation and subsequent differentiation of B cells into either plasma or memory B cells. The activation of B cells has two mechanisms: T cell-dependent (TD) activation and T cell-independent (TI) activation [42] . In the TD activation, once a BCR binds a TD antigen, the antigen is taken up into the B cell through receptor-mediated endocytosis, degraded, and presented to T helper (TH) cells as peptide pieces in complex with major histocompatibility complex-II (MHC-II) molecules on the cell membrane [1] . TH cells recognize and bind these MHC-II-peptide complexes through their T cell receptor (TCR). Following TCR-MHC-II-peptide binding, T cells express the surface protein CD40L as well as cytokines such as IL-4 and IL-21. These signals promote B cell proliferation, immunoglobulin class switching, somatic hypermutation, and guide differentiation. Once receiving these signals, B cells are activated [11] . In the TI activation, T cells are absent and B cells receive signals from recognition and binding of a common microbial constituent to toll-like receptors (TLRs) or extensive crosslinking of BCRs to repeated epitopes on a bacterial cell [42] . The TI activation is rapid, but antibodies generated from it tend to have a lower affinity and are also less functionally versatile than those from TD activation [42] . After activated, B cells can be differentiated into plasma cells or memory B cells to generate and secrete antibodies. Memory B cells can even survive in a human body for years to remember the same antigen and trigger a fast response upon future exposure [3] .

Antibodies protect our health in four ways: first, their Fab regions can bind to pathogens, so that prevent pathogens from entering or damaging cells; second, they trigger the removal of pathogens by macrophages and other cells by coating the pathogen; third, they stimulate the destruction of pathogens by stimulating other immune responses such as the complement pathway [51] ; at last, antibodies can also lead to vasoactive amine degranulation against certain types of antigens such as helminths and allergens [31] .

The antibody mechanism enlightens the development of vaccines and antibody therapies. A vaccine is typically made of weakened or killed forms of the microbe, its toxins, or one of its surface proteins that resemble a disease-causing microorganism. These forms cannot cause an infection, but the immune system still regards them as a foreign object and produces antibodies in response. After the threat has passed, most of the antibodies will break down, but memory B cells remain and remember the antigens in the vaccine.

Antibody therapies were developed in the 1970s, following the discovery of the structures of antibodies and the development of hybridoma technology, which provided the first reliable source of monoclonal antibodies (mAbs) [4, 36] . Rather than extracted from convalescent patient plasma, mAbs are made from identical immune cells that are all clones of a unique parent cell so that they can have a monovalent affinity to the same epitope. As a result, the most significant advantage of mAbs over conventional small-molecular drugs is their high specificity, which facilitates precise action [30] . The second advantage is their long half-lives, which allows infrequent dosing [39] . Third, molecular engineering technologies have enabled the structure of mAbs to be fine-tuned for specific therapeutic actions and to minimize immunogenicity [26, 44, 49, 66] , thus enhancing their safety. This is reflected in mAbs having an approval rate of around 20% compared with 5% for new small-molecular entities [52, 53] . Finally, mAbs can be developed in a short time period, such as 5-6 months [35] . Currently, mAbs have already established their therapeutic and prophylactic efficacy against cancer, autoimmune disease, neurological disorders, and infectious viruses such as HIV, Ebola, and Middle Eastern respiratory syndrome (MERS) [10, 56, 73] . However, adverse effects are mostly relating to immunomodulation and infection can be associated with therapeutic mAbs [30] , such as antibody-dependent enhancement [63] and cytokine storm [9] .

Both SARS-CoV and SARS-CoV-2 belong to lineage B of the betacoronavirus genus and have four structural proteins, known as spike, envelope, membrane, and nucleocapsid proteins [81, 86] . The nucleocapsid protein holds the RNA genome. Together with membrane, spike, envelope, and membrane proteins create the viral envelope [80] . Among them, the S protein that forms homotrimers protruding from the viral surface mediates the entry of coronaviruses into host cells when binds with ACE2 [64] . More specifically, the S protein comprises two functional subunits: the S1 that is responsible for binding to the host cell receptor and the S2 that promotes the fusion of the virus and cellular membranes [68, 69] .

ACE2 is a single-pass transmembrane protein with its active domain exposed on the cell surface and is expressed in the lungs and other tissues [28] . ACE2 serves as the main cell entry point for SARS-CoV and SARS-CoV-2, and some other coronaviruses [70] . Notably, the equilibrium dissociation constant (Kd) of the binding between ACE2 and S protein has significantly increased from SARS-CoV to SARS-CoV-2 [62, 67] . Moreover, plus SARS-CoV-2 may also use basigin to assist in cell entry [72] . Therefore, SARS-CoV-2 is more infectious than SARS-CoV.

Antibody therapy is promising to fight against COVID-19. Figure 2 is a schematic illustration of antibody therapy for COVID-19. Notably, neutralizing monoclonal antibodies (mAbs) isolated from convalescent patient memory B cells provide effective intervention to SARS-CoV-2 due to their safety, scalability, and therapeutic effectiveness [6, 8, 57] . As S protein mediates the host cell entry, it is the target of neutralizing antibodies and the focus of therapeutic and vaccine design efforts [64] . Table 1 provides a summary of SARS-CoV-2 and SARS-CoV S protein RBDs in complexes with existing antibodies and ACE2 structures. The structures, functions, and properties of these complexes are analyzed below.

As summarized in Table 1 , twelve mAbs targeting the SARS-CoV-2 or SARS-CoV S-protein RBD are reported with their 3D experimental complex structures available in the Protein Data Bank. According to the literature reports, the most promising one is S309 [47] , which shows almost equally neutralization potency against both SARS-CoV and SARS-CoV-2. The authors state that 19 out of 24 residues of the S309's epitope are strictly unchanged from SARS-CoV and SARS-CoV-2, and the other 5 residues are conservatively or semi-conservatively substituted [47] . However, some other researchers are still concerned about the claimed cross-effectiveness against both SARS-CoV and SARS-CoV-2 [71] . Notably, two experimental structures of the S309 and SARS-CoV-2 S-protein complex are released with closed and open conformations of the S-protein, respectively. The binding affinity of S309 and S protein RBD complex is not sensitive to S protein close or open conformations [47] .

CR3022 is another potent antibody that may bind to both SARS-CoV and SARS-CoV-2 [62, 85] . It is also showed that compared with m396, a SARS-CoVspecific antibody, CR3022 has a significantly stronger binding signal to SARS-CoV-2. However, this affinity to SARS-CoV-2 is much weaker than the affinity to SARS-CoV [85] . It is also suggested that CR3022 can only access to the open conformation of the S-protein RBD [85] .

CB6 and P2B-2F6 are also promising SARS-CoV-2 antibodies, which are specific human mAbs extracted from convalescent COVID-19 patients [34, 58] . VHH-72 cross-reacts with SARS-CoV-2 and SARS-CoV Sproteins, but the affinity to SARS-CoV-2 is much lower than SARS-CoV [78] . B38 also shows a direct competition with ACE2 for binding to SARS-CoV-2 S-protein [82] .

All the available 3D structures of SARS-CoV-2 S protein RBD in complex of the antibodies are aligned to ACE2. Figures 3 and 4 show the alignment of SARS-CoV-2 and SARS-CoV antibodies, respectively. The PDB ID of these complexes can be found in Table 1 .

As revealed in Figure 3 , the antibodies CB6, B38, H11-D4, and P2B-2F6 have their epitopes overlapping with the ACE2 binding site, suggesting their bindings are in direct competition with that of ACE2. Theoretically, this direct competition reduces the viral infection rate. For an antibody with strong binding ability, it will directly neutralize SARS-CoV-2 without the need of antibody-dependent cell cytotoxicity (ADCC) and antibody-dependent cellular phagocytosis (ADCP) mechanisms. However, the binding sites of epitopes of S309 and CR3022 are away from that of ACE2, leading to the absence of binding competition [47, 62] . One study shows the ADCC and ADCP mechanisms contribute to the viral control conducted by S309 in infected individuals [47] . For CR3022, researches indicate that it neutralizes the virus in a synergistic fashion [60] . Figure 3 shows that on the SARS-CoV RBD, antibodies S230, 80R, F26G19, and m396 have their epitopes overlapping with ACE2. VHH-72 locates slightly away from the ACE2 binding site but still sterically clashes with the ACE2 binding. They all lead to binding-competitions to naturalize the virus. Figure 5 highlights the contact regions of antibody and ACE2 epitopes on SARS-CoV-2 RBD or SARS-CoV RBD 2D sequences. Consistent with the 3D alignment, except for S309, CR3022, and VHH-72, all the other antibodies have their epitopes overlapping with the ACE2 binding site, especially the residues from 486 to 505 of the SARS-CoV-2 RBD (corresponding to the residues 472 to 491 of the SARS-CoV RBD). Although the VHH-72 epitope residues do not overlap with the ACE2 binding site, VHH72 occupies parts of the space of the ACE2 binding site, which will disrupt the ACE2 binding with RBD. Therefore, VHH-72 also has a competitive binding ability against ACE2. Figure 5 also reveals that these epitope residues have many mutations from SARS-CoV-2 RBD to SARS-CoV RBD, which could explain why most of the antibodies lack cross-reaction to both SARS-CoV-2 and SARS-CoV. This aspect will be further studied in a later section. magnitude has been reported in experimental Kd values for ACE2 and SARS-CoV S-protein complex (i.e., 5.0 nM [67] , 325.8 nM [79] , and 1.70 nM [59] ).

The inconsistency mentioned above in experimental values is not isolated. The experimental Kd values for CR3022 binding with the SARS-CoV-2 S protein RBD were reported as 6.28 nM [62] and <0.1 nM [85] . This level of discrepancy in reported experimental values makes it impossible to select antibody candidates appropriately.

As shown in Table 1 , two binding assay techniques are used to measure Kd values of antibody-antigen interactions. The discrepancies in Kd values for CR3022 discussed above are based on biolayer interferometry (BLI) measurements. BLI detects the surface changes on biosensor tips induced by protein-protein association and dissociation by using analyzing the interference pattern of white light reflected from the surface. BLI results are sensitive to biosensor preparation, the stability of the light source, the temperature control, and the calibration. Surface plasmon resonance (SPR) has been employed for determining the Kd values of antibody and RBD complexes as shown in Table 1 . This method detected the reflectivity changed induced by molecular adsorption, such as polymers, DNA or proteins, etc. by changes in reflection angles. Similarly, SPR is also sensitive to the preparation of conjugated antigens, the stability of the light source, the temperature control, and the calibration. The widespread inconsistency in reported antibody and S protein binding affinities motivates us to carry out computational analysis of existing antibody-S protein complexes described below.

To have unified assessment and ranking of S protein RBD binding complexes with antibodies and ACE2, we utilize topological data analysis, graph theory, network models, and machine learning to analyze the 3D complexes presented in Table 1 . We also evaluate the re-purposing potentials of SARS-CoV antibodies for SARS-CoV-2 using the TopNetTree model [74] .

The prediction results and network descriptors are presented in Table 2 and Table 3 for SARS-CoV-2 complexes and SARS1 complexes, respectively. In Table 2 , the complexes are ranked according to their predicted binding affinities, ∆G, followed by FRI rigidity indexes [43, 83] which have the highest covariance. They are computed based on all the C α atoms on the RBD and all the C α atoms in antibodies or ACE2. The FRI rigidity index R η indicates the measurement of geometric compactness and topological connectivity of protein-protein interactions at each residue, such that more impact of the longer pairwise interactions for larger η. Comparing with the predicted energy, a strong binding affinity is corresponding to a large rigidity index. Then the summation of binding affinity changes computed with the TopNetTree model [74] by modifying the RBD residues to glycine (G) are presented while S 10 and S 8 stand for those residues within 10Å and 8Å to the binding site are included. Each mutation to glycine (G) is calculated by PPI machine learning model [74] , where a positive binding affinity changes ∆∆G means a stronger binding affinity of mutant and vice versa. Therefore, a summation of considered residues in the RBD with smaller values indicates a strong binding affinity of wild type.

The rest of the table gives the C α -based complex analysis from a number of network descriptors, including edge density (d), degree heterogeneity (ρ) [17] , average path length ( L ) [76] , average betweenness centrality ( C b ) [24] , average eigencentrality ( C e ) [2] , average subgraph centrality ( C s ) [21] , and network communicability ( M ) [19] . Except for the degree heterogeneity which is calculated based only on Table 2 : The graph network descriptor consisting of SARS-CoV-2 Spike protein RBD and ACE2 and antibodies. ∆G: the predicted binding affinity (kcal/mol) (the predictions are using the Prodigy web server [84] ); R 10 and R 8 : FRI rigidity index with η of 10 and 8, respectively; S 10 and S 8 : the summation of binding affinity changes (∆∆G kcal/mol) by changing RBD residues within 10Å and 8Å to the binding site to glycine; d: edge density; ρ: degree heterogeneity; L : average path length; C b : average betweenness centrality; Ce : average eigencentrality; Cs : average subgraph centrality; M : network communicability. the RBD C α atoms, other descriptors are calculated from all C α atoms on the RBD and antibody (or ACE2) C α atoms within 10Å from any C α atom on the RBD. The degree heterogeneity demonstrates antibody or ACE2 influence to the RBD, such that close degree heterogeneity numbers would have similar impacts. For example, molecules B38 and H11-D4 have degree heterogeneity values that are close to ACE2 as well as sharing the same receipt domain. As for other descriptors, the average betweenness centrality [24] and average eigencentrality [2] values are correlated quite well to the predicted binding affinities. Table 3 shows the results of predictions and network descriptors for the SARS-CoV S protein complex. Again, the predicted binding affinities have high correlations to FRI rigidity indexes. For degree heterogeneity, m396 has a similar impact as ACE2. In Table 3 , molecule 80R (PDB 2GHW) has the highest rigidity index both for η = 10 and η = 8 which indicates a more rigid complex structure between 80R and the RBD. In the comparison of SARS-CoV S protein RBD and SARS-CoV-2 S protein RBD in Table 2 , descriptors are close to each other except for the summation of binding affinity changes which includes more biological and chemical information than others. Thus, the network structures for SARS-CoV RBD and SARS-CoV-2 RBD complex are similar.

In this section, we predict the binding affinities of SARS-CoV antibodies when they are applied to SARS-CoV-2 neutralization. Specifically, we compute the binding affinity changes following the mutations from SARS-CoV RBD to SARS-CoV-2 RBD. These changes can be very significant. Study shows that a single mutation (V367F) can lead to a 10-fold increase in IC 50 for a particular antibody [54] .

In Figure 6 , the blue bars are predicted binding affinities of each SARS-CoV-2 complex, and the red bars are predicted binding affinities of each molecule with SARS-CoV-2 RBD, which is calculated by accumulating binding affinities of single mutations from SARS-CoV RBD to SARS-CoV-2 RBD and adding the sum to the binding affinities of SARS-CoV complexes. Obviously, molecule 80R (PDB: 2GHW) has the largest binding energy change in SARS-CoV ranking as well as in SARS-CoV-2 ranking among these SARS-CoV antibodies. Molecule VHH-72 (6WAQ) had a smaller binding affinity than ACE2(3D0G) does for SARS-CoV Table 3 : The graph network descriptor consisting of SARS-CoV Spike protein RBD and ACE2 and antibodies. ∆G: the predicted binding affinity (kcal/mol) (the predictions are using the Prodigy web server [84] ); R 10 and R 8 : FRI rigidity index with η of 10 and 8, respectively; S 10 and S 8 : the summation of binding affinity changes (∆∆G kcal/mol) by changing RBD residues within 10Å and 8Å to the binding site to glycine; d: edge density; ρ: degree heterogeneity; L : average path length; C b : average betweenness centrality; Ce : average eigencentrality; Cs : average subgraph centrality; M : network communicability. RBD while an equivalent binding affinity for SARS-CoV-2 RBD. Molecules m396, S230, and F26G19 have weaker binding affinities after modifying from SARS-CoV RBD to SARS-CoV-2 RBD. Finally, the binding affinity of SARS-CoV RBD with ACE2 following mutations to SARS-CoV-2 is slightly higher than the binding affinity of SARS-CoV RBD with ACE2, indicating that SARS-CoV-2 is more infectious than SARS-CoV. This is consistent with experimental reports [67, 79] . Figures 7 and 8 show the binding affinity changes on individual mutations on the SARS-CoV S protein RBD, where more precise trends can be observed. In Figure 7 , molecule, 80R, has a similar trend to ACE2 which shares the most receipting domain. In Figure 8 , most of the binding affinity changes following mutations in receptor binding motif (RBM) of ACE2 are negative, which indicates stronger binding affinities with SARS-CoV RBD. 

Various network models have been employed to analyze the structure and function of SRAS-CoV and SASR-CoV-2 main protease [18] . In this work, we utilize network models to illustrate interactions between the binding complexes of the RBD of SARS-CoV or SRAS-CoV-2 and antibodies or ACE2.

In the microscopy of each single residue, their performances on network models reveal the similarities and differences between complexes. In Figure 9 , SARS-CoV RBD with ACE2 (PDB: 3D0G), SARS-CoV-2 RBD with ACE2 (PDB: 6M0J), and SARS-CoV-2 RBD with CR3022 (PDB: 6W41) are listed and aligned where 6W41 has the strongest predicted binding affinity and the largest deviation to 6M0J as shown in Tables 2  and 3 . The C α atoms from the RBD are marked as a circle, and atoms from ACE2 or CR3022 are marked as a cube. In the first row of Figure 9 , it is interesting to observe all three complexes having similar domains that have high rigidity index values. In the comparison of betweenness centralities of three structures, though it is shown clearly that 3D0G has many large values, it has the lowest average betweenness centrality among the structures as shown in Tables 2 and 3 . Analogy to the rigidity index, all three complexes have the same regions of the individual eigencentrality values and subgraph centrality values. Overall, the differences between 3D0G and 6M0J are quite close in their network analysis. However, the betweenness centrality reflects their difference such that a higher average value would indicate a stronger binding affinity. Moreover, all three complexes coincidentally have similar regions of high values for network descriptors, which suggests that this region would play a key role in protein-protein interactions. Notably, most of the current COVID-19 antibody therapeutic candidates in clinical trials are aimed at other targets rather than the S protein. These antibodies were developed for other diseases initially and now repurposed to treat COVID-19, which could alleviate some COVID-19 symptoms such as cytokine storm and inflammation instead of killing the viruses.

Nonetheless, two antibody candidates under clinical trials are targeting at S protein and block the SARS-CoV-2 entry into human cells. One is LY3819253 developed by Eli Lilly and Company in the United States, which is in phase II clinical trial and already highlighted in TheScientist (https://www.the-scientist.com/n ews-opinion/first-antibody-trial-launched-in-COVID-19-patients--67604). The other is JS016 performed by Junshi Biosciences in China [58] , which is currently in phase I clinical trial. 6 Material and methods

All the sequences and 3D structures are downloaded from Protein Data Bank ( https://www.rcsb.org): the sequences are from the FASTA files, the 3D structures are from the pdb files.

The 3D alignments as well as the graphs are created by using PyMOL [13] . The 2D sequence alignment are calculated by clustalw ( https://www.genome.jp/tools-bin/clustalw) [61] , the 2D alignment graphs are generated by Jalview [75] .

In this section, the topology-based network tree (TopNetTree) is presented, which predicts binding affinity changes following mutation ∆∆G for PPIs [74] . This method is based on structures regarded as topological features and supervised machine learning model, gradient boosting tree (GBT), and convolution neural network (CNN). In Fig. 10 , the train and predicting process is elucidated that two major modules are applied the topology-based feature generation and a CNN-assisted GBT model. In feature generation, the element-and site-specific persistent homology is the key mathematical technique that can simplify the structural complexity of protein-protein complexes and translate the biological information into topological invariants. The first step of the TopNetTree process uses CNN as an assistant model to manipulate the topological features. Assembling CNN-pretrained features and other features, the last step of the GBT model predicts PPIs binding affinity changes. Other features such as chemical and physical information that have not been absorbed into topological features can improve the proposed model's predicting ability. More details are referred to in the literature [74] .

Persistence homology is the key mathematical theory behind the topology-based feature generation. As a subtopic of algebraic topology, persistence homology is built upon simplicial complex and filtration on discrete datasets under various settings. For example, the set of atoms in protein-protein interactions forms the discrete dataset. When building the constructions, a variety of simplicial complex built on point clouds such that Vietoris-Rips (VR) complex and alpha complex are widely used [16] which are applied in our approach. After a simplicial complex set up, the topological invariants of the point-cloud dataset can be identified and are enumerated by counting the numbers referred to as Betti-0 (H 0 ), Betti-1 (H 1 ), and Betti-2 (H 2 ) for components, rings, and cavities of dataset, respectively. Obviously, the complexity protein-protein structure is simplified as its geometric and topological characteristics for data features, while redundant and uninformative features or calculations are fully abandoned. Moreover, making filtration on simplicial complex turns the 3D point-cloud dataset of PPIs into topological barcodes, which record the "birth" and "death" of each topological invariants. The topological features simplify the PPI-complex in many directions. However, it is also essential to have better construction to reflect different biological or chemical properties. Various subsets are defined as following for PPI complex constructions.

A m : atoms of the mutation sites.

A mn (r): atoms in the neighbourhood of the mutation site within a cut-off distance r.

A Ab (r): antibody atoms within r of the binding site.

A Ag (r): antigen atoms within r of the binding site.

A ele (E): atoms in the system that has atoms of element type E.

Therefore, the distance matrix is defined based on atom sets such that it excludes the interactions in the same set. For interactions between atoms a i and a j in set A and/or set B, the modified distance is defined as

where D e (a i , a j ) is the Euclidian distance between a i and a j . Next, the persistence homology can be constructed element-and site-specific.

Given atomic coordinates, their topological analysis and properties can be carried out via simplices and simplicial complexes. A set of k +1 affinely independent points, v 0 , v 1 , v 2 , ..., v k in R n is a k-simplex denoted σ i , such that a 0-, 1-, 2-, or 3-simplex in geometry representation is a vertex, an edge, a triangle, or a tetrahedron, respectively. The finite collection of simplex is a simplicial complex K = {σ i }, which is true if a subset (also called as face) τ of a k-simplex σ i of K is also in K, τ ⊆ σ i and σ i ∈ K imply τ ∈ K and the non-empty intersection of any two simplices in K is a face of both. Furthermore, a k-chain is a finite formal sum of all simplices in K, i α i σ k i , where α i is coefficients in Z p and p is a chosen prime number. The set of all k-chains of the simplicial complex K equipped with an algebraic field forms an abelian gourp C k (K, Z p ).

A boundary operator ∂ k : C k → C k−1 for a k-simplex σ k is homomorphism defined as

where {v 0 , v 1 , · · · ,v i , · · · , v k } is a (k −1)-simplex excluding v i from the vertex set. An important property of boundary operator, ∂ k−1 ∂ k = ∅, follows from that boundaries are boundaryless. Moreover the kernel of boundary operator is Z k = ker∂ k = {c ∈ C k | ∂ k c = ∅}, whose elements are called k-cycles; and the kth boundary group is the image of ∂ k+1 denoted as B k = im ∂ k+1 = {∂ k+1 c | c ∈ C k+1 }. The algebraic construction to connect a sequence of complexes by boundary maps is called a chain complex

and the kth homology group is the quotient group defined by H k = Z k /B k . Obviously, boundary operators imply B k ⊆ Z k ⊆ C k . The Betti numbers are defined by the number of basis in kth homology group H k which counts k-dimensional holes. For example, Betti 0, β 0 = rank(H 0 ) reflects the number of connected components; Betti 1, β 1 = rank(H 1 ) reflects the number of loops; and Betti 2, β 2 = rank(H 2 ) reveals the number of voids or cavities. Together, the set of Betti numbers {β 0 , β 1 , β 2 , · · · } indicates the intrinsic topological property of a system. Computational, boundary operators directly work on the distance matrices generated on different atom groups, and the Betti number can be calculated by counting the number of zero eigenvalues of corresponding boundary operators.

It is interested in the evolution of a simplicial complex and to track topological characteristics varying as the simplicial complex changes. In persistent homology, a filtration of a topology space K is a nested sequence of subspaces {K t } t=0,...,m of K such that ∅ = K 0 ⊆ K 1 ⊆ K 2 ⊆ · · · ⊆ K m = K. Considering the complex group on this sequence, we can have a sequence of chain complexes by homomorphisms: C * (K 0 ) → C * (K 1 ) → · · · → C * (K m ) and a corresponding homology sequence: H * (K 0 ) → H * (K 1 ) → · · · → H * (K m ). The p-persistent kth homology group of K t is defined as H t,p

Hence, the homology group reveals that the homology classes of K t persist until K t+p . In the filtration process, the persistent homology barcodes recording the "birth" and "death" of topological invariants can be generated along the spacial changing of radius on point-cloud dataset. The machine learning feature vectors, as consequences, can be constructed from theses sets of filtration barcode intervals.

The filtration parameter interval is discretized into bins, which can model the behavior of barcodes in each bin [5] . Thus, these bins are packaged as features for advanced machine learning algorithms directly. Then the number of persistence intervals is counted for each bin in order to record birth events and death events. Three feature vectors (H 0 , H 1 , and H 2 ) are generated for each topological barcode for the machine learning method. Betti-0 (H 0 ) barcode is obtained from the VR filtration and Betti-1 (H 1 ), and Betti-2 (H 2 ) barcode is obtained from alpha complex filtration, where Betti-1 and Betti-2 are sparser and more stable than Betti-0 barcodes. Thus, Betti-0 barcode is incorporated into CNN models, and Betti-1 and Betti-2 barcodes are for GBT training. Intuitively, features generated by binned barcode vectorization can reflect the structure of the protein-protein complex and its biological and chemical properties, such as the strength of atom bonds, van der Waals interactions. Meanwhile, taking the statistics of bar lengths, birth values, and death values, such as maximum, minimum, mean, etc., can be set as features for the machine learning process.

To predict the binding affinity changes following mutations for PPIs is very challenging due to the complex dataset and different 3D structures. To overcome this challenge, a hybrid machine learning algorithm which integrates a CNN and GBT to predict the binding affinity changes. The vectorized H 0 barcode feature is converted into concise features by the CNN method. Then combine CNN-trained features and the rest features as the full feature set to train a GBT module for a robust predictor with effective control of overfitting.

CNN is considered as the most successful architectures as a class of deep neural networks. CNN is a regularized case of a multilayer connected neural network. Each neuron is connected locally to the next convolution layer neurons, and the weights are shared in different locations. In TopNetTree, CNN is an intermediate model that applies vectorized H 0 features into a higher-level abstract feature for the gradient boosting tree method. Next, the GBT is an ensemble method that builds a powerful module for regression and classification problems as weak learners. The method sums the weak learners to eliminate the overall error based on the assumption that each learner is likely to make different mistakes. According to the current prediction error on the training dataset, the ensemble method is built upon a decision tree structure. GBT with topological features (TopGBT) is relatively robust against hyperparameter tuning and overfitting, and is suitable for a moderate number of features. The current work uses the GBT package provided by scikit-learn (v 0.23.0) [46] .

Finally, TopNetTree follows the process (Fig. 10 ) that a supervised CNN model is trained for extracting high-level features from H 0 barcodes, where the PPI ∆∆G is a label. Then the flatten layer neural outputs of CNN are ranked as their importance in a GBT model. Based on the importance, the whole features consist of an ordered subset of CNN-trained features, high-dimensional topological barcodes, H 1 and H 2 , and auxiliary features for the final GBT model. As for the parameters, an optimal parameter setting with the best result of the 10-fold evaluation is selected from the experiments of different parameter settings. The proposed TopNetTree method is trained on the SKEMPI 2.0 dataset [33] , which has 4,169 variants in 319 different complexes. A set "S8338" with 8,338 variants was derived from SKEMPI 2.0 dataset by setting the reverse mutation energy changes to the negative values of its original energy changes. To address the reliability of the TopNetTree method, we did the tenfold cross-validation on the SKEMPI 2.0 dataset with the averaged training accuracy, Pearson correlation coefficients R p , Kendall's τ , and the root mean square error (RMSE), being 0.98, 0.89, and 0.37 kcal/mol. As shown in Table 5 , these metrics are based on the average of ten ten-fold cross-validations, which indicate TopNetTree is well trained. The performance test of tenfold cross-validation on dataset gives as R p = 0.84, τ = 0.60, and RMSE = 1.06 kcal/mol, which is of the same level of accuracy as the best in the literature [74] .

Graph networks represent interactions between pairs of units in biomolecular systems. The quantify unique characteristics of the networks can be measured for descriptions and comparisons of different networks. Considering the protein-protein interactions as networks, each descriptor evaluates the network properties and measures how proteins connect. For instance, a fixed domain of spike protein RBD and antibodies forms a network, where residues from 320 to 518 on SARS-CoV and residue from 329 to 530 on SARS-CoV-2 are considered in terms of C α atoms. As aforementioned interaction subsets or similar subsets are defined for C α of each amino acid as following.

1. C Ab (r): antibody C α atoms within rÅ of the binding site, where r = ∞ is for all C α atoms on antibody.

2. C Ag (r): antigen C α atoms within rÅ of the binding site, where r = ∞ is for all C α atoms on antigen.

With these definitions, network descriptors are defined below.

The FRI rigidity index is a great tool to illustrate the elasticity between atoms for molecular interaction prediction [43, 83] . The molecular rigidity index is defined as a summation of all the atomic rigidity index µ η,i as

where r i are atom positions, N AB and N AG are the numbers of atoms of antibody and antigen, respectively, and r = ∞ for all C α atoms such that C Ab (∞) and C Ag (∞). TThe molecular rigidity index is used to describe the behavior of the dynamics and elastostatics of the biomolecular elasticity where η controls the influence range between atoms. In PPIs, the elasticity between antibody and antigen, especially long range impacts, is studied by calculating the FRI index of the network consisting of C α atoms.

The degree heterogeneity is an index that evaluates the heterogeneity of a network on different distribution [17] . The degree distribution k i is the number of i-th nodes that have k i connections to other nodes. Therefore, the degree heterogeneity reflects the distributions of a network on different impacts, which is defined as

Here, N e represents for the number of edges. In our case, we study two networks consisting of all C α atoms in C Ag (∞), that one network consists of C α atoms from SARS-CoV RBD and SARS-CoV-2 RBD. The degree heterogeneity illustrate the impacts of ACE2 or antibodies to these networks.

The rest descriptors are build on the network consist of C α atoms from C Ag (∞) and C Ab (10).

The edge density is defined as

where N e is the number of edges and N v is the number of vertices for C α atoms from C Ag (∞) and C Ab (10) .

The edge density is also called the average degree centrality. For a complete network in which each every pair of network vertices is connected, the edge density is equal to one. A non-complete network has an edge density smaller than one. With the same number of residues in RBD for each PPI, a higher edge density stands for a firmly connection between RBD and ACE2 or antibodies.

The characteristic path length studies the typical separation between two vertices in the network. It was used to study infectious diseases spread in so called "small-world" networks [76] . The shortest path distance d(i, j) is defined as the shortest path between the corresponding pairs of vertex i and j. In protein-protein interactions, the path length between two atoms reflects how ACE2 or antibodies connect to RBD. The average path length is defined as

for C α atoms from C Ag (∞) and C Ab (10). Here, N v represents for the number of vertices.

Average betweenness centrality The concept of betweenness centrality illustrates communications in a network [24] . The betweenness centrality of a vertex v k is given as

Nv j=i+1 g ij (v k )/g ij (6) and the average betweenness centrality is given as

where g ij (v k ) is defined as the number of geodesics linking between vertex v i and v j that passes v k , and g ij considers all the paths between v i and v j . And N v means the number of vertices.

Average eigencentrality The eigenvector centrality are the elements of the eigenvector V max with respect to the largest eigenvalue of the adjacency matrix A [2] . It describes the probability of starting and returning at the same point for infinite length walks. Thus the average eigenvector centrality is,

where e i are elements of V max . It stands for the average impact spread of vertices beyond its neighborhood for an infinite walk.

Average subgraph centrality The following descriptors are built on the exponential of the adjacency matrix, E = e A . The average subgraph centrality is defined as

which indicates the vertex participating in all subgraphs of the graphs [18, 21] . Here, E(k, k) means the element located at the k-th row and k-th column. Subgraph centrality is the summation of weighted closed walks of all lengths starting and ending at same node. The long path length has a small contribution.

Average communicability Finally, the last two descriptors are average communicability given as

and average communicability angle given as

where θ(i, j) = arccos

and E is the exponential of the adjacency matrix. The average communicability measures how much two vertices can communicate by using all the possible paths in the network, where the shorter path has more weight [19] . The average communicability angle evaluates the efficiency of two vertices passing impacts to each other in the network with all possible paths [18, 20] .

Currently, developing effective therapies for combating Coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) becomes a vital task for human health and the world economy. Although designing new anti-SARS-CoV-2 drugs is of paramount importance, traditional drug discovery might take many years. Effective vaccines typically require more than a year to develop. Therefore, a more efficient strategy in fighting against COVID-19 is to look for antibody therapies, which is a relatively easier technique compared to the development of small-molecular drugs or vaccines. Seeking possible antibody drugs has attracted increasing attention in recent months. Moreover, complementarity determining regions (CDRs) which located in the tip of the antibody, determine the specificity of antibodies and make the antibody therapies a promising way to fight against COVID-19. We analyze the structure, function, and therapeutic potential of seven SARS-CoV-2 neutralizing antibody candidates that have three-dimensional (3D) structures available in the Protein Data Bank. In a comparative study, we also review five antibody structures associated with SARS-CoV and angiotensin-converting enzyme 2 (ACE2) complexes with both SARS-CoV-2 and SARS-CoV spike proteins. The multiple order of magnitude discrepancies in reported experimental binding affinities motivates us to carry a systematic computational analysis of the aforementioned fourteen complexes. Using computational topology, machine learning, and wide class network models, we put all of the complexes in an equal footing to evaluate binding and interactions. Additionally, we have predicted binding affinities of five SARS-CoV antibodies when they are used as SARS-COV-2 therapies. Finally, we summarize all of the currently ongoing clinical antibody trails for SARS-CoV-2, which have many targets, including the S protein. In a nutshell, we provide a review of existing antibody therapies for COVID-19 and introduce many theoretical models to rank the potency and analyze the properties of antibodies.

Supporting Materials are available for S1: Network analysis of antibody-antigen complexes and S2: Binding affinity changes following mutations.

Pathways of antigen processing

Power and centrality: A family of measures

From B cell to plasma cell

Therapeutic monoclonal antibodies. The Lancet

Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

COVID-19: immunopathology and its implications for therapy

Potent neutralizing antibodies against SARS-CoV-2 identified by high-throughput single-cell sequencing of convalescent patients' B cells

Convalescent plasma as a potential therapy for COVID-19

The advent of the cytokine storm

Protective monotherapy against lethal Ebola virus infection by a potently neutralizing antibody

A brief history of t cell help to B cells

Single-domain antibodies and their formatting to combat viral infections

Pymol: An open-source molecular graphics tool

Somatic immunoglobulin hypermutation

Single-domain antibody fragments with high conformational stability

Topological persistence and simplification

Quantifying network heterogeneity

Topological analysis of SARS-CoV-2 main protease

Communicability in complex networks

Communicability angle and the spatial efficiency of networks

Subgraph centrality in complex networks

Development of the immunoglobulin repertoire

Llama antibody fragments with cross-subtype human immunodeficiency virus type 1 (HIV-1)-neutralizing properties and high affinity for HIV-1 gp120

Centrality in social networks conceptual clarification

Dual beneficial effect of interloop disulfide bond for single domain antibody fragments

Therapeutic antibodies delivering the promise?

Naturally occurring antibodies devoid of light chains

Tissue distribution of ACE2 protein, the functional receptor for SARS coronavirus. a first step in understanding SARS pathogenesis

An alpaca nanobody neutralizes SARS-CoV-2 by blocking receptor interaction

The safety and side effects of monoclonal antibodies

Complement and Fc-receptors in regulation of the antibody response

Structural basis of neutralization by a human anti-severe acute respiratory syndrome spike protein antibody, 80R

Skempi 2.0: an updated benchmark of changes in protein-protein binding energy, kinetics and thermodynamics upon mutation

Human neutralizing antibodies elicited by SARS-CoV-2 infection

Developing therapeutic monoclonal antibodies at pandemic pace

Continuous cultures of fused cells secreting antibody of predefined specificity

Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor

Universal protection against influenza infection by a multidomain antibody to influenza hemagglutinin

Protein therapeutics: a summary and pharmacological classification

V (D) J recombination and the evolution of the adaptive immune system

Structure, function and properties of antibody binding sites

Janeway's immunobiology. Garland science

Generalized flexibility-rigidity index

Historical development of monoclonal antibody therapeutics

Structural insights into immune recognition of the severe acute respiratory syndrome coronavirus S protein receptor binding domain

Scikit-learn: Machine learning in python

Structural and functional analysis of a potent sarbecovirus neutralizing antibody

Structure of severe acute respiratory syndrome coronavirus receptor-binding domain complexed with neutralizing antibody

Molecular engineering and design of therapeutic antibodies

Primary structure of a human IgA1 immunoglobulin. IV. streptococcal IgA1 protease, digestion, Fab and Fc fragments, and the complete amino acid sequence of the alpha 1 heavy chain

Igg fc receptors

Anti-infective monoclonal antibodies: perils and promise of development

Monoclonal antibody successes in the clinic

Rapid isolation of potent SARS-CoV-2 neutralizing antibodies and protection in a small animal model

Fusion of hIgG1-Fc to 111In-anti-amyloid single domain antibody fragment VHH-pa2H prolongs blood residential time in APP/PS1 mice but does not increase brain uptake

Broad diversity of neutralizing antibodies isolated from memory B cells in HIV-infected individuals

Treatment of 5 critically ill patients with COVID-19 with convalescent plasma

A human neutralizing antibody targets the receptor binding site of SARS-CoV-2

Potent neutralization of severe acute respiratory syndrome (SARS) coronavirus by a human mAb to S1 protein that blocks receptor association

Human monoclonal antibody combination against SARS coronavirus: synergy and coverage of escape mutants

Multiple sequence alignment using ClustalW and ClustalX

Potent binding of 2019 novel coronavirus spike protein by a SARS coronavirus-specific human monoclonal antibody. Emerging microbes & infections

Antibody-dependent enhancement of virus infection and disease

Structural insights into coronavirus entry

Comparison of physical chemical properties of llama VHH antibody fragments and mouse monoclonal antibodies

Campath: from concept to clinic

Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein

Cryoelectron microscopy structure of a coronavirus spike glycoprotein trimer

Tectonic conformational changes of a coronavirus spike glycoprotein promote membrane fusion

Unexpected receptor functional mimicry elucidates activation of coronavirus fusion

Human igg neutralizing monoclonal antibodies block SARS-CoV-2 infection

SARS-CoV-2 invades host cells via a novel route: CD147-spike protein

Importance of neutralizing monoclonal antibodies targeting multiple antigenic sites on the middle east respiratory syndrome coronavirus spike glycoprotein to avoid neutralization escape

A topology-based network tree for the prediction of protein-protein binding affinity changes following mutation

Jalview version 2: a multiple sequence alignment editor and analysis workbench

Collective dynamics of "small-world" networks

Human antibody-Fc receptor interactions illuminated by crystal structures

Structural basis for potent neutralization of betacoronaviruses by single-domain camelid antibodies

Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation

Analysis of therapeutic targets for SARS-CoV-2 and discovery of potential drugs by computational methods

A new coronavirus associated with human respiratory disease in China

A noncompeting pair of human neutralizing antibodies block COVID-19 virus binding to its receptor ACE2

Multiscale multiphysics and multidomain modelsflexibility and rigidity

Prodigy: a web server for predicting the binding affinity of protein-protein complexes

A highly conserved cryptic epitope in the receptor binding domains of SARS-CoV-2 and SARS-CoV

A pneumonia outbreak associated with a new coronavirus of probable bat origin

This work was supported in part by NIH grant GM126189, NSF Grants DMS-1721024, DMS-1761320, and IIS1900473, Michigan Economic Development Corporation, Bristol-Myers Squibb, and Pfizer. The authors thank The IBM TJ Watson Research Center, The COVID-19 High Performance Computing Consortium, NVIDIA, and MSU HPCC for computational assistance.